Image manipulation by text instruction

    公开(公告)号:US11562518B2

    公开(公告)日:2023-01-24

    申请号:US17340671

    申请日:2021-06-07

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    IMAGE MANIPULATION BY TEXT INSTRUCTION

    公开(公告)号:US20210383584A1

    公开(公告)日:2021-12-09

    申请号:US17340671

    申请日:2021-06-07

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    Image manipulation by text instruction

    公开(公告)号:US11900517B2

    公开(公告)日:2024-02-13

    申请号:US18085487

    申请日:2022-12-20

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    IMAGE MANIPULATION BY TEXT INSTRUCTION
    4.
    发明公开

    公开(公告)号:US20230177754A1

    公开(公告)日:2023-06-08

    申请号:US18085487

    申请日:2022-12-20

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

    IMAGE MANIPULATION BY TEXT INSTRUCTION
    5.
    发明公开

    公开(公告)号:US20240212246A1

    公开(公告)日:2024-06-27

    申请号:US18400629

    申请日:2023-12-29

    Applicant: Google LLC

    Abstract: A method for generating an output image from an input image and an input text instruction that specifies a location and a modification of an edit applied to the input image using a neural network is described. The neural network includes an image encoder, an image decoder, and an instruction attention network. The method includes receiving the input image and the input text instruction; extracting, from the input image, an input image feature that represents features of the input image using the image encoder; generating a spatial feature and a modification feature from the input text instruction using the instruction attention network; generating an edited image feature from the input image feature, the spatial feature and the modification feature; and generating the output image from the edited image feature using the image decoder.

Patent Agency Ranking