GENERATING NOVEL IMAGES USING SKETCH IMAGE REPRESENTATIONS

    Publication No.: US20230419551A1

    Publication date: 2023-12-28

    Application No.: US17808261

    Filing date: 2022-06-22

    Applicant: Adobe Inc.

    CPC classification number: G06T9/001 G06T9/008 G06T7/10

    Abstract: Techniques for generating a novel image using tokenized image representations are disclosed. In some embodiments, a method of generating the novel image includes generating, via a first machine learning model, a first sequence of coded representations of a first image having one or more features; generating, via a second machine learning model, a second sequence of coded representations of a sketch image having one or more edge features associated with the one or more features; predicting, via a third machine learning model, one or more subsequent coded representations based on the first sequence of coded representations and the second sequence of coded representations; and based on the subsequent coded representations, generating, via the third machine learning model, a first portion of a reconstructed image having one or more image attributes of the first image, and a second portion of the reconstructed image associated with the one or more edge features.
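The data flow in this abstract can be illustrated with a minimal sketch. It assumes that "coded representations" are indices into a learned codebook (nearest-neighbor vector quantization) and that the third model is autoregressive; the abstract confirms neither. The names `quantize`, `build_context`, and `generate`, the toy codebooks, and the placeholder `toy_next_token` function are all hypothetical stand-ins, not the patented models.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]


def quantize(vectors: List[Vector], codebook: List[Vector]) -> List[int]:
    """Map each feature vector to the index of its nearest codebook entry."""
    def sq_dist(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [
        min(range(len(codebook)), key=lambda k: sq_dist(v, codebook[k]))
        for v in vectors
    ]


def build_context(image_tokens: List[int], sketch_tokens: List[int],
                  image_vocab_size: int) -> List[int]:
    """Concatenate both sequences, shifting sketch tokens into their own id range."""
    return list(image_tokens) + [image_vocab_size + t for t in sketch_tokens]


def generate(context: List[int], next_token: Callable[[List[int]], int],
             steps: int) -> List[int]:
    """Predict `steps` subsequent tokens, feeding each prediction back in."""
    seq = list(context)
    predicted = []
    for _ in range(steps):
        tok = next_token(seq)
        predicted.append(tok)
        seq.append(tok)
    return predicted


# Toy codebooks standing in for the first and second models (assumptions).
image_codebook = [(0.0, 0.0), (1.0, 1.0)]
sketch_codebook = [(0.0,), (1.0,)]

image_tokens = quantize([(0.1, 0.2), (0.9, 0.8)], image_codebook)
sketch_tokens = quantize([(0.9,), (0.2,)], sketch_codebook)
context = build_context(image_tokens, sketch_tokens, len(image_codebook))


# Placeholder for the third model: a real system would use a trained network.
def toy_next_token(seq: List[int]) -> int:
    return len(seq) % len(image_codebook)


new_tokens = generate(context, toy_next_token, steps=3)
```

In a real system the predicted tokens would then be decoded back into pixels; that decoding step is omitted here because the abstract does not describe it.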

    IMAGE SEGMENTATION USING TEXT EMBEDDING
    Invention publication

    Publication No.: US20230206525A1

    Publication date: 2023-06-29

    Application No.: US18117155

    Filing date: 2023-03-03

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.
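The convolution step described in this abstract can be sketched as follows. The sketch assumes the text embedding acts as a 1x1 convolution kernel over the image feature map, which reduces to a per-pixel dot product, and that the segmentation mask comes from thresholding the resulting scores. The abstract specifies neither the kernel size nor how the mask is derived, so both are assumptions; the function name `segment` and the toy values are hypothetical.

```python
from typing import List, Sequence


def segment(feature_map: List[List[Sequence[float]]],
            text_embedding: Sequence[float],
            threshold: float = 0.0) -> List[List[int]]:
    """Score each pixel's feature vector against the text embedding.

    feature_map is H x W x D; text_embedding has length D. A 1x1
    convolution with the embedding as kernel equals this dot product.
    """
    mask = []
    for row in feature_map:
        mask_row = []
        for pixel in row:
            score = sum(f * t for f, t in zip(pixel, text_embedding))
            # Assumed decision rule: pixels scoring above the threshold match the query.
            mask_row.append(1 if score > threshold else 0)
        mask.append(mask_row)
    return mask


# Toy 2 x 2 feature map with 2-dimensional features (made-up values).
features = [
    [(1.0, 0.0), (0.0, 1.0)],
    [(0.8, 0.1), (-1.0, 0.0)],
]
query_embedding = (1.0, -0.5)

mask = segment(features, query_embedding, threshold=0.5)
```

The dot product is only meaningful because, as the abstract states, the text embedding and the image representation share the same embedding space.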
