Image segmentation using text embedding

    公开(公告)号:US12008698B2

    公开(公告)日:2024-06-11

    申请号:US18117155

    申请日:2023-03-03

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.

    Systems and methods for generating text descriptive of digital images

    公开(公告)号:US11687714B2

    公开(公告)日:2023-06-27

    申请号:US16998730

    申请日:2020-08-20

    CPC classification number: G06F40/279

    Abstract: Disclosed are computer-implemented methods and systems for generating text descriptive of digital images, comprising using a machine learning model to pre-process an image to generate initial text descriptive of the image; adjusting one or more inferences of the machine learning model, the inferences biasing the machine learning model away from associating negative words with the image; using the machine learning model comprising the adjusted inferences to post-process the image to generate updated text descriptive of the image; and processing the generated updated text descriptive of the image outputted by the machine learning model to fine-tune the updated text descriptive of the image.

    IMAGE SEGMENTATION USING TEXT EMBEDDING

    公开(公告)号:US20220156992A1

    公开(公告)日:2022-05-19

    申请号:US16952008

    申请日:2020-11-18

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

    TEXT STYLE AND EMPHASIS SUGGESTIONS

    公开(公告)号:US20220138402A1

    公开(公告)日:2022-05-05

    申请号:US17090055

    申请日:2020-11-05

    Applicant: ADOBE INC.

    Abstract: Embodiments provide systems, methods, and computer storage media for text style suggestions and/or text emphasis suggestions. In an example embodiment, an electronic design application provides a text style suggestion tool that generates text style suggestions to stylize a selected text element based on the context of the design. A text emphasis tool allows a user to select a text element and generate text emphasis suggestions for which words should be emphasized with a different text styling. Various interaction elements allow the user to iterate through the suggestions. For example, a set of style suggestions may be mapped to successive rotational increments around a style wheel, and as the user rotates through the positions on the style wheel, a corresponding text style suggestion is previewed and/or applied.

    GENERATING CONTEXTUAL TAGS FOR DIGITAL CONTENT

    公开(公告)号:US20210034657A1

    公开(公告)日:2021-02-04

    申请号:US16525366

    申请日:2019-07-29

    Applicant: Adobe Inc.

    Abstract: Systems, methods, and non-transitory computer-readable media are disclosed for determining multi-term contextual tags for digital content and propagating the multi-term contextual tags to additional digital content. For instance, the disclosed systems can utilize search query supervision to determine and associate multi-term contextual tags (e.g., tags that represent a specific concept based on the order of the terms in the tag) with digital content. Furthermore, the disclosed systems can propagate the multi-term contextual tags determined for the digital content to additional digital content based on similarities between the digital content and additional digital content (e.g., utilizing clustering techniques). Additionally, the disclosed systems can provide digital content as search results based on the associated multi-term contextual tags.

    SYSTEMS AND METHODS FOR GENERATING TEXT DESCRIPTIVE OF DIGITAL IMAGES

    公开(公告)号:US20230315988A1

    公开(公告)日:2023-10-05

    申请号:US18315391

    申请日:2023-05-10

    Applicant: Adobe Inc.

    CPC classification number: G06F40/279

    Abstract: Disclosed are computer-implemented methods and systems for generating text descriptive of digital images, comprising using a machine learning model to pre-process an image to generate initial text descriptive of the image; adjusting one or more inferences of the machine learning model, the inferences biasing the machine learning model away from associating negative words with the image; using the machine learning model comprising the adjusted inferences to post-process the image to generate updated text descriptive of the image; and processing the generated updated text descriptive of the image outputted by the machine learning model to fine-tune the updated text descriptive of the image.

    IMAGE SEGMENTATION USING TEXT EMBEDDING
    10.
    发明公开

    公开(公告)号:US20230206525A1

    公开(公告)日:2023-06-29

    申请号:US18117155

    申请日:2023-03-03

    Applicant: Adobe Inc.

    Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.

Patent Agency Ranking