Patent search ap:("ADOBE INC.") AND inv:"Midhun Harikumar" Page 2

11.

发明申请
IMAGE SEGMENTATION USING TEXT EMBEDDING 有权

公开(公告)号：US20220156992A1

公开(公告)日：2022-05-19

申请号：US16952008

申请日：2020-11-18

Applicant: Adobe Inc.

Inventor： Midhun Harikumar , Pranav Aggarwal , Baldo Faieta , Ajinkya Kale , Zhe Lin

IPC: G06T11/60 , G06T7/11 , G06T7/162

Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

12.

发明授权
Generating novel images using sketch image representations 有权

公开(公告)号：US12299939B2

公开(公告)日：2025-05-13

申请号：US17808261

申请日：2022-06-22

Applicant: Adobe Inc.

Inventor： Midhun Harikumar , Pranav Aggarwal , Ajinkya Gorakhnath Kale

IPC: G06T7/10 , G06T9/00

Abstract: Techniques for generating a novel image using tokenized image representations are disclosed. In some embodiments, a method of generating the novel image includes generating, via a first machine learning model, a first sequence of coded representations of a first image having one or more features; generating, via a second machine learning model, a second sequence of coded representations of a sketch image having one or more edge features associated with the one or more features; predicting, via a third machine learning model, one or more subsequent coded representations based on the first sequence of coded representations and the second sequence of coded representations; and based on the subsequent coded representations, generating, via the third machine learning model, a first portion of a reconstructed image having one or more image attributes of the first image, and a second portion of the reconstructed image associated with the one or more edge features.

13.

发明申请
STYLE-BASED IMAGE GENERATION 有权

公开(公告)号：US20250117973A1

公开(公告)日：2025-04-10

申请号：US18903151

申请日：2024-10-01

Applicant: ADOBE INC.

Inventor： Fengbin Chen , Midhun Harikumar , Ajinkya Gorakhnath Kale , Hareesh Ravi , Venkata Naveen Kumar Yadav Marri

IPC: G06T11/00

Abstract: A method, apparatus, non-transitory computer readable medium, and system for media processing includes obtaining a text prompt and a style input, where the text prompt describes image content and the style input describes an image style, generating a text embedding based on the text prompt, where the text embedding represents the image content, generating a style embedding based on the style input, where the style embedding represents the image style, and generating a synthetic image based on the text embedding and the style embedding, where the text embedding is provided to the image generation model at a first step and the style embedding is provided to the image generation model at a second step after the first step.

14.

发明申请
GENERATING TILE-ABLE PATTERNS FROM TEXT 有权

公开(公告)号：US20240420389A1

公开(公告)日：2024-12-19

申请号：US18526855

申请日：2023-12-01

Applicant: ADOBE INC.

Inventor： Vineet Batra , Sumit Chaturvedi , Abhishek Rai , Pranav Vineet Aggarwal , Ajinkya Gorakhnath Kale , Aman Jeph , Ankit Phogat , Sumit Dhingra , Fengbin Chen , Kshitiz Garg , Milos Hasan , Midhun Harikumar , Gaurav Suresh Pathak , Souymodip Chakraborty

IPC: G06T11/20 , G06V10/764 , G06V10/774

Abstract: Systems and methods for generating tile-able patterns from text include obtaining a text prompt and generating, by a generation prior model, a latent vector based on the text prompt, where the generation prior model is trained to output vectors within a distribution of tile-able patterns. An image generation model then generates an output image based on the latent vector. The output image comprises a tile-able pattern including an element from the text prompt.

15.

发明公开
UTILIZING A DIFFUSION NEURAL NETWORK FOR MASK AWARE IMAGE AND TYPOGRAPHY EDITING 审中-公开

公开(公告)号：US20240355018A1

公开(公告)日：2024-10-24

申请号：US18303898

申请日：2023-04-20

Applicant: Adobe Inc.

Inventor： Pranav Aggarwal , Hareesh Ravi , Midhun Harikumar , Ajinkya Gorakhnath Kale , Fengbin Chen , Venkata Naveen Kumar Yadav Marri

IPC: G06T11/60 , G06T5/00 , G06T5/50 , G06T7/11 , G06T7/50 , G06T13/00

CPC classification number: G06T11/60 , G06T5/50 , G06T5/70 , G06T7/11 , G06T7/50 , G06T13/00 , G06T2200/24 , G06T2207/20021 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion neural network for mask aware image and typography editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from a base digital image. Moreover, the disclosed systems generate a mask-segmented image by combining a shape mask with the base digital image. In one or more implementations, the disclosed systems utilize noising steps of a diffusion noising model to generate a mask-segmented image noise map from the mask-segmented image. Furthermore, the disclosed systems utilize a diffusion neural network to create a stylized image corresponding to the shape mask from the base image embedding and the mask-segmented image noise map.

16.

发明公开
PRESET STYLE TRANSFER 审中-公开

公开(公告)号：US20240354895A1

公开(公告)日：2024-10-24

申请号：US18303271

申请日：2023-04-19

Applicant: ADOBE INC.

Inventor： Hareesh Ravi , Midhun Harikumar , Taesung Park , Ajinkya Gorakhnath Kale

IPC: G06T5/50 , G06T5/00 , G06T11/60 , G06V10/764

CPC classification number: G06T5/50 , G06T5/00 , G06T11/60 , G06V10/764 , G06T2200/24 , G06T2207/20076 , G06T2207/20081 , G06T2207/20084 , G06T2207/20092 , G06T2207/20212

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure include an image generation network configured to encode a plurality of abstract images using a style encoder to obtain a plurality of abstract style encodings, wherein the style encoder is trained to represent image style separately from image content. A clustering component clusters the plurality of abstract style encodings to obtain an abstract style cluster comprising a subset of the plurality of abstract style encodings. A preset component generates an abstract style transfer preset representing the abstract style cluster.

17.

发明授权
Image segmentation using text embedding 有权

公开(公告)号：US11615567B2

公开(公告)日：2023-03-28

申请号：US16952008

申请日：2020-11-18

Applicant: Adobe Inc.

Inventor： Midhun Harikumar , Pranav Aggarwal , Baldo Faieta , Ajinkya Kale , Zhe Lin

IPC: G06T11/60 , G06T7/11 , G06T7/162

Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

18.

发明公开
UTILIZING A DIFFUSION PRIOR NEURAL NETWORK FOR TEXT GUIDED DIGITAL IMAGE EDITING 审中-公开

公开(公告)号：US20240362842A1

公开(公告)日：2024-10-31

申请号：US18308017

申请日：2023-04-27

Applicant: Adobe Inc.

Inventor： Hareesh Ravi , Sachin Kelkar , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/60 , G06T5/00

CPC classification number: G06T11/60 , G06T5/70 , G06T2200/24 , G06T2207/20084

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion prior neural network for text guided digital image editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from the base digital image and an edit text embedding from edit text. Moreover, the disclosed systems utilize a diffusion prior neural network to generate a text-image embedding. In particular, the disclosed systems inject the base image embedding at a conceptual editing step of the diffusion prior neural network and condition a set of steps of the diffusion prior neural network after the conceptual editing step utilizing the edit text embedding. Furthermore, the disclosed systems utilize a diffusion neural network to create a modified digital image from the text-edited image embedding and the base image embedding.

19.

发明公开
PRIOR GUIDED LATENT DIFFUSION 审中-公开

公开(公告)号：US20240346629A1

公开(公告)日：2024-10-17

申请号：US18301671

申请日：2023-04-17

Applicant: ADOBE INC.

Inventor： Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc

IPC: G06T5/00 , G06F40/279 , G06T5/50

CPC classification number: G06T5/73 , G06F40/279 , G06T5/50

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.

20.

发明公开
IMAGE GENERATION USING A TEXT AND IMAGE CONDITIONED MACHINE LEARNING MODEL 审中-公开

公开(公告)号：US20240320872A1

公开(公告)日：2024-09-26

申请号：US18426763

申请日：2024-01-30

Applicant: ADOBE INC.

Inventor： Tobias Hinz , Venkata Naveen Kumar Yadav Marri , Midhun Harikumar , Ajinkya Gorakhnath Kale , Zhe Lin , Oliver Wang , Jingwan Lu

IPC: G06T11/00 , G06F40/284 , G06F40/40

CPC classification number: G06T11/00 , G06F40/284 , G06F40/40 , G06T2207/20081 , G06T2207/20084

Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation include obtaining a text embedding of a text prompt and an image embedding of an image prompt. Some embodiments map the text embedding into a joint embedding space to obtain a joint text embedding and map the image embedding into the joint embedding space to obtain a joint image embedding. Some embodiments generate a synthetic image based on the joint text embedding and the joint image embedding.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification