Patent search ap:("Adobe Inc.") AND inv:"Midhun Harikumar" Page 2

11.

发明授权
Unsupervised style and color cues for transformer-based image generation 有权

公开(公告)号：US12277630B2

公开(公告)日：2025-04-15

申请号：US17662560

申请日：2022-05-09

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/40 , G06N3/045 , G06N3/088 , G06T7/13

Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.

12.

发明申请
CONTROLLING COMPOSITION AND STRUCTURE IN GENERATED IMAGES 有权

公开(公告)号：US20250117974A1

公开(公告)日：2025-04-10

申请号：US18908075

申请日：2024-10-07

Applicant: ADOBE INC.

Inventor： Midhun Harikumar , Nicholas Isaac Kolkin , Sachin Madhav Kelkar , Purvak Lapsiya , Elya Shechtman , Ajinkya Gorakhnath Kale , Jalansh Saumil Munshi

IPC: G06T11/00 , G06F40/284 , G06T5/70

Abstract: A method, apparatus, non-transitory computer readable medium, and system for generating synthetic images depicting an image element with a target composition include obtaining a content input and a composition input. The content input indicates an image element and the composition input indicates a target composition of the image element. Embodiments then encode the composition input to obtain a composition embedding representing the target composition. Subsequently, embodiments generate, using an image generation model, a synthetic image based on the content input and the composition embedding. The synthetic image depicts the image element with the target composition.

13.

发明授权
Image segmentation using text embedding 有权

公开(公告)号：US12008698B2

公开(公告)日：2024-06-11

申请号：US18117155

申请日：2023-03-03

Applicant: Adobe Inc.

Inventor： Midhun Harikumar , Pranav Aggarwal , Baldo Faieta , Ajinkya Kale , Zhe Lin

IPC: G06T11/60 , G06T7/11 , G06T7/162

CPC classification number: G06T11/60 , G06T7/11 , G06T7/162 , G06T2207/20081 , G06T2207/20084

Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.

14.

发明公开
OBJECT-AGNOSTIC IMAGE REPRESENTATION 审中-公开

公开(公告)号：US20240020954A1

公开(公告)日：2024-01-18

申请号：US17812596

申请日：2022-07-14

Applicant: ADOBE INC.

Inventor： Sachin Kelkar , Ajinkya Gorakhnath Kale , Midhun Harikumar

IPC: G06V10/774 , G06T5/00 , G06T7/194 , G06V10/771 , G06V10/776 , G06V10/26 , G06V10/75 , G06F16/532

CPC classification number: G06V10/774 , G06T5/005 , G06T7/194 , G06V10/771 , G06V10/776 , G06V10/267 , G06V10/759 , G06F16/532 , G06T2207/20081 , G06V2201/10

Abstract: Systems and methods for image processing, and specifically for generating object-agnostic image representations, are described. Embodiments of the present disclosure receive a training image including a foreground object and a background, remove the foreground object from the training image to obtain a modified training image, inpaint a portion of the modified training image corresponding to the foreground object to obtain an inpainted training image, encode the training image and the inpainted training image using a machine learning model to obtain an encoded training image and an encoded inpainted training image, and update parameters of the machine learning model based on the encoded training image and the encoded inpainted training image.

15.

发明公开
UNSUPERVISED STYLE AND COLOR CUES FOR TRANSFORMER-BASED IMAGE GENERATION 审中-公开

公开(公告)号：US20230360294A1

公开(公告)日：2023-11-09

申请号：US17662560

申请日：2022-05-09

Applicant: ADOBE INC.

Inventor： Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/40 , G06N3/04 , G06N3/08 , G06T7/13

CPC classification number: G06T11/40 , G06N3/0454 , G06N3/088 , G06T7/13 , G06T2207/20081 , G06T2207/20084

Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.

16.

发明申请
IMAGE SEGMENTATION USING TEXT EMBEDDING 有权

公开(公告)号：US20220156992A1

公开(公告)日：2022-05-19

申请号：US16952008

申请日：2020-11-18

Applicant: Adobe Inc.

Inventor： Midhun Harikumar , Pranav Aggarwal , Baldo Faieta , Ajinkya Kale , Zhe Lin

IPC: G06T11/60 , G06T7/11 , G06T7/162

Abstract: A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

17.

发明公开
UTILIZING A DIFFUSION PRIOR NEURAL NETWORK FOR TEXT GUIDED DIGITAL IMAGE EDITING 审中-公开

公开(公告)号：US20240362842A1

公开(公告)日：2024-10-31

申请号：US18308017

申请日：2023-04-27

Applicant: Adobe Inc.

Inventor： Hareesh Ravi , Sachin Kelkar , Midhun Harikumar , Ajinkya Gorakhnath Kale

IPC: G06T11/60 , G06T5/00

CPC classification number: G06T11/60 , G06T5/70 , G06T2200/24 , G06T2207/20084

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for utilizing a diffusion prior neural network for text guided digital image editing. For example, in one or more embodiments the disclosed systems utilize a text-image encoder to generate a base image embedding from the base digital image and an edit text embedding from edit text. Moreover, the disclosed systems utilize a diffusion prior neural network to generate a text-image embedding. In particular, the disclosed systems inject the base image embedding at a conceptual editing step of the diffusion prior neural network and condition a set of steps of the diffusion prior neural network after the conceptual editing step utilizing the edit text embedding. Furthermore, the disclosed systems utilize a diffusion neural network to create a modified digital image from the text-edited image embedding and the base image embedding.

18.

发明公开
PRIOR GUIDED LATENT DIFFUSION 审中-公开

公开(公告)号：US20240346629A1

公开(公告)日：2024-10-17

申请号：US18301671

申请日：2023-04-17

Applicant: ADOBE INC.

Inventor： Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc

IPC: G06T5/00 , G06F40/279 , G06T5/50

CPC classification number: G06T5/73 , G06F40/279 , G06T5/50

Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.

19.

发明公开
IMAGE GENERATION USING A TEXT AND IMAGE CONDITIONED MACHINE LEARNING MODEL 审中-公开

公开(公告)号：US20240320872A1

公开(公告)日：2024-09-26

申请号：US18426763

申请日：2024-01-30

Applicant: ADOBE INC.

Inventor： Tobias Hinz , Venkata Naveen Kumar Yadav Marri , Midhun Harikumar , Ajinkya Gorakhnath Kale , Zhe Lin , Oliver Wang , Jingwan Lu

IPC: G06T11/00 , G06F40/284 , G06F40/40

CPC classification number: G06T11/00 , G06F40/284 , G06F40/40 , G06T2207/20081 , G06T2207/20084

Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation include obtaining a text embedding of a text prompt and an image embedding of an image prompt. Some embodiments map the text embedding into a joint embedding space to obtain a joint text embedding and map the image embedding into the joint embedding space to obtain a joint image embedding. Some embodiments generate a synthetic image based on the joint text embedding and the joint image embedding.

20.

发明授权
Keyword localization digital image search 有权

公开(公告)号：US11934448B2

公开(公告)日：2024-03-19

申请号：US18302201

申请日：2023-04-18

Applicant: Adobe Inc.

Inventor： Pramod Srinivasan , Zhe Lin , Samarth Gulati , Saeid Motiian , Midhun Harikumar , Baldo Antonio Faieta , Alex C. Filipkowski

IPC: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30

CPC classification number: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30

Abstract: Keyword localization digital image search techniques are described. These techniques support an ability to indicate “where” a corresponding keyword is to be expressed with respect to a layout in a respective digital image resulting from a search query. The search query may also include an indication of a size of the keyword as expressed in the digital image, a number of instances of the keyword, and so forth. Additionally, the techniques and systems as described herein support real time search through use of keyword signatures.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification