-
公开(公告)号:US11604822B2
公开(公告)日:2023-03-14
申请号:US16426369
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06F7/00 , G06F16/583 , G06N3/084 , G06N20/00 , G06F16/538 , G06F16/532 , G06F16/33 , G06F3/04855
Abstract: Multi-modal differential search with real-time focus adaptation techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
公开(公告)号:US12299055B2
公开(公告)日:2025-05-13
申请号:US17887694
申请日:2022-08-15
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Ali Aminian , Ajinkya Gorakhnath Kale , Aashish Kumar Misraa
IPC: G06F7/00 , G06F16/903 , G06F16/908 , G06F16/953 , G06F18/2413 , G06N20/00 , G06V10/426 , G06V10/764
Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects.
-
公开(公告)号:US20240346629A1
公开(公告)日:2024-10-17
申请号:US18301671
申请日:2023-04-17
Applicant: ADOBE INC.
Inventor: Midhun Harikumar , Venkata Naveen Kumar Yadav Marri , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Vinh Ngoc Khuc
IPC: G06T5/00 , G06F40/279 , G06T5/50
CPC classification number: G06T5/73 , G06F40/279 , G06T5/50
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure obtain a text prompt for text guided image generation. A multi-modal encoder of an image processing apparatus encodes the text prompt to obtain a text embedding. A diffusion prior model of the image processing apparatus converts the text embedding to an image embedding. A latent diffusion model of the image processing apparatus generates an image based on the image embedding, wherein the image includes an element described by the text prompt.
-
公开(公告)号:US11605019B2
公开(公告)日:2023-03-14
申请号:US16426298
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
公开(公告)号:US20210326393A1
公开(公告)日:2021-10-21
申请号:US16854697
申请日:2020-04-21
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Ali Aminian , Ajinkya Gorakhnath Kale , Aashish Kumar Misraa
IPC: G06F16/903 , G06N20/00 , G06K9/62 , G06F16/908
Abstract: Technology is disclosed herein for enhanced similarity search. In an implementation, a search environment includes one or more computing hardware, software, and/or firmware components in support of enhanced similarity search. The one or more components identify a modality for a similarity search with respect to a query object. The components generate an embedding for the query object based on the modality and based on connections between the query object and neighboring nodes in a graph. The embedding for the query object provides the basis for the search for similar objects
-
公开(公告)号:US20200380403A1
公开(公告)日:2020-12-03
申请号:US16426298
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06N20/00 , G06K9/62 , G06F16/538 , G06N3/08
Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
公开(公告)号:US12277630B2
公开(公告)日:2025-04-15
申请号:US17662560
申请日:2022-05-09
Applicant: ADOBE INC.
Inventor: Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale
Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.
-
公开(公告)号:US20250117967A1
公开(公告)日:2025-04-10
申请号:US18443590
申请日:2024-02-16
Applicant: ADOBE INC.
Inventor: Ritiz Tambi , Purvak Lapsiya , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Nikolaos Vlassis
IPC: G06T11/00 , G06F40/166 , G06V10/764
Abstract: A method, apparatus, non-transitory computer readable media, and system for image generation include obtaining an input text prompt and an indication of a level of a target characteristic, where the target characteristic comprises a characteristic used to train an image generation model. Some embodiments generate an augmented text prompt comprising the input text and an objective text corresponding to the level of the target characteristic. Some embodiments generate, using the image generation model, an image based on the augmented text prompt, where the image depicts content of the input text prompt and has the level of the target characteristic.
-
公开(公告)号:US20240037881A1
公开(公告)日:2024-02-01
申请号:US17814940
申请日:2022-07-26
Applicant: ADOBE INC.
Inventor: Pranav Vineet Aggarwal , Alvin Ghouas , Ajinkya Gorakhnath Kale
CPC classification number: G06T19/20 , G06T7/11 , G06T5/005 , G06T5/50 , G06T7/194 , G06T2207/20224 , G06T2207/20021 , G06T2210/62 , G06T2200/24 , G06T2207/20092 , G06T2207/20084 , G06T2219/2024 , G06T2219/2004
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive a first image depicting a scene and a second image that includes a style; segment the first image to obtain a first segment and a second segment, wherein the first segment has a shape of an object in the scene; apply a style transfer network to the first segment and the second image to obtain a first image part, wherein the first image part has the shape of the object and the style from the second image; combine the first image part with a second image part corresponding to the second segment to obtain a combined image; and apply a lenticular effect to the combined image to obtain an output image.
-
公开(公告)号:US20230360294A1
公开(公告)日:2023-11-09
申请号:US17662560
申请日:2022-05-09
Applicant: ADOBE INC.
Inventor: Pranav Vineet Aggarwal , Midhun Harikumar , Ajinkya Gorakhnath Kale
CPC classification number: G06T11/40 , G06N3/0454 , G06N3/088 , G06T7/13 , G06T2207/20081 , G06T2207/20084
Abstract: Systems and methods for image processing are configured. Embodiments of the present disclosure identify target style attributes and target structure attributes for a composite image; generate a matrix of composite feature tokens based on the target style attributes and the target structure attributes, wherein subsequent feature tokens of the matrix of composite feature tokens are sequentially generated based on previous feature tokens of the matrix of composite feature tokens according to a linear ordering of the matrix of composite feature tokens; and generate the composite image based on the matrix of composite feature tokens, wherein the composite image includes the target style attributes and the target structure attributes.
-
-
-
-
-
-
-
-
-