-
公开(公告)号:US11436865B1
公开(公告)日:2022-09-06
申请号:US17207178
申请日:2021-03-19
Applicant: Adobe Inc.
Inventor: Saeid Motiian , Zhe Lin , Shabnam Ghadar , Baldo Faieta
IPC: G06V40/16 , G06K9/62 , G06V30/194 , G06V40/10
Abstract: Embodiments are disclosed for finding similar persons in images. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an image query, the image query including an input image that includes a representation of a person, generating a first cropped image including a representation of the person's face and a second cropped image including a representation of the person's body, generating an image embedding for the input image by combining a face embedding corresponding to the first cropped image and a body embedding corresponding to the second cropped image, and querying an image repository in embedding space by comparing the image embedding to a plurality of image embeddings associated with a plurality of images in the image repository to obtain one or more images based on similarity to the input image in the embedding space.
-
公开(公告)号:US20220138247A1
公开(公告)日:2022-05-05
申请号:US17090150
申请日:2020-11-05
Applicant: ADOBE INC.
Inventor: Mikhail Kotov , Roland Geisler , Saeid Motiian , Dylan Nathaniel Warnock , Michele Saad , Venkata Naveen Kumar Yadav Marri , Ajinkya Kale , Ryan Rozich , Baldo Faieta
IPC: G06F16/532 , G06F16/583 , G06F16/538 , G06F16/2457
Abstract: Embodiments of the technology described herein, provide improved visual search results by combining a visual similarity and a textual similarity between images. In an embodiment, the visual similarity is quantified as a visual similarity score and the textual similarity is quantified as a textual similarity score. The textual similarity is determined based on text, such as a title, associated with the image. The overall similarity of two images is quantified as a weighted combination of the textual similarity score and the visual similarity score. In an embodiment, the weighting between the textual similarity score and the visual similarity score is user configurable through a control on the search interface. In one embodiment, the aggregate similarity score is the sum of a weighted visual similarity score and a weighted textual similarity score.
-
公开(公告)号:US11915520B2
公开(公告)日:2024-02-27
申请号:US17902349
申请日:2022-09-02
Applicant: Adobe Inc.
Inventor: Saeid Motiian , Zhe Lin , Shabnam Ghadar , Baldo Faieta
IPC: G06V40/16 , G06V30/194 , G06V40/10 , G06F18/00 , G06F18/20
CPC classification number: G06V40/172 , G06F18/00 , G06F18/29 , G06V30/194 , G06V40/10
Abstract: Embodiments are disclosed for finding similar persons in images. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an image query, the image query including an input image that includes a representation of a person, generating a first cropped image including a representation of the person's face and a second cropped image including a representation of the person's body, generating an image embedding for the input image by combining a face embedding corresponding to the first cropped image and a body embedding corresponding to the second cropped image, and querying an image repository in embedding space by comparing the image embedding to a plurality of image embeddings associated with a plurality of images in the image repository to obtain one or more images based on similarity to the input image in the embedding space.
-
公开(公告)号:US20230137774A1
公开(公告)日:2023-05-04
申请号:US17453595
申请日:2021-11-04
Applicant: ADOBE INC.
Inventor: Baldo Faieta , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Naveen Marri , Saeid Motiian , Tracy Holloway King , Alex Filipkowski , Shabnam Ghadar
IPC: G06F16/583 , G06F16/58 , G06F16/538 , G06F40/295 , G06F16/535 , G06N3/08
Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
-
公开(公告)号:US20220391611A1
公开(公告)日:2022-12-08
申请号:US17341778
申请日:2021-06-08
Applicant: ADOBE INC.
Inventor: RATHEESH KALAROT , Siavash Khodadadeh , Baldo Faieta , Shabnam Ghadar , Saeid Motiian , Wei-An Lin , Zhe Lin
Abstract: Systems and methods for image processing are described. One or more embodiments of the present disclosure identify a latent vector representing an image of a face, identify a target attribute vector representing a target attribute for the image, generate a modified latent vector using a mapping network that converts the latent vector and the target attribute vector into a hidden representation having fewer dimensions than the latent vector, wherein the modified latent vector is generated based on the hidden representation, and generate a modified image based on the modified latent vector, wherein the modified image represents the face with the target attribute.
-
公开(公告)号:US20220277039A1
公开(公告)日:2022-09-01
申请号:US17186625
申请日:2021-02-26
Applicant: ADOBE INC.
Inventor: PRANAV AGGARWAL , Ajinkya Kale , Baldo Faieta , Saeid Motiian , Venkata naveen kumar yadav Marri
IPC: G06F16/583 , G06F40/279 , G06K9/46 , G06F16/532 , G06F16/51 , G06F16/538 , G06N3/08
Abstract: The present disclosure describes systems and methods for information retrieval. Embodiments of the disclosure provide a color embedding network trained using machine learning techniques to generate embedded color representations for color terms included in a text search query. For example, techniques described herein are used to represent color text in a same space as color embeddings (e.g., an embedding space created by determining a histogram of LAB based colors in a three-dimensional (3D) space). Further, techniques are described for indexing color palettes for all the searchable images in the search space. Accordingly, color terms in a text query are directly converted into a color palette and an image search system can return one or more search images with corresponding color palettes that are relevant to (e.g., within a threshold distance from) the color palette of the text query.
-
公开(公告)号:US20220121705A1
公开(公告)日:2022-04-21
申请号:US17565816
申请日:2021-12-30
Applicant: Adobe Inc.
Inventor: Saeid Motiian , Zhe Lin , Samarth Gulati , Pramod Srinivasan , Jose Ignacio Echevarria Vallespi , Baldo Antonio Faieta
IPC: G06F16/583 , G06F17/16 , G06F16/55 , G06F16/532
Abstract: In implementations of multi-resolution color-based image search, an image search system determines a color vector for a query image based on a color histogram of the query image by concatenating two color histograms having different resolutions. The image search system can compute distance measures between the color vector of the query image and color vectors of candidate images. The image search system can select one or more of the candidate images to return based on the distance measures utilizing the distance measures as indication of color similarity of the candidate images to the query image.
-
公开(公告)号:US20220092108A1
公开(公告)日:2022-03-24
申请号:US17025041
申请日:2020-09-18
Applicant: Adobe Inc.
Inventor: John Collomosse , Zhe Lin , Saeid Motiian , Hailin Jin , Baldo Faieta , Alex Filipkowski
IPC: G06F16/583 , G06F16/535 , G06F16/532 , G06N3/08
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly identifying digital images with similar style to a query digital image using fine-grain style determination via weakly supervised style extraction neural networks. For example, the disclosed systems can extract a style embedding from a query digital image using a style extraction neural network such as a novel two-branch autoencoder architecture or a weakly supervised discriminative neural network. The disclosed systems can generate a combined style embedding by combining complementary style embeddings from different style extraction neural networks. Moreover, the disclosed systems can search a repository of digital images to identify digital images with similar style to the query digital image. The disclosed systems can also learn parameters for one or more style extraction neural network through weakly supervised training without a specifically labeled style ontology for sample digital images.
-
公开(公告)号:US20210365727A1
公开(公告)日:2021-11-25
申请号:US17398317
申请日:2021-08-10
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06F16/535 , G06N20/00 , G06K9/72
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
公开(公告)号:US11144784B2
公开(公告)日:2021-10-12
申请号:US16426264
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06K9/72 , G06F16/535 , G06N20/00 , G06F3/0482
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
-
-
-
-
-
-
-
-