-
公开(公告)号:US20200380298A1
公开(公告)日:2020-12-03
申请号:US16426264
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06K9/62 , G06K9/72 , G06F16/535 , G06N20/00
Abstract: Text-to-visual machine learning embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. These techniques include use of query-based training data which may expand availability and types of training data usable to train a model. Generation of negative digital image samples is also described that may increase accuracy in training the model using machine learning. A loss function is also described that also supports increased accuracy and computational efficiency by losses separately, e.g., between positive or negative sample embeddings a text embedding.
-
公开(公告)号:US20240153259A1
公开(公告)日:2024-05-09
申请号:US18053450
申请日:2022-11-08
Applicant: ADOBE INC.
Inventor: Saeid Motiian , Shabnam Ghadar
IPC: G06V10/82 , G06V10/75 , G06V10/771
CPC classification number: G06V10/82 , G06V10/751 , G06V10/771
Abstract: Systems and methods for image processing are provided. One aspect of the systems and methods includes identifying a style image including a target style. A style encoder network generates a style vector representing the target style based on the style image. The style encoder can be trained based on a style loss that encourages the network to match a desired style. A a diffusion model generates a synthetic image that includes the target style based on the style vector. The diffusion model is trained independently of the style encoder network.
-
公开(公告)号:US11934448B2
公开(公告)日:2024-03-19
申请号:US18302201
申请日:2023-04-18
Applicant: Adobe Inc.
Inventor: Pramod Srinivasan , Zhe Lin , Samarth Gulati , Saeid Motiian , Midhun Harikumar , Baldo Antonio Faieta , Alex C. Filipkowski
IPC: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30
CPC classification number: G06F16/532 , G06F16/51 , G06F16/538 , G06F16/54 , G06F16/583 , G06F40/30
Abstract: Keyword localization digital image search techniques are described. These techniques support an ability to indicate “where” a corresponding keyword is to be expressed with respect to a layout in a respective digital image resulting from a search query. The search query may also include an indication of a size of the keyword as expressed in the digital image, a number of instances of the keyword, and so forth. Additionally, the techniques and systems as described herein support real time search through use of keyword signatures.
-
公开(公告)号:US11605019B2
公开(公告)日:2023-03-14
申请号:US16426298
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
公开(公告)号:US20230070390A1
公开(公告)日:2023-03-09
申请号:US17466679
申请日:2021-09-03
Applicant: ADOBE INC.
Inventor: Zhaowen Weng , Saeid Motiian , Baldo Faieta , Zegi Gu , Peter Evan O'Donovan , Alex Filipkowski , Jose Ignacio Echevarria Vallespi
IPC: G06F40/109 , G06N3/04 , G06F40/166
Abstract: Embodiments of the technology described herein, are an intelligent system that aims to expedite a text design process by providing text design predictions interactively. The system works with a typical text design scenario comprising a background image and one or more text strings as input. In the design scenario, the text string is to be placed on top of the background. The textual design agent may include a location recommendation model that recommends a location on the background image to place the text. The textual design agent may also include a font recommendation model, a size recommendation model, and a color recommendation model. The output of these four models may be combined to generate draft designs that are evaluated as a whole (combination of color, font, and size) for the best designs. The top designs may be output to the user.
-
26.
公开(公告)号:US11574392B2
公开(公告)日:2023-02-07
申请号:US16803332
申请日:2020-02-27
Applicant: Adobe Inc.
Inventor: Zhe Lin , Vipul Dalal , Vera Lychagina , Shabnam Ghadar , Saeid Motiian , Rohith mohan Dodle , Prethebha Chandrasegaran , Mina Doroudi , Midhun Harikumar , Kannan Iyer , Jayant Kumar , Gaurav Kukal , Daniel Miranda , Charles R McKinney , Archit Kalra
Abstract: The present disclosure relates to an image merging system that automatically and seamlessly detects and merges missing people for a set of digital images into a composite group photo. For instance, the image merging system utilizes a number of models and operations to automatically analyze multiple digital images to identify a missing person from a base image, segment the missing person from the second image, and generate a composite group photo by merging the segmented image of the missing person into the base image. In this manner, the image merging system automatically creates merged group photos that appear natural and realistic.
-
公开(公告)号:US11380033B2
公开(公告)日:2022-07-05
申请号:US16738359
申请日:2020-01-09
Applicant: Adobe Inc.
Inventor: Kate Sousa , Zhe Lin , Saeid Motiian , Pramod Srinivasan , Baldo Faieta , Alex Filipkowski
Abstract: Based on a received digital image and text, a neural network trained to identify candidate text placement areas within images may be used to generate a mask for the digital image that includes a candidate text placement area. A bounding box for the digital image may be defined for the text and based on the candidate text placement area, and the text may be superimposed onto the digital image within the bounding box.
-
公开(公告)号:US11216505B2
公开(公告)日:2022-01-04
申请号:US16561973
申请日:2019-09-05
Applicant: Adobe Inc.
Inventor: Saeid Motiian , Zhe Lin , Samarth Gulati , Pramod Srinivasan , Jose Ignacio Echevarria Vallespi , Baldo Antonio Faieta
IPC: G06F16/00 , G06F16/583 , G06F17/16 , G06F16/55 , G06F16/532
Abstract: In implementations of multi-resolution color-based image search, an image search system determines a color vector for a query image based on a color histogram of the query image by concatenating two color histograms having different resolutions. The image search system can compute distance measures between the color vector of the query image and color vectors of candidate images. The image search system can select one or more of the candidate images to return based on the distance measures utilizing the distance measures as indication of color similarity of the candidate images to the query image.
-
公开(公告)号:US20210073270A1
公开(公告)日:2021-03-11
申请号:US16561973
申请日:2019-09-05
Applicant: Adobe Inc.
Inventor: Saeid Motiian , Zhe Lin , Samarth Gulati , Pramod Srinivasan , Jose Ignacio Echevarria Vallespi , Baldo Antonio Faieta
IPC: G06F16/583 , G06F16/532 , G06F16/55 , G06F17/16
Abstract: In implementations of multi-resolution color-based image search, an image search system determines a color vector for a query image based on a color histogram of the query image by concatenating two color histograms having different resolutions. The image search system can compute distance measures between the color vector of the query image and color vectors of candidate images. The image search system can select one or more of the candidate images to return based on the distance measures utilizing the distance measures as indication of color similarity of the candidate images to the query image.
-
公开(公告)号:US20200380403A1
公开(公告)日:2020-12-03
申请号:US16426298
申请日:2019-05-30
Applicant: Adobe Inc.
Inventor: Pranav Vineet Aggarwal , Zhe Lin , Baldo Antonio Faieta , Saeid Motiian
IPC: G06N20/00 , G06K9/62 , G06F16/538 , G06N3/08
Abstract: Visually guided machine-learning language model and embedding techniques are described that overcome the challenges of conventional techniques in a variety of ways. In one example, a model is trained to support a visually guided machine-learning embedding space that supports visual intuition as to “what” is represented by text. The visually guided language embedding space supported by the model, once trained, may then be used to support visual intuition as part of a variety of functionality. In one such example, the visually guided language embedding space as implemented by the model may be leveraged as part of a multi-modal differential search to support search of digital images and other digital content with real-time focus adaptation which overcomes the challenges of conventional techniques.
-
-
-
-
-
-
-
-
-