-
Publication No.: US10853700B2
Publication Date: 2020-12-01
Application No.: US16928949
Filing Date: 2020-07-14
Applicant: Adobe Inc.
Inventor: Jayant Kumar , Zhe Lin , Vipulkumar C. Dalal
IPC: G06K9/62
Abstract: There is described a computing device and method in a digital medium environment for custom auto tagging of multiple objects. The computing device includes an object detection network and multiple image classification networks. An image is received at the object detection network and includes multiple visual objects. First feature maps are applied to the image at the object detection network and generate object regions associated with the visual objects. The object regions are assigned to the multiple image classification networks, and each image classification network is assigned to a particular object region. The second feature maps are applied to each object region at each image classification network, and each image classification network outputs one or more classes associated with a visual object corresponding to each object region.
-
Publication No.: US20230274478A1
Publication Date: 2023-08-31
Application No.: US17652512
Filing Date: 2022-02-25
Applicant: ADOBE INC.
Inventor: Kerem Can Turgutlu , Sanat Sharma , Jayant Kumar , Rohith Mohan Dodle , Vipul Dalal
IPC: G06T11/60 , G06V10/764 , G06V20/70 , G06V10/774 , G06V10/82
CPC classification number: G06T11/60 , G06V10/764 , G06V20/70 , G06V10/774 , G06V10/82 , G06T2210/12 , G06T2210/61
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure receive an image depicting an object; generate a sequence of tokens including a set of tokens corresponding to the object and a set of mask tokens corresponding to an additional object to be inserted into the image; generate a placement token value for the set of mask tokens based on the sequence of tokens using a sequence encoder, wherein the placement token value represents position information of the additional object; and insert the additional object into the image based on the position information to obtain a composite image.
-
Publication No.: US11574392B2
Publication Date: 2023-02-07
Application No.: US16803332
Filing Date: 2020-02-27
Applicant: Adobe Inc.
Inventor: Zhe Lin , Vipul Dalal , Vera Lychagina , Shabnam Ghadar , Saeid Motiian , Rohith Mohan Dodle , Prethebha Chandrasegaran , Mina Doroudi , Midhun Harikumar , Kannan Iyer , Jayant Kumar , Gaurav Kukal , Daniel Miranda , Charles R McKinney , Archit Kalra
Abstract: The present disclosure relates to an image merging system that automatically and seamlessly detects and merges missing people for a set of digital images into a composite group photo. For instance, the image merging system utilizes a number of models and operations to automatically analyze multiple digital images to identify a missing person from a base image, segment the missing person from the second image, and generate a composite group photo by merging the segmented image of the missing person into the base image. In this manner, the image merging system automatically creates merged group photos that appear natural and realistic.
-
Publication No.: US20220351513A1
Publication Date: 2022-11-03
Application No.: US17865076
Filing Date: 2022-07-14
Applicant: Adobe Inc.
Inventor: Jayant Kumar , Vera Lychagina , Tarun Vashisth , Sudhakar Pandey , Sharad Mangalick , Rohith Mohan Dodle , Peter Baust , Mina Doroudi , Kerem Turgutlu , Kannan Iyer , Gaurav Kukal , Archit Kalra , Amine Ben Khalifa
Abstract: Disclosed are systems and methods for dynamically determining categories for images. A computer-implemented method may include training a neural network to receive an input image and determine one or more image categories associated with the input image; obtaining a set of images associated with a user; determining, using the trained neural network, one or more image categories associated with each image included in the obtained set of images; determining one or more dominant image categories associated with the user based on the determined image categories for the obtained set of images; and determining an image editing user interface for the user based on the determined one or more dominant image categories.
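The step of deriving dominant categories from per-image predictions can be illustrated with a minimal sketch. It assumes the trained network has already produced a list of category labels per image; the function name `dominant_categories`, the sample labels, and the `UI_PRESETS` mapping are hypothetical and do not come from the patent.

```python
from collections import Counter

def dominant_categories(per_image_categories, top_k=2):
    """Return the top_k categories that occur in the most images."""
    # Count each category at most once per image.
    counts = Counter(cat for cats in per_image_categories for cat in set(cats))
    return [cat for cat, _ in counts.most_common(top_k)]

# Hypothetical classifier output for four images belonging to one user.
predictions = [
    ["portrait", "indoor"],
    ["portrait"],
    ["landscape"],
    ["portrait", "landscape"],
]
dominant = dominant_categories(predictions, top_k=2)

# Hypothetical mapping from a dominant category to an editing interface preset.
UI_PRESETS = {
    "portrait": "skin-retouch tools",
    "landscape": "sky and horizon tools",
}
ui = UI_PRESETS.get(dominant[0], "default tools")
print(dominant, ui)
```

A frequency count is only one plausible way to define "dominant"; a confidence-weighted sum would be an equally reasonable reading of the abstract.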
-
Publication No.: US11468674B2
Publication Date: 2022-10-11
Application No.: US16995869
Filing Date: 2020-08-18
Applicant: Adobe Inc.
Inventor: Jayant Kumar , Vera Lychagina , Tarun Vashisth , Sudhakar Pandey , Sharad Mangalick , Rohith Mohan Dodle , Peter Baust , Mina Doroudi , Kerem Turgutlu , Kannan Iyer , Gaurav Kukal , Archit Kalra , Amine Ben Khalifa
Abstract: Disclosed are systems and methods for dynamically determining categories for images. A computer-implemented method includes training a neural network to receive an input image and determine one or more image categories associated with the input image; obtaining a set of images associated with a user; and determining, using the trained neural network, one or more image categories associated with each image included in the obtained set of images.
-
Publication No.: US20210248748A1
Publication Date: 2021-08-12
Application No.: US16789088
Filing Date: 2020-02-12
Applicant: Adobe Inc.
Inventor: Kerem Can Turgutlu , Jayant Kumar , Jianming Zhang , Zhe Lin
Abstract: Techniques are disclosed for parsing a source image, to identify segments of one or more objects within the source image. The parsing is carried out by an image parsing pipeline that includes three distinct stages comprising three respective neural network models. The source image can include one or more objects. A first neural network model of the pipeline identifies a section of the source image that includes the object comprising a plurality of segments. A second neural network model of the pipeline generates, from the section of the source image, a mask image, where the mask image identifies one or more segments of the object. A third neural network model of the pipeline further refines the identification of the segments in the mask image, to generate a parsed image. The parsed image identifies the segments of the object, by assigning corresponding unique labels to pixels of different segments of the object.
-
Publication No.: US10733480B2
Publication Date: 2020-08-04
Application No.: US16039311
Filing Date: 2018-07-18
Applicant: Adobe Inc.
Inventor: Jayant Kumar , Zhe Lin , Vipulkumar C. Dalal
IPC: G06K9/62
Abstract: There is described a computing device and method in a digital medium environment for custom auto tagging of multiple objects. The computing device includes an object detection network and multiple image classification networks. An image is received at the object detection network and includes multiple visual objects. First feature maps are applied to the image at the object detection network and generate object regions associated with the visual objects. The object regions are assigned to the multiple image classification networks, and each image classification network is assigned to a particular object region. The second feature maps are applied to each object region at each image classification network, and each image classification network outputs one or more classes associated with a visual object corresponding to each object region.
-
Publication No.: US11775734B2
Publication Date: 2023-10-03
Application No.: US17534937
Filing Date: 2021-11-24
Applicant: Adobe Inc.
Inventor: Sanat Sharma , Jing Zheng , Jayant Kumar
IPC: G06F40/109 , G06N3/02 , G06N5/02
CPC classification number: G06F40/109 , G06N3/02 , G06N5/02
Abstract: Embodiments are disclosed for receiving a modal input including at least one of a text input or an image input. The method may include extracting an intent label from the modal input. The method may further include generating, by an intent embedding generator, an intent embedding from the intent label. The method may further include comparing the intent embedding to a plurality of candidate font embeddings to obtain one or more candidate fonts based on a similarity of the intent embedding to the plurality of candidate font embeddings in an embedding space. The method may further include identifying a recommended font based on the similarity of the intent embedding to a selected candidate font embedding of the plurality of candidate font embeddings.
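The comparison of an intent embedding against candidate font embeddings can be sketched as a nearest-neighbor lookup. The sketch below assumes cosine similarity as the similarity measure, which the abstract does not specify; the font names, the three-dimensional vectors, and the function `recommend_font` are hypothetical placeholders for the output of the intent embedding generator.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def recommend_font(intent_embedding, font_embeddings, top_k=2):
    """Rank fonts by similarity to the intent; return (best font, top_k candidates)."""
    ranked = sorted(
        font_embeddings,
        key=lambda name: cosine(intent_embedding, font_embeddings[name]),
        reverse=True,
    )
    candidates = ranked[:top_k]
    return candidates[0], candidates

# Hypothetical font embeddings in a shared 3-dimensional embedding space.
FONT_EMBEDDINGS = {
    "PlayfulScript": [0.9, 0.1, 0.0],
    "FormalSerif": [0.1, 0.9, 0.1],
    "TechMono": [0.0, 0.2, 0.9],
}

# Hypothetical embedding of an intent label such as "birthday party".
intent = [0.8, 0.3, 0.0]
best, candidates = recommend_font(intent, FONT_EMBEDDINGS)
print(best, candidates)
```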
-
Publication No.: US20230237251A1
Publication Date: 2023-07-27
Application No.: US17583818
Filing Date: 2022-01-25
Applicant: Adobe Inc.
Inventor: Oliver Brdiczka , Sanat Sharma , Jayant Kumar , Alexandru Vasile Costin , Aliakbar Darabi , Kushith Amerasinghe
IPC: G06F40/166 , G06F40/106 , G06V30/413 , G06F16/58 , G06F16/38
CPC classification number: G06F40/166 , G06F40/106 , G06V30/413 , G06F16/5866 , G06F16/38
Abstract: An illustrator system accesses a multi-element document, the multi-element document including a plurality of elements. The illustrator system determines, for each of the plurality of elements, an element-specific topic distribution comprising a ranked list of topics. The illustrator system creates a first aggregated topic distribution from the determined element-specific topic distributions. The illustrator system determines a global intent for the multi-element document, the global intent including one or more terms from the first aggregated topic distribution. The illustrator system queries a database using the global intent to retrieve a substitute element. The illustrator system generates a replacement multi-element document that includes a substitute element in place of an element in the multi-element document. The at least one substitute element is different from the element in the displayed multi-element document.
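The aggregation of element-specific topic distributions into a global intent can be sketched as follows. This is an illustration under assumptions: each element's distribution is represented as a list of (topic, weight) pairs, aggregation is done by summing weights, and the global intent is the top-ranked terms. The abstract does not state the aggregation rule, and the function names and sample topics are hypothetical.

```python
from collections import defaultdict

def aggregate_topics(element_distributions):
    """Sum topic weights across elements and return topics ranked by total weight."""
    totals = defaultdict(float)
    for distribution in element_distributions:
        for topic, weight in distribution:
            totals[topic] += weight
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)

def global_intent(aggregated, n_terms=2):
    """Take the n_terms highest-ranked topics as the document's global intent."""
    return [topic for topic, _ in aggregated[:n_terms]]

# Hypothetical topic distributions for three elements of one document.
elements = [
    [("beach", 0.6), ("travel", 0.3), ("food", 0.1)],
    [("travel", 0.7), ("beach", 0.2), ("hotel", 0.1)],
    [("beach", 0.5), ("sunset", 0.5)],
]
aggregated = aggregate_topics(elements)
intent_terms = global_intent(aggregated)
print(intent_terms)
```

The resulting terms would then serve as the query against the element database; that retrieval step is omitted here because the abstract gives no detail about the database interface.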