-
公开(公告)号:US20200160111A1
公开(公告)日:2020-05-21
申请号:US16191724
申请日:2018-11-15
Applicant: ADOBE INC.
Inventor: Mingyang Ling , Alex Filipkowski , Zhe Lin , Jianming Zhang , Samarth Gulati
Abstract: Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditioned on a determination of the presence of the copy space in the image. The method further includes applying a segmentation CNN to the image. The segmentation CNN is configured to generate one or more pixel-level masks to define the location of copy spaces in the image, whether natural or manufactured, or to define the location of a background region of the image. The segmentation CNN may include a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers.
-
公开(公告)号:US11886793B2
公开(公告)日:2024-01-30
申请号:US17466679
申请日:2021-09-03
Applicant: ADOBE INC.
Inventor: Zhaowen Wang , Saeid Motiian , Baldo Faieta , Zegi Gu , Peter Evan O'Donovan , Alex Filipkowski , Jose Ignacio Echevarria Vallespi
IPC: G06F40/109 , G06F40/166 , G06F40/106 , G06F40/103
CPC classification number: G06F40/109 , G06F40/103 , G06F40/106 , G06F40/166
Abstract: Embodiments of the technology described herein, are an intelligent system that aims to expedite a text design process by providing text design predictions interactively. The system works with a typical text design scenario comprising a background image and one or more text strings as input. In the design scenario, the text string is to be placed on top of the background. The textual design agent may include a location recommendation model that recommends a location on the background image to place the text. The textual design agent may also include a font recommendation model, a size recommendation model, and a color recommendation model. The output of these four models may be combined to generate draft designs that are evaluated as a whole (combination of color, font, and size) for the best designs. The top designs may be output to the user.
-
公开(公告)号:US10970599B2
公开(公告)日:2021-04-06
申请号:US16191724
申请日:2018-11-15
Applicant: ADOBE INC.
Inventor: Mingyang Ling , Alex Filipkowski , Zhe Lin , Jianming Zhang , Samarth Gulati
Abstract: Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditioned on a determination of the presence of the copy space in the image. The method further includes applying a segmentation CNN to the image. The segmentation CNN is configured to generate one or more pixel-level masks to define the location of copy spaces in the image, whether natural or manufactured, or to define the location of a background region of the image. The segmentation CNN may include a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers.
-
公开(公告)号:US12093308B2
公开(公告)日:2024-09-17
申请号:US17453595
申请日:2021-11-04
Applicant: ADOBE INC.
Inventor: Baldo Faieta , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Naveen Marri , Saeid Motiian , Tracy Holloway King , Alex Filipkowski , Shabnam Ghadar
IPC: G06F16/583 , G06F16/535 , G06F16/538 , G06F16/58 , G06F40/295 , G06N3/08
CPC classification number: G06F16/5838 , G06F16/535 , G06F16/538 , G06F16/5866 , G06F40/295 , G06N3/08
Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
-
公开(公告)号:US11709885B2
公开(公告)日:2023-07-25
申请号:US17025041
申请日:2020-09-18
Applicant: Adobe Inc.
Inventor: John Collomosse , Zhe Lin , Saeid Motiian , Hailin Jin , Baldo Faieta , Alex Filipkowski
IPC: G06T7/00 , G06F16/583 , G06F16/532 , G06N3/08 , G06F16/535 , G06V10/82 , G06V20/30
CPC classification number: G06F16/5854 , G06F16/532 , G06F16/535 , G06F16/5838 , G06N3/08 , G06V10/82 , G06V20/30
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly identifying digital images with similar style to a query digital image using fine-grain style determination via weakly supervised style extraction neural networks. For example, the disclosed systems can extract a style embedding from a query digital image using a style extraction neural network such as a novel two-branch autoencoder architecture or a weakly supervised discriminative neural network. The disclosed systems can generate a combined style embedding by combining complementary style embeddings from different style extraction neural networks. Moreover, the disclosed systems can search a repository of digital images to identify digital images with similar style to the query digital image. The disclosed systems can also learn parameters for one or more style extraction neural network through weakly supervised training without a specifically labeled style ontology for sample digital images.
-
公开(公告)号:US20210217215A1
公开(公告)日:2021-07-15
申请号:US16738359
申请日:2020-01-09
Applicant: Adobe Inc.
Inventor: Kate Sousa , Zhe Lin , Saeid Motiian , Pramod Srinivasan , Baldo Faieta , Alex Filipkowski
Abstract: Based on a received digital image and text, a neural network trained to identify candidate text placement areas within images may be used to generate a mask for the digital image that includes a candidate text placement area. A bounding box for the digital image may be defined for the text and based on the candidate text placement area, and the text may be superimposed onto the digital image within the bounding box.
-
公开(公告)号:US20230137774A1
公开(公告)日:2023-05-04
申请号:US17453595
申请日:2021-11-04
Applicant: ADOBE INC.
Inventor: Baldo Faieta , Ajinkya Gorakhnath Kale , Pranav Vineet Aggarwal , Naveen Marri , Saeid Motiian , Tracy Holloway King , Alex Filipkowski , Shabnam Ghadar
IPC: G06F16/583 , G06F16/58 , G06F16/538 , G06F40/295 , G06F16/535 , G06N3/08
Abstract: Systems and methods for image retrieval are described. Embodiments of the present disclosure receive a search query from a user; extract an entity and a color phrase describing the entity from the search query; generate an entity color embedding in a color embedding space from the color phrase using a multi-modal color encoder; identify an image in a database based on metadata for the image including an object label corresponding to the extracted entity and an object color embedding in the color embedding space corresponding to the object label; and provide image information for the image to the user based on the metadata.
-
公开(公告)号:US11605168B2
公开(公告)日:2023-03-14
申请号:US17215067
申请日:2021-03-29
Applicant: Adobe Inc.
Inventor: Mingyang Ling , Alex Filipkowski , Zhe Lin , Jianming Zhang , Samarth Gulati
IPC: G06K9/62 , G06T7/11 , G06T7/136 , G06T7/143 , G06T7/174 , G06F18/214 , G06N3/045 , G06V10/25 , G06V10/764 , G06V10/82 , G06V10/26
Abstract: Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditioned on a determination of the presence of the copy space in the image. The method further includes applying a segmentation CNN to the image. The segmentation CNN is configured to generate one or more pixel-level masks to define the location of copy spaces in the image, whether natural or manufactured, or to define the location of a background region of the image. The segmentation CNN may include a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers.
-
公开(公告)号:US20220092108A1
公开(公告)日:2022-03-24
申请号:US17025041
申请日:2020-09-18
Applicant: Adobe Inc.
Inventor: John Collomosse , Zhe Lin , Saeid Motiian , Hailin Jin , Baldo Faieta , Alex Filipkowski
IPC: G06F16/583 , G06F16/535 , G06F16/532 , G06N3/08
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and flexibly identifying digital images with similar style to a query digital image using fine-grain style determination via weakly supervised style extraction neural networks. For example, the disclosed systems can extract a style embedding from a query digital image using a style extraction neural network such as a novel two-branch autoencoder architecture or a weakly supervised discriminative neural network. The disclosed systems can generate a combined style embedding by combining complementary style embeddings from different style extraction neural networks. Moreover, the disclosed systems can search a repository of digital images to identify digital images with similar style to the query digital image. The disclosed systems can also learn parameters for one or more style extraction neural network through weakly supervised training without a specifically labeled style ontology for sample digital images.
-
公开(公告)号:US20230070390A1
公开(公告)日:2023-03-09
申请号:US17466679
申请日:2021-09-03
Applicant: ADOBE INC.
Inventor: Zhaowen Weng , Saeid Motiian , Baldo Faieta , Zegi Gu , Peter Evan O'Donovan , Alex Filipkowski , Jose Ignacio Echevarria Vallespi
IPC: G06F40/109 , G06N3/04 , G06F40/166
Abstract: Embodiments of the technology described herein, are an intelligent system that aims to expedite a text design process by providing text design predictions interactively. The system works with a typical text design scenario comprising a background image and one or more text strings as input. In the design scenario, the text string is to be placed on top of the background. The textual design agent may include a location recommendation model that recommends a location on the background image to place the text. The textual design agent may also include a font recommendation model, a size recommendation model, and a color recommendation model. The output of these four models may be combined to generate draft designs that are evaluated as a whole (combination of color, font, and size) for the best designs. The top designs may be output to the user.
-
-
-
-
-
-
-
-
-