-
公开(公告)号:US11263259B2
公开(公告)日:2022-03-01
申请号:US16929429
申请日:2020-07-15
Applicant: Adobe Inc.
Inventor: Xiaohui Shen , Zhe Lin , Kalyan Krishna Sunkavalli , Hengshuang Zhao , Brian Lynn Price
Abstract: Compositing aware digital image search techniques and systems are described that leverage machine learning. In one example, a compositing aware image search system employs a two-stream convolutional neural network (CNN) to jointly learn feature embeddings from foreground digital images that capture a foreground object and background digital images that capture a background scene. In order to train models of the convolutional neural networks, triplets of training digital images are used. Each triplet may include a positive foreground digital image and a positive background digital image taken from the same digital image. The triplet also contains a negative foreground or background digital image that is dissimilar to the positive foreground or background digital image that is also included as part of the triplet.
-
公开(公告)号:US11256918B2
公开(公告)日:2022-02-22
申请号:US16874114
申请日:2020-05-14
Applicant: Adobe Inc.
Inventor: Zhe Lin , Xiaohui Shen , Mingyang Ling , Jianming Zhang , Jason Wen Yong Kuen
Abstract: In implementations of object detection in images, object detectors are trained using heterogeneous training datasets. A first training dataset is used to train an image tagging network to determine an attention map of an input image for a target concept. A second training dataset is used to train a conditional detection network that accepts as conditional inputs the attention map and a word embedding of the target concept. Despite the conditional detection network being trained with a training dataset having a small number of seen classes (e.g., classes in a training dataset), it generalizes to novel, unseen classes by concept conditioning, since the target concept propagates through the conditional detection network via the conditional inputs, thus influencing classification and region proposal. Hence, classes of objects that can be detected are expanded, without the need to scale training databases to include additional classes.
-
33.
公开(公告)号:US11232547B2
公开(公告)日:2022-01-25
申请号:US16930736
申请日:2020-07-16
Applicant: Adobe Inc.
Inventor: Chen Fang , Zhe Lin , Zhaowen Wang , Yulun Zhang , Yilin Wang , Jimei Yang
Abstract: A style of a digital image is transferred to another digital image of arbitrary resolution. A high-resolution (HR) content image is segmented into several low-resolution (LR) patches. The resolution of a style image is matched to have the same resolution as the LR content image patches. Style transfer is then performed on a patch-by-patch basis using, for example, a pair of feature transforms—whitening and coloring. The patch-by-patch style transfer process is then repeated at several increasing resolutions, or scale levels, of both the content and style images. The results of the style transfer at each scale level are incorporated into successive scale levels up to and including the original HR scale. As a result, style transfer can be performed with images having arbitrary resolutions to produce visually pleasing results with good spatial consistency.
-
公开(公告)号:US11222399B2
公开(公告)日:2022-01-11
申请号:US16384593
申请日:2019-04-15
Applicant: Adobe Inc.
Inventor: Zhe Lin , Radomir Mech , Xiaohui Shen , Brian L. Price , Jianming Zhang , Anant Gilra , Jen-Chan Jeff Chien
Abstract: Image cropping suggestion using multiple saliency maps is described. In one or more implementations, component scores, indicative of visual characteristics established for visually-pleasing croppings, are computed for candidate image croppings using multiple different saliency maps. The visual characteristics on which a candidate image cropping is scored may be indicative of its composition quality, an extent to which it preserves content appearing in the scene, and a simplicity of its boundary. Based on the component scores, the croppings may be ranked with regard to each of the visual characteristics. The rankings may be used to cluster the candidate croppings into groups of similar croppings, such that croppings in a group are different by less than a threshold amount and croppings in different groups are different by at least the threshold amount. Based on the clustering, croppings may then be chosen, e.g., to present them to a user for selection.
-
公开(公告)号:US20210374931A1
公开(公告)日:2021-12-02
申请号:US16888473
申请日:2020-05-29
Applicant: ADOBE INC.
Inventor: Akhilesh Kumar , Zhe Lin , William Lawrence Marino
Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for detecting and classifying an exposure defect in an image using neural networks trained via a limited amount of labeled training images. An image may be applied to a first neural network to determine whether the images includes an exposure defect. Detected defective image may be applied to a second neural network to determine an exposure defect classification for the image. The exposure defect classification can includes severe underexposure, medium underexposure, mild underexposure, mild overexposure, medium overexposure, severe overexposure, and/or the like. The image may be presented to a user along with the exposure defect classification.
-
公开(公告)号:US20210358130A1
公开(公告)日:2021-11-18
申请号:US17387195
申请日:2021-07-28
Applicant: Adobe Inc.
Inventor: Scott Cohen , Zhe Lin , Mingyang Ling
Abstract: The present disclosure relates to an object selection system that accurately detects and automatically selects user-requested objects (e.g., query objects) in a digital image. For example, the object selection system builds and utilizes an object selection pipeline to determine which object detection neural network to utilize to detect a query object based on analyzing the object class of the query object. In addition, the object selection system can add, update, or replace portions of the object selection pipeline to improve overall accuracy and efficiency of automatic object selection within an image.
-
公开(公告)号:US20210319255A1
公开(公告)日:2021-10-14
申请号:US17331161
申请日:2021-05-26
Applicant: Adobe Inc.
Inventor: Khoi Pham , Scott Cohen , Zhe Lin , Zhihong Ding , Walter Wei Tuh Chang
IPC: G06K9/62
Abstract: The present disclosure relates to an object selection system that automatically detects and selects objects in a digital image utilizing a large-scale object detector. For instance, in response to receiving a request to automatically select a query object with an unknown object class in a digital image, the object selection system can utilize a large-scale object detector to detect potential objects in the image, filter out one or more potential objects, and label the remaining potential objects in the image to detect the query object. In some implementations, the large-scale object detector utilizes a region proposal model, a concept mask model, and an auto tagging model to automatically detect objects in the digital image.
-
公开(公告)号:US20210319232A1
公开(公告)日:2021-10-14
申请号:US16846544
申请日:2020-04-13
Applicant: Adobe Inc
Inventor: Federico Perazzi , Zhe Lin , Ping Hu , Oliver Wang , Fabian David Caba Heilbron
Abstract: A Video Semantic Segmentation System (VSSS) is disclosed that performs accurate and fast semantic segmentation of videos using a set of temporally distributed neural networks. The VSSS receives as input a video signal comprising a contiguous sequence of temporally-related video frames. The VSSS extracts features from the video frames in the contiguous sequence and based upon the extracted features, selects, from a set of labels, a label to be associated with each pixel of each video frame in the video signal. In certain embodiments, a set of multiple neural networks are used to extract the features to be used for video segmentation and the extraction of features is distributed among the multiple neural networks in the set. A strong feature representation representing the entirety of the features is produced for each video frame in the sequence of video frames by aggregating the output features extracted by the multiple neural networks.
-
公开(公告)号:US20210248748A1
公开(公告)日:2021-08-12
申请号:US16789088
申请日:2020-02-12
Applicant: Adobe Inc.
Inventor: Kerem Can Turgutlu , Jayant Kumar , Jianming Zhang , Zhe Lin
Abstract: Techniques are disclosed for parsing a source image, to identify segments of one or more objects within the source image. The parsing is carried out by an image parsing pipeline that includes three distinct stages comprising three respectively neural network models. The source image can include one or more objects. A first neural network model of the pipeline identifies a section of the source image that includes the object comprising a plurality of segments. A second neural network model of the pipeline generates, from the section of the source image, a mask image, where the mask image identifys one or more segments of the object. A third neural network model of the pipeline further refines the identification of the segments in the mask image, to generate a parsed image. The parsed image identifies the segments of the object, by assigning corresponding unique labels to pixels of different segments of the object.
-
公开(公告)号:US20210241111A1
公开(公告)日:2021-08-05
申请号:US16782793
申请日:2020-02-05
Applicant: Adobe Inc.
Inventor: Shikun Liu , Zhe Lin , Yilin Wang , Jianming Zhang , Federico Perazzi
Abstract: The present disclosure relates to shaping the architecture of a neural network. For example, the disclosed systems can provide a neural network shaping mechanism for at least one sampling layer of a neural network. The neural network shaping mechanism can include a learnable scaling factor between a sampling rate of the at least one sampling layer and an additional sampling function. The disclosed systems can learn the scaling factor based on a dataset while jointly learning the network weights of the neural network. Based on the learned scaling factor, the disclosed systems can shape the architecture of the neural network by modifying the sampling rate of the at least one sampling layer.
-
-
-
-
-
-
-
-
-