-
Publication No.: US11900519B2
Publication Date: 2024-02-13
Application No.: US17455318
Filing Date: 2021-11-17
Applicant: ADOBE INC.
Inventor: Kevin Duarte, Wei-An Lin, Ratheesh Kalarot, Shabnam Ghadar, Jingwan Lu, Elya Shechtman, John Thomas Nack
Abstract: Systems and methods for image processing are described. Embodiments of the present disclosure encode features of a source image to obtain a source appearance encoding that represents inherent attributes of a face in the source image; encode features of a target image to obtain a target non-appearance encoding that represents contextual attributes of the target image; combine the source appearance encoding and the target non-appearance encoding to obtain combined image features; and generate a modified target image based on the combined image features, wherein the modified target image includes the inherent attributes of the face in the source image together with the contextual attributes of the target image.
-
Publication No.: US11887216B2
Publication Date: 2024-01-30
Application No.: US17455796
Filing Date: 2021-11-19
Applicant: ADOBE INC.
Inventor: Ratheesh Kalarot, Timothy M. Converse, Shabnam Ghadar, John Thomas Nack, Jingwan Lu, Elya Shechtman, Baldo Faieta, Akhilesh Kumar
CPC classification number: G06T11/00, G06N3/08, G06V40/168, G06V40/172
Abstract: The present disclosure describes systems and methods for image processing. Embodiments of the present disclosure include an image processing apparatus configured to generate modified images (e.g., synthetic faces) by conditionally changing attributes or landmarks of an input image. A machine learning model of the image processing apparatus encodes the input image to obtain a joint conditional vector that represents attributes and landmarks of the input image in a vector space. The joint conditional vector is then modified, according to the techniques described herein, to form a latent vector used to generate a modified image. In some cases, the machine learning model is trained using a generative adversarial network (GAN) with a normalization technique, followed by joint training of a landmark embedding and attribute embedding (e.g., to reduce inference time).
-
Publication No.: US11854119B2
Publication Date: 2023-12-26
Application No.: US17155570
Filing Date: 2021-01-22
Applicant: Adobe Inc.
Inventor: Siavash Khodadadeh, Zhe Lin, Shabnam Ghadar, Saeid Motiian, Richard Zhang, Ratheesh Kalarot, Baldo Faieta
CPC classification number: G06T11/001, G06N3/045, G06N3/08, G06T7/90
Abstract: Embodiments are disclosed for automatic object re-colorization in images. In some embodiments, a method of automatic object re-colorization includes receiving a request to recolor an object in an image, the request including an object identifier and a color identifier, identifying an object in the image associated with the object identifier, generating a mask corresponding to the object in the image, providing the image, the mask, and the color identifier to a color transformer network, the color transformer network trained to recolor objects in input images, and generating, by the color transformer network, a recolored image, wherein the object in the recolored image has been recolored to a color corresponding to the color identifier.
-
Publication No.: US20230386114A1
Publication Date: 2023-11-30
Application No.: US18449604
Filing Date: 2023-08-14
Applicant: Adobe Inc.
Inventor: Akhilesh Kumar, Baldo Faieta, Piotr Walczyszyn, Ratheesh Kalarot, Archie Bagnall, Shabnam Ghadar, Wei-An Lin, Cameron Smith, Christian Cantrell, Patrick Hebron, Wilson Chan, Jingwan Lu, Holger Winnemoeller, Sven Olsen
CPC classification number: G06T11/60, G06N3/04, G06T11/203
Abstract: The present disclosure describes systems, methods, and non-transitory computer readable media for detecting user interactions to edit a digital image from a client device and modify the digital image for the client device by using a web-based intermediary that modifies a latent vector of the digital image and an image modification neural network to generate a modified digital image from the modified latent vector. In response to user interaction to modify a digital image, for instance, the disclosed systems modify a latent vector extracted from the digital image to reflect the requested modification. The disclosed systems further use a latent vector stream renderer (as an intermediary device) to generate an image delta that indicates a difference between the digital image and the modified digital image. The disclosed systems then provide the image delta as part of a digital stream to a client device to quickly render the modified digital image.
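Illustrative note: the image delta idea in this abstract can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the patented method: the sparse `(row, column) -> value` delta format and the names `compute_delta` and `apply_delta` are hypothetical, and the images are plain grayscale grids rather than output of an image modification neural network.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]             # grayscale grid, one int per pixel
Delta = Dict[Tuple[int, int], int]  # hypothetical sparse delta format


def compute_delta(original: Image, modified: Image) -> Delta:
    """Server side: keep only the pixels that differ between the two images."""
    same_shape = len(original) == len(modified) and all(
        len(a) == len(b) for a, b in zip(original, modified)
    )
    if not same_shape:
        raise ValueError("images must have the same shape")
    return {
        (r, c): new
        for r, row in enumerate(modified)
        for c, new in enumerate(row)
        if original[r][c] != new
    }


def apply_delta(image: Image, delta: Delta) -> Image:
    """Client side: rebuild the modified image from the cached image and the delta."""
    out = [list(row) for row in image]  # copy so the cached image is untouched
    for (r, c), value in delta.items():
        out[r][c] = value
    return out
```

Under these assumptions, only the changed pixels travel to the client, which applies them to the image it already holds.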
-
Publication No.: US11823490B2
Publication Date: 2023-11-21
Application No.: US17341778
Filing Date: 2021-06-08
Applicant: ADOBE INC.
Inventor: Ratheesh Kalarot, Siavash Khodadadeh, Baldo Faieta, Shabnam Ghadar, Saeid Motiian, Wei-An Lin, Zhe Lin
CPC classification number: G06V40/169, G06N3/045, G06N3/084, G06T11/60
Abstract: Systems and methods for image processing are described. One or more embodiments of the present disclosure identify a latent vector representing an image of a face, identify a target attribute vector representing a target attribute for the image, generate a modified latent vector using a mapping network that converts the latent vector and the target attribute vector into a hidden representation having fewer dimensions than the latent vector, wherein the modified latent vector is generated based on the hidden representation, and generate a modified image based on the modified latent vector, wherein the modified image represents the face with the target attribute.
-
Publication No.: US20230316606A1
Publication Date: 2023-10-05
Application No.: US17655739
Filing Date: 2022-03-21
Applicant: Adobe Inc.
Inventor: Hui Qu, Baldo Faieta, Cameron Smith, Elya Shechtman, Jingwan Lu, Ratheesh Kalarot, Richard Zhang, Saeid Motiian, Shabnam Ghadar, Wei-An Lin
CPC classification number: G06T11/60, G06N3/0454
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for latent-based editing of digital images using a generative neural network. In particular, in one or more embodiments, the disclosed systems perform latent-based editing of a digital image by mapping a feature tensor and a set of style vectors for the digital image into a joint feature style space. In one or more implementations, the disclosed systems apply a joint feature style perturbation and/or modification vectors within the joint feature style space to determine modified style vectors and a modified feature tensor. Moreover, in one or more embodiments the disclosed systems generate a modified digital image utilizing a generative neural network from the modified style vectors and the modified feature tensor.
-
Publication No.: US20230076196A1
Publication Date: 2023-03-09
Application No.: US17466699
Filing Date: 2021-09-03
Applicant: ADOBE INC.
Inventor: Akhilesh Kumar, Ratheesh Kalarot, Baldo Faieta, Shabnam Ghadar
Abstract: Embodiments of the present invention provide systems, methods, and computer storage media for editing images using a web-based intermediary between a user interface on a client device and an image editing neural network(s) (e.g., a generative adversarial network) on a server(s). The present image editing system supports multiple users in the same software container, advanced concurrency of projection and transformation of the same image, clubbing transformation requests from several users hosted in the same software container, and smooth display updates during a progressive projection.
-
Publication No.: US20220122306A1
Publication Date: 2022-04-21
Application No.: US17468487
Filing Date: 2021-09-07
Applicant: Adobe Inc.
Inventor: Wei-An Lin, Baldo Faieta, Cameron Smith, Elya Shechtman, Jingwan Lu, Jun-Yan Zhu, Niloy Mitra, Ratheesh Kalarot, Richard Zhang, Shabnam Ghadar, Zhixin Shu
IPC: G06T11/60, G06F3/0484, G06N3/08, G06N3/04
Abstract: Systems and methods dynamically adjust an available range for editing an attribute in an image. An image editing system computes a metric for an attribute in an input image as a function of a latent space representation of the input image and a filtering vector for editing the input image. The image editing system compares the metric to a threshold. If the metric exceeds the threshold, then the image editing system selects a first range for editing the attribute in the input image. If the metric does not exceed the threshold, a second range is selected. The image editing system causes display of a user interface for editing the input image comprising an interface element for editing the attribute within the selected range.
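Illustrative note: the threshold rule in this abstract can be sketched as follows. This is a minimal sketch, not the claimed implementation. The abstract only says the metric is a function of the latent space representation and the filtering vector, so the dot product used here is an assumption, and the function names and the two default ranges are hypothetical.

```python
from typing import Sequence, Tuple

Range = Tuple[float, float]


def attribute_metric(latent: Sequence[float], filtering_vector: Sequence[float]) -> float:
    """Assumed metric: dot product of the latent code and the filtering vector."""
    if len(latent) != len(filtering_vector):
        raise ValueError("latent and filtering vector must have the same length")
    return sum(a * b for a, b in zip(latent, filtering_vector))


def select_edit_range(
    latent: Sequence[float],
    filtering_vector: Sequence[float],
    threshold: float,
    first_range: Range = (-1.0, 1.0),   # hypothetical range when the metric exceeds the threshold
    second_range: Range = (-3.0, 3.0),  # hypothetical range otherwise
) -> Range:
    """Pick the slider range offered in the editing interface for this attribute."""
    metric = attribute_metric(latent, filtering_vector)
    return first_range if metric > threshold else second_range
```

A metric exactly equal to the threshold does not exceed it, so the second range is selected in that case.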
-
Publication No.: US20220122221A1
Publication Date: 2022-04-21
Application No.: US17384357
Filing Date: 2021-07-23
Applicant: Adobe Inc.
Inventor: Cameron Smith, Ratheesh Kalarot, Wei-An Lin, Richard Zhang, Niloy Mitra, Elya Shechtman, Shabnam Ghadar, Zhixin Shu, Yannick Hold-Geoffrey, Nathan Carr, Jingwan Lu, Oliver Wang, Jun-Yan Zhu
IPC: G06T3/40, G06F3/0484, G06N3/08, G06N3/04
Abstract: An improved system architecture uses a pipeline including a Generative Adversarial Network (GAN) including a generator neural network and a discriminator neural network to generate an image. An input image in a first domain and information about a target domain are obtained. The domains correspond to image styles. An initial latent space representation of the input image is produced by encoding the input image. An initial output image is generated by processing the initial latent space representation with the generator neural network. Using the discriminator neural network, a score is computed indicating whether the initial output image is in the target domain. A loss is computed based on the computed score. The loss is minimized to compute an updated latent space representation. The updated latent space representation is processed with the generator neural network to generate an output image in the target domain.
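Illustrative note: the loop in this abstract (generate, score with the discriminator, compute a loss, update the latent representation) can be sketched with toy stand-ins. Everything below is an assumption made for illustration: the generator is an elementwise scaling, the discriminator is a logistic score over a fixed weight vector, the loss is `-log(score)`, and the gradient is written out by hand. A real pipeline would use trained networks and automatic differentiation.

```python
import math
from typing import List, Sequence


def generator(z: Sequence[float], gain: float = 2.0) -> List[float]:
    """Toy generator: scales each latent coordinate (stand-in for a GAN generator)."""
    return [gain * v for v in z]


def discriminator(x: Sequence[float], w: Sequence[float], b: float = 0.0) -> float:
    """Toy discriminator: logistic score in (0, 1) that x is in the target domain."""
    s = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-s))


def refine_latent(
    z: Sequence[float],
    w: Sequence[float],
    b: float = 0.0,
    gain: float = 2.0,
    lr: float = 0.1,
    steps: int = 50,
) -> List[float]:
    """Gradient descent on loss = -log(D(G(z))) with respect to the latent z."""
    z = list(z)
    for _ in range(steps):
        score = discriminator(generator(z, gain), w, b)
        # d(-log(sigmoid(s)))/dz_i = -(1 - score) * gain * w_i for this toy model
        grad = [-(1.0 - score) * gain * wi for wi in w]
        z = [zi - lr * gi for zi, gi in zip(z, grad)]
    return z
```

In this toy setting the updated latent yields a higher discriminator score than the initial one, which mirrors the abstract's step of minimizing the loss to compute an updated latent space representation.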
-
Publication No.: US20220121931A1
Publication Date: 2022-04-21
Application No.: US17384371
Filing Date: 2021-07-23
Applicant: Adobe Inc.
Inventor: Ratheesh Kalarot, Wei-An Lin, Cameron Smith, Zhixin Shu, Baldo Faieta, Shabnam Ghadar, Jingwan Lu, Aliakbar Darabi, Jun-Yan Zhu, Niloy Mitra, Richard Zhang, Elya Shechtman
Abstract: Systems and methods train and apply a specialized encoder neural network for fast and accurate projection into the latent space of a Generative Adversarial Network (GAN). The specialized encoder neural network includes an input layer, a feature extraction layer, and a bottleneck layer positioned after the feature extraction layer. The projection process includes providing an input image to the encoder and producing, by the encoder, a latent space representation of the input image. Producing the latent space representation includes extracting a feature vector from the feature extraction layer, providing the feature vector to the bottleneck layer as input, and producing the latent space representation as output. The latent space representation produced by the encoder is provided as input to the GAN, which generates an output image based upon the latent space representation. The encoder is trained using specialized loss functions including a segmentation loss and a mean latent loss.