-
61.
公开(公告)号:US20240267597A1
公开(公告)日:2024-08-08
申请号:US18164348
申请日:2023-02-03
Applicant: Adobe Inc.
Inventor: Xiaojuan Wang , Richard Zhang , Taesung Park , Yang Zhou , Elya Shechtman
IPC: H04N21/81 , G06V10/771 , G06V10/82 , H04N21/234
CPC classification number: H04N21/8153 , G06V10/771 , G06V10/82 , H04N21/23424
Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that utilize machine learning to generate a sequence of transition frames for a gap in a clipped digital video. For example, the disclosed system receives a clipped digital video that includes a pre-cut frame prior to a gap in the clipped digital video and a post-cut frame following the gap in the clipped digital video. Moreover, the disclosed system utilizes a natural motion sequence model to generates a sequence of transition keypoint maps between the pre-cut frame and the post-cut frame. Furthermore, using a generative neural network, the disclosed system generates a sequence of transition frames for the gap in the clipped digital video from the sequence of transition keypoint maps.
-
公开(公告)号:US20240169604A1
公开(公告)日:2024-05-23
申请号:US18057453
申请日:2022-11-21
Applicant: ADOBE INC.
Inventor: Yosef Gandelsman , Taesung Park , Richard Zhang , Elya Shechtman , Alexei A. Efros
IPC: G06T11/00 , G06F3/04842 , G06F3/04845 , G06T11/20
CPC classification number: G06T11/001 , G06F3/04842 , G06F3/04845 , G06T11/20
Abstract: Systems and methods for image generation are described. Embodiments of the present disclosure obtain user input that indicates a target color and a semantic label for a region of an image to be generated. The system also generates of obtains a noise map including noise biased towards the target color in the region indicated by the user input. A diffusion model generates the image based on the noise map and the semantic label for the region. The image can include an object in the designated region that is described by the semantic label and that has the target color.
-
公开(公告)号:US11983628B2
公开(公告)日:2024-05-14
申请号:US17468487
申请日:2021-09-07
Applicant: Adobe Inc.
Inventor: Wei-An Lin , Baldo Faieta , Cameron Smith , Elya Shechtman , Jingwan Lu , Jun-Yan Zhu , Niloy Mitra , Ratheesh Kalarot , Richard Zhang , Shabnam Ghadar , Zhixin Shu
IPC: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/21 , G06F18/211 , G06F18/214 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/02 , G06T3/18 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/20 , G06T5/77 , G06T11/00 , G06T11/60
CPC classification number: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/211 , G06F18/214 , G06F18/2163 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/02 , G06T3/18 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/20 , G06T5/77 , G06T11/001 , G06T11/60 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221 , G06T2210/22
Abstract: Systems and methods dynamically adjust an available range for editing an attribute in an image. An image editing system computes a metric for an attribute in an input image as a function of a latent space representation of the input image and a filtering vector for editing the input image. The image editing system compares the metric to a threshold. If the metric exceeds the threshold, then the image editing system selects a first range for editing the attribute in the input image. If the metric does not exceed the threshold, a second range is selected. The image editing system causes display of a user interface for editing the input image comprising an interface element for editing the attribute within the selected range.
-
公开(公告)号:US20240087265A1
公开(公告)日:2024-03-14
申请号:US17942101
申请日:2022-09-09
Applicant: ADOBE INC.
Inventor: Taesung Park , Richard Zhang , Elya Schechtman
IPC: G06T19/20 , G06F3/04847 , G06F40/289 , G06V10/774 , G06V10/776 , G06V10/82
CPC classification number: G06T19/20 , G06F3/04847 , G06F40/289 , G06V10/774 , G06V10/776 , G06V10/82 , G06F40/284 , G06T2200/24 , G06T2210/61
Abstract: Various disclosed embodiments are directed to changing parameters of an input image or multidimensional representation of the input image based on a user request to change such parameters. An input image is first received. A multidimensional image that represents the input image in multiple dimensions is generated via a model. A request to change at least a first parameter to a second parameter is received via user input at a user device. Such request is a request to edit or generate the multidimensional image in some way. For instance, the request may be to change the light source position or camera position from a first set of coordinates to a second set of coordinates.
-
公开(公告)号:US11893763B2
公开(公告)日:2024-02-06
申请号:US18058163
申请日:2022-11-22
Applicant: Adobe Inc.
Inventor: Taesung Park , Richard Zhang , Oliver Wang , Junyan Zhu , Jingwan Lu , Elya Shechtman , Alexei A Efros
CPC classification number: G06T9/002 , G06N3/08 , G06T3/4046 , G06T2200/24 , G06T2210/36 , G06T2219/2024
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating a modified digital image from extracted spatial and global codes. For example, the disclosed systems can utilize a global and spatial autoencoder to extract spatial codes and global codes from digital images. The disclosed systems can further utilize the global and spatial autoencoder to generate a modified digital image by combining extracted spatial and global codes in various ways for various applications such as style swapping, style blending, and attribute editing.
-
66.
公开(公告)号:US20230342893A1
公开(公告)日:2023-10-26
申请号:US17660090
申请日:2022-04-21
Applicant: Adobe Inc.
Inventor: Tobias Hinz , Shabnam Ghadar , Richard Zhang , Ratheesh Kalarot , Jingwan Lu , Elya Shechtman
CPC classification number: G06T5/50 , G06T11/60 , G06V10/82 , G06T2207/20221 , G06T2207/30201
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for combining digital images. In particular, in one or more embodiments, the disclosed systems combine latent codes of a source digital image and a target digital image utilizing a blending network to determine a combined latent encoding and generate a combined digital image from the combined latent encoding utilizing a generative neural network. In some embodiments, the disclosed systems determine an intersection face mask between the source digital image and the combined digital image utilizing a face segmentation network and combine the source digital image and the combined digital image utilizing the intersection face mask to generate a blended digital image.
-
67.
公开(公告)号:US11615292B2
公开(公告)日:2023-03-28
申请号:US17899936
申请日:2022-08-31
Applicant: Adobe Inc.
Inventor: Richard Zhang , Sylvain Philippe Paris , Junyan Zhu , Aaron Phillip Hertzmann , Jacob Minyoung Huh
Abstract: A target image is projected into a latent space of generative model by determining a latent vector by applying a gradient-free technique and a class vector by applying a gradient-based technique. An image is generated from the latent and class vectors, and a loss function is used to determine a loss between the target image and the generated image. This determining of the latent vector and the class vector, generating an image, and using the loss function is repeated until a loss condition is satisfied. In response to the loss condition being satisfied, the latent and class vectors that resulted in the loss condition being satisfied are identified as the final latent and class vectors, respectively. The final latent and class vectors are provided to the generative model and multiple weights of the generative model are adjusted to fine-tune the generative model.
-
公开(公告)号:US11508148B2
公开(公告)日:2022-11-22
申请号:US16822878
申请日:2020-03-18
Applicant: Adobe Inc.
Inventor: Yijun Li , Zhifei Zhang , Richard Zhang , Jingwan Lu
Abstract: The present disclosure relates to systems, computer-implemented methods, and non-transitory computer readable medium for automatically transferring makeup from a reference face image to a target face image using a neural network trained using semi-supervised learning. For example, the disclosed systems can receive, at a neural network, a target face image and a reference face image, where the target face image is selected by a user via a graphical user interface (GUI) and the reference face image has makeup. The systems transfer, by the neural network, the makeup from the reference face image to the target face image, where the neural network is trained to transfer the makeup from the reference face image to the target face image using semi-supervised learning. The systems output for display the makeup on the target face image.
-
公开(公告)号:US20220254071A1
公开(公告)日:2022-08-11
申请号:US17163284
申请日:2021-01-29
Applicant: Adobe Inc.
Inventor: Utkarsh Ojha , Yijun Li , Richard Zhang , Jingwan Lu , Elya Shechtman , Alexei A. Efros
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately and efficiently modifying a generative adversarial neural network using few-shot adaptation to generate digital images corresponding to a target domain while maintaining diversity of a source domain and realism of the target domain. In particular, the disclosed systems utilize a generative adversarial neural network with parameters learned from a large source domain. The disclosed systems preserve relative similarities and differences between digital images in the source domain using a cross-domain distance consistency loss. In addition, the disclosed systems utilize an anchor-based strategy to encourage different levels or measures of realism over digital images generated from latent vectors in different regions of a latent space.
-
公开(公告)号:US20210358177A1
公开(公告)日:2021-11-18
申请号:US16874399
申请日:2020-05-14
Applicant: Adobe Inc.
Inventor: Taesung Park , Richard Zhang , Oliver Wang , Junyan Zhu , Jingwan Lu , Elya Shechtman , Alexei A Efros
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating a modified digital image from extracted spatial and global codes. For example, the disclosed systems can utilize a global and spatial autoencoder to extract spatial codes and global codes from digital images. The disclosed systems can further utilize the global and spatial autoencoder to generate a modified digital image by combining extracted spatial and global codes in various ways for various applications such as style swapping, style blending, and attribute editing.
-
-
-
-
-
-
-
-
-