Learning parameters for an image harmonization neural network to generate deep harmonized digital images

    公开(公告)号:US12299844B2

    公开(公告)日:2025-05-13

    申请号:US18440248

    申请日:2024-02-13

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately, efficiently, and flexibly generating harmonized digital images utilizing a self-supervised image harmonization neural network. In particular, the disclosed systems can implement, and learn parameters for, a self-supervised image harmonization neural network to extract content from one digital image (disentangled from its appearance) and appearance from another from another digital image (disentangled from its content). For example, the disclosed systems can utilize a dual data augmentation method to generate diverse triplets for parameter learning (including input digital images, reference digital images, and pseudo ground truth digital images), via cropping a digital image with perturbations using three-dimensional color lookup tables (“LUTs”). Additionally, the disclosed systems can utilize the self-supervised image harmonization neural network to generate harmonized digital images that depict content from one digital image having the appearance of another digital image.

    UTILIZING MACHINE LEARNING MODELS TO GENERATE REFINED DEPTH MAPS WITH SEGMENTATION MASK GUIDANCE

    公开(公告)号:US20230326028A1

    公开(公告)日:2023-10-12

    申请号:US17658873

    申请日:2022-04-12

    Applicant: Adobe Inc.

    CPC classification number: G06T7/11 G06T2207/20084 G06T7/50 G06T7/215

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing machine learning models to generate refined depth maps of digital images utilizing digital segmentation masks. In particular, in one or more embodiments, the disclosed systems generate a depth map for a digital image utilizing a depth estimation machine learning model, determine a digital segmentation mask for the digital image, and generate a refined depth map from the depth map and the digital segmentation mask utilizing a depth refinement machine learning model. In some embodiments, the disclosed systems generate first and second intermediate depth maps using the digital segmentation mask and an inverse digital segmentation mask and merger the first and second intermediate depth maps to generate the refined depth map.

    Detecting digital objects and generating object masks on device

    公开(公告)号:US12272127B2

    公开(公告)日:2025-04-08

    申请号:US17589114

    申请日:2022-01-31

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generates object masks for digital objects portrayed in digital images utilizing a detection-masking neural network pipeline. In particular, in one or more embodiments, the disclosed systems utilize detection heads of a neural network to detect digital objects portrayed within a digital image. In some cases, each detection head is associated with one or more digital object classes that are not associated with the other detection heads. Further, in some cases, the detection heads implement multi-scale synchronized batch normalization to normalize feature maps across various feature levels. The disclosed systems further utilize a masking head of the neural network to generate one or more object masks for the detected digital objects. In some cases, the disclosed systems utilize post-processing techniques to filter out low-quality masks.

    Reconstructing three-dimensional scenes portrayed in digital images utilizing point cloud machine-learning models

    公开(公告)号:US11443481B1

    公开(公告)日:2022-09-13

    申请号:US17186522

    申请日:2021-02-26

    Applicant: Adobe Inc.

    Abstract: This disclosure describes implementations of a three-dimensional (3D) scene recovery system that reconstructs a 3D scene representation of a scene portrayed in a single digital image. For instance, the 3D scene recovery system trains and utilizes a 3D point cloud model to recover accurate intrinsic camera parameters from a depth map of the digital image. Additionally, the 3D point cloud model may include multiple neural networks that target specific intrinsic camera parameters. For example, the 3D point cloud model may include a depth 3D point cloud neural network that recovers the depth shift as well as include a focal length 3D point cloud neural network that recovers the camera focal length. Further, the 3D scene recovery system may utilize the recovered intrinsic camera parameters to transform the single digital image into an accurate and realistic 3D scene representation, such as a 3D point cloud.

    TRANSFORMER-BASED IMAGE SEGMENTATION ON MOBILE DEVICES

    公开(公告)号:US20240281978A1

    公开(公告)日:2024-08-22

    申请号:US18170336

    申请日:2023-02-16

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for generating segmentation masks for a digital visual media item. In particular, in one or more embodiments, the disclosed systems generate, utilizing a neural network encoder, high-level features of a digital visual media item. Further, the disclosed systems generate, utilizing the neural network encoder, low-level features of the digital visual media item. In some implementations, the disclosed systems generate, utilizing a neural network decoder, an initial segmentation mask of the digital visual media item from the low-level features. Moreover, the disclosed systems generate, utilizing the neural network decoder, a refined segmentation mask of the digital visual media item from the initial segmentation mask and the high-level features.

    DETECTING DIGITAL OBJECTS AND GENERATING OBJECT MASKS ON DEVICE

    公开(公告)号:US20230128792A1

    公开(公告)日:2023-04-27

    申请号:US17589114

    申请日:2022-01-31

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generates object masks for digital objects portrayed in digital images utilizing a detection-masking neural network pipeline. In particular, in one or more embodiments, the disclosed systems utilize detection heads of a neural network to detect digital objects portrayed within a digital image. In some cases, each detection head is associated with one or more digital object classes that are not associated with the other detection heads. Further, in some cases, the detection heads implement multi-scale synchronized batch normalization to normalize feature maps across various feature levels. The disclosed systems further utilize a masking head of the neural network to generate one or more object masks for the detected digital objects. In some cases, the disclosed systems utilize post-processing techniques to filter out low-quality masks.

    UTILIZING A SEGMENTATION NEURAL NETWORK TO PROCESS INITIAL OBJECT SEGMENTATIONS AND OBJECT USER INDICATORS WITHIN A DIGITAL IMAGE TO GENERATE IMPROVED OBJECT SEGMENTATIONS

    公开(公告)号:US20220198671A1

    公开(公告)日:2022-06-23

    申请号:US17126986

    申请日:2020-12-18

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods that utilize a deep neural network to process object user indicators and an initial object segmentation from a digital image to efficiently and flexibly generate accurate object segmentations. In particular, the disclosed systems can determine an initial object segmentation for the digital image (e.g., utilizing an object segmentation model or interactive selection processes). In addition, the disclosed systems can identify an object user indicator for correcting the initial object segmentation and generate a distance map reflecting distances between pixels of the digital image and the object user indicator. The disclosed systems can generate an image-interaction-segmentation triplet by combining the digital image, the initial object segmentation, and the distance map. By processing the image-interaction-segmentation triplet utilizing the segmentation neural network, the disclosed systems can provide an updated object segmentation for display to a client device.

Patent Agency Ranking