Patent search ap:("Adobe Inc.") AND inv:"Deepali Aneja" Page 1

1.

发明申请
Curve Generation for Sketch Vectorization 有权

公开(公告)号：US20230137233A1

公开(公告)日：2023-05-04

申请号：US17519357

申请日：2021-11-04

Applicant: Adobe Inc.

Inventor： Ashwani Chandil , Vineet Batra , Matthew David Fisher , Deepali Aneja , Ankit Phogat

IPC: G06T11/20 , G06K9/46 , G06T7/12 , G06T7/194 , G06T7/181 , G06T3/40

Abstract: Generating a vector representation of a hand-drawn sketch is described. To do so, the sketch is segmented into different superpixel regions. Superpixels are grown by distributing superpixel seeds throughout an image of the sketch and assigning unassigned pixels to a neighboring superpixel based on pixel value differences. The border between each pair of adjacent superpixels is then classified as either an active or an inactive boundary, with active boundaries indicating that the border corresponds to a salient sketch stroke. Vector paths are generated by traversing edges between pixel vertices along the active boundaries. To minimize vector paths included in the vector representation, vector paths are greedily generated first for longer curves along active boundaries until each edge is assigned to a vector path. Regions encompassed by vector paths corresponding to a foreground superpixel are filled to produce a high-fidelity vector representation of the sketch.

2.

发明授权
Using machine-learning models to determine movements of a mouth corresponding to live speech 有权

公开(公告)号：US11211060B2

公开(公告)日：2021-12-28

申请号：US16887418

申请日：2020-05-29

Applicant: Adobe Inc.

Inventor： Wilmot Li , Jovan Popovic , Deepali Aneja , David Simons

IPC: G10L15/197 , G06N3/04 , G06N3/08 , G10L15/02 , G10L15/06 , G10L21/0316 , G10L25/21 , G10L25/24

Abstract: Disclosed systems and methods predict visemes from an audio sequence. In an example, a viseme-generation application accesses a first audio sequence that is mapped to a sequence of visemes. The first audio sequence has a first length and represents phonemes. The application adjusts a second length of a second audio sequence such that the second length equals the first length and represents the phonemes. The application adjusts the sequence of visemes to the second audio sequence such that phonemes in the second audio sequence correspond to the phonemes in the first audio sequence. The application trains a machine-learning model with the second audio sequence and the sequence of visemes. The machine-learning model predicts an additional sequence of visemes based on an additional sequence of audio.

3.

发明申请
USING MACHINE-LEARNING MODELS TO DETERMINE MOVEMENTS OF A MOUTH CORRESPONDING TO LIVE SPEECH 审中-公开

公开(公告)号：US20190392823A1

公开(公告)日：2019-12-26

申请号：US16016418

申请日：2018-06-22

Applicant: Adobe Inc.

Inventor： Wilmot Li , Jovan Popovic , Deepali Aneja , David Simons

IPC: G10L15/197 , G06N3/08 , G06N3/04 , G10L15/06 , G10L15/02 , G10L25/24 , G10L25/21 , G10L21/0316

Abstract: Disclosed systems and methods predict visemes from an audio sequence. A viseme-generation application accesses a first set of training data that includes a first audio sequence representing a sentence spoken by a first speaker and a sequence of visemes. Each viseme is mapped to a respective audio sample of the first audio sequence. The viseme-generation application creates a second set of training data adjusting a second audio sequence spoken by a second speaker speaking the sentence such that the second and first sequences have the same length and at least one phoneme occurs at the same time stamp in the first sequence and in the second sequence. The viseme-generation application maps the sequence of visemes to the second audio sequence and trains a viseme prediction model to predict a sequence of visemes from an audio sequence.

4.

发明申请
Template-Based Behaviors in Machine Learning 有权

公开(公告)号：US20250111695A1

公开(公告)日：2025-04-03

申请号：US18543234

申请日：2023-12-18

Applicant: Adobe Inc.

Inventor： Wilmot Wei-Mau Li , Li-Yi Wei , Cuong D. Nguyen , Jakub Fiser , Hijung Shin , Stephen Joseph DiVerdi , Seth John Walker , Kazi Rubaiat Habib , Deepali Aneja , David Gilliaert Werner , Erica K. Schisler

IPC: G06V40/16 , G06V40/20

Abstract: In implementation of techniques for template-based behaviors in machine learning, a computing device implements a template system to receive a digital video and data executable to generate animated content. The template system determines a location within a frame of the digital video to place the animated content using a machine learning model. The template system then renders the animated content within the frame of the digital video at the location determined by the machine learning model. The template system then displays the rendered animated content within the frame of the digital video in a user interface.

5.

发明授权
Articulated part extraction from sprite sheets using machine learning 有权

公开(公告)号：US11875442B1

公开(公告)日：2024-01-16

申请号：US17829120

申请日：2022-05-31

Applicant: Adobe Inc. , University of Massachusetts

Inventor： Matthew David Fisher , Zhan Xu , Yang Zhou , Deepali Aneja , Evangelos Kalogerakis

IPC: G06T13/80 , G06V10/762 , G06T13/40 , G06V10/774 , G06T7/246 , G06V10/77

CPC classification number: G06T13/80 , G06T7/251 , G06T13/40 , G06V10/762 , G06V10/7715 , G06V10/7747

Abstract: Embodiments are disclosed for articulated part extraction using images of animated characters from sprite sheets by a digital design system. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including a plurality of images depicting an animated character in different poses. The disclosed systems and methods further comprise, for each pair of images in the plurality of images, determining, by a first machine learning model, pixel correspondences between pixels of the pair of images, and determining, by a second machine learning model, pixel clusters representing the animated character, each pixel cluster corresponding to a different structural segment of the animated character. The disclosed systems and methods further comprise selecting a subset of clusters that reconstructs the different poses of the animated character. The disclosed systems and methods further comprise creating a rigged animated character based on the selected subset of clusters.

6.

发明授权
Image-to-image translation using an auto-regressive generative model 有权

公开(公告)号：US11663763B2

公开(公告)日：2023-05-30

申请号：US17452217

申请日：2021-10-25

Applicant: Adobe Inc.

Inventor： Matthew David Fisher , Vineet Batra , Sumit Dhingra , Praveen Kumar Dhanuka , Deepali Aneja , Ankit Phogat

IPC: G06T9/00 , G06T11/60 , G06T5/50 , G06T11/00 , G06T11/20

CPC classification number: G06T11/60 , G06T5/50 , G06T9/002 , G06T11/001 , G06T11/203 , G06T2207/20081 , G06T2207/20084

Abstract: A computer-implemented method including receiving an input image at a first image stage and receiving a request to generate a plurality of variations of the input image at a second image stage. The method including generating, using an auto-regressive generative deep learning model, the plurality of variations of the input image at the second image stage and outputting the plurality of variations of the input image at the second image stage.

7.

发明授权
Video editing using transcript text stylization and layout 有权

公开(公告)号：US12206930B2

公开(公告)日：2025-01-21

申请号：US18154412

申请日：2023-01-13

Applicant: Adobe Inc.

Inventor： Kim Pascal Pimmel , Stephen Joseph Diverdi , Jiaju MA , Rubaiat Habib , Li-Yi Wei , Hijung Shin , Deepali Aneja , John G. Nelson , Wilmot Li , Dingzeyu Li , Lubomira Assenova Dontcheva , Joel Richard Brandt

IPC: H04N21/431 , G06F3/04812 , G06F3/0482 , H04N21/4402

Abstract: Embodiments of the present disclosure provide, a method, a system, and a computer storage media that provide mechanisms for multimedia effect addition and editing support for text-based video editing tools. The method includes generating a user interface (UI) displaying a transcript of an audio track of a video and receiving, via the UI, input identifying selection of a text segment from the transcript. The method also includes in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment. The method further includes identifying a video effect corresponding to the particular type of text stylization or layout, applying the video effect to a video segment corresponding to the text segment, and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.

8.

发明授权
Re-timing a video sequence to an audio sequence based on motion and audio beat detection 有权

公开(公告)号：US11682238B2

公开(公告)日：2023-06-20

申请号：US17175441

申请日：2021-02-12

Applicant: Adobe Inc.

Inventor： Jimei Yang , Deepali Aneja , Dingzeyu Li , Jun Saito , Yang Zhou

IPC: G06V40/20 , G06T7/215 , G06V20/40 , G06V40/10 , H04N5/06 , H04N21/8547 , G11B27/031 , G10H1/36 , G11B27/10 , H04N21/845

CPC classification number: G06V40/23 , G06T7/215 , G06V20/41 , G06V20/46 , G06V40/103 , H04N5/06 , H04N21/8456 , H04N21/8547

Abstract: Embodiments are disclosed for re-timing a video sequence to an audio sequence based on the detection of motion beats in the video sequence and audio beats in the audio sequence. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving a first input, the first input including a video sequence, detecting motion beats in the video sequence, receiving a second input, the second input including an audio sequence, detecting audio beats in the audio sequence, modifying the video sequence by matching the detected motions beats in the video sequence to the detected audio beats in the audio sequence, and outputting the modified video sequence.

9.

发明公开
Stroke-Guided Sketch Vectorization 审中-公开

公开(公告)号：US20230162413A1

公开(公告)日：2023-05-25

申请号：US17530760

申请日：2021-11-19

Applicant: Adobe Inc.

Inventor： Vineet Batra , Matthew David Fisher , Deepali Aneja , Ashwani Chandil , Ankit Phogat

IPC: G06T11/20 , G06T7/30 , G06T5/00 , G06V10/75

CPC classification number: G06T11/203 , G06T7/30 , G06T5/002 , G06V10/751 , G06T2207/20081

Abstract: A stroke-guided vectorization system is described that generates, from an input sketch and guide image depicting an approximate vector representation of the sketch, an aligned guide image depicting an improved vector representation of the sketch. To do so, the stroke-guided vectorization system determines black levels representing a vector stroke in the input sketch and white levels representing a background in the input sketch. The stroke-guided vectorization system determines a black threshold value and a white threshold value for discrete portions of the aligned guide image using subsets of the black levels and subsets of the white levels determined using the input sketch. Each discrete portion of the aligned guide image is then mapped to a vector stroke or a background based on the black threshold value and the white threshold value of the portion.

10.

发明授权
Using machine-learning models to determine movements of a mouth corresponding to live speech 有权

公开(公告)号：US10699705B2

公开(公告)日：2020-06-30

申请号：US16016418

申请日：2018-06-22

Applicant: Adobe Inc.

Inventor： Wilmot Li , Jovan Popovic , Deepali Aneja , David Simons

IPC: G10L15/197 , G06N3/04 , G06N3/08 , G10L15/02 , G10L15/06 , G10L21/0316 , G10L25/21 , G10L25/24

Abstract: Disclosed systems and methods predict visemes from an audio sequence. A viseme-generation application accesses a first set of training data that includes a first audio sequence representing a sentence spoken by a first speaker and a sequence of visemes. Each viseme is mapped to a respective audio sample of the first audio sequence. The viseme-generation application creates a second set of training data adjusting a second audio sequence spoken by a second speaker speaking the sentence such that the second and first sequences have the same length and at least one phoneme occurs at the same time stamp in the first sequence and in the second sequence. The viseme-generation application maps the sequence of visemes to the second audio sequence and trains a viseme prediction model to predict a sequence of visemes from an audio sequence.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification