Patent search ap:("QUALCOMM Incorporated") AND inv:"Juntae LEE" Page 2

11.

发明公开
SYSTEMS AND METHODS OF IMAGE PROCESSING BASED ON GAZE DETECTION 审中-公开

公开(公告)号：US20230281885A1

公开(公告)日：2023-09-07

申请号：US17685278

申请日：2022-03-02

Applicant: QUALCOMM Incorporated

Inventor： Hyunsin PARK , Juntae LEE , Simyung CHANG , Byeonggeun KIM , Jaewon CHOI , Kyu Woong HWANG

IPC: G06T11/00 , G06V40/18 , G06F3/01 , G06V40/16

CPC classification number: G06T11/00 , G06F3/013 , G06V40/174 , G06V40/18

Abstract: Imaging systems and techniques are described. An imaging system receives image data representing at least a portion (e.g., a face) of a first user as captured by a first image sensor. The imaging system identifies that a gaze of the first user as represented in the image data is directed toward a displayed representation of at least a portion (e.g., a face) of a second user. The imaging system identifies an arrangement of representations of users for output. The imaging system generates modified image data based on the gaze and the arrangement at least in part by modifying the image data to modify at least the portion of the first user in the image data to be visually directed toward a direction corresponding to the second user based on the gaze and the arrangement. The imaging system outputs the modified image data arranged according to the arrangement.

12.

发明申请
RELAXED INSTANCE FREQUENCY NORMALIZATION FOR NEURAL-NETWORK-BASED AUDIO PROCESSING 有权

公开(公告)号：US20230119791A1

公开(公告)日：2023-04-20

申请号：US17937765

申请日：2022-10-03

Applicant: QUALCOMM Incorporated

Inventor： Byeonggeun KIM , Seunghan YANG , Hyunsin PARK , Juntae LEE , Simyung CHANG

IPC: G10L21/034 , G10L17/18 , G10L25/30 , G10L25/51 , G10L17/04

Abstract: Techniques and apparatus for training a neural network to classify audio into one of a plurality of categories and using such a trained neural network. An example method generally includes receiving a data set including a plurality of audio samples. A relaxed feature-normalized data set is generated by normalizing each audio sample of the plurality of audio samples. A neural network is trained to classify audio into one of a plurality of categories based on the relaxed feature-normalized data set, and the trained neural network is deployed.

13.

发明申请
MULTI-MODAL REPRESENTATION BASED EVENT LOCALIZATION 有权

公开(公告)号：US20220101087A1

公开(公告)日：2022-03-31

申请号：US17405879

申请日：2021-08-18

Applicant: QUALCOMM Incorporated

Inventor： Juntae LEE , Mihir JAIN , Sungrack YUN , Hyoungwoo PARK , Kyu Woong HWANG

IPC: G06N3/04 , G06K9/62

Abstract: A method performed by an artificial neural network (ANN) includes determining, at a first stage of a multi-stage cross-attention model of the ANN, a first cross-correlation between a first representation of each modality of a number of modalities associated with a sequence of inputs. The method still further includes determining, at each second stage of one or more second stages of the multi-stage cross-attention model, a second cross-correlation between first attended representations of each modality. The method also includes generating a concatenated feature representation associated with a final second stage of the one or more second stages based on the second cross-correlation associated with the final second stage, the first attended representation of each modality, and the first representation of each modality. The method further includes determining a probability distribution between a set of background actions and a set of foreground actions from the concatenated feature representation. The method still further includes localizing an action in the sequence of inputs based on the probability distribution.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification