Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Yinyi Guo" Page 2

11.

发明授权
Systems, methods, apparatus, and computer-readable media for pitch trajectory analysis 有权
Title translation: 用于俯仰轨迹分析的系统，方法，装置和计算机可读介质

公开(公告)号：US09305570B2

公开(公告)日：2016-04-05

申请号：US13840863

申请日：2013-03-15

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Yinyi Guo , Lae-Hoon Kim , Pei Xiang

IPC: G10L21/00 , G10L25/93 , G10L25/90 , G10L21/02 , G10H1/00 , G10H1/36

CPC classification number: G10L25/90 , G10H1/0008 , G10H1/361 , G10H2210/056 , G10H2210/066 , G10H2210/211 , G10H2250/131 , G10H2250/215 , G10H2250/225 , G10H2250/251

Abstract: Systems, methods, and apparatus for pitch trajectory analysis are described. Such techniques may be used to remove vocals and/or vibrato from an audio mixture signal. For example, such a technique may be used to pre-process the signal before an operation to decompose the mixture signal into individual instrument components.

Abstract translation: 描述了用于俯仰轨迹分析的系统，方法和装置。这样的技术可以用于从音频混合信号中去除声音和/或颤音。例如，这样的技术可以用于在将混合信号分解成单独的仪器部件的操作之前预处理信号。

12.

发明授权
Adaptive sound event classification 有权

公开(公告)号：US11410677B2

公开(公告)日：2022-08-09

申请号：US17102724

申请日：2020-11-24

Applicant: QUALCOMM Incorporated

Inventor： Fatemeh Saki , Yinyi Guo , Erik Visser

IPC: H04R29/00 , G10L25/51 , G06N20/00 , G06N5/04 , H04R1/08

Abstract: A device includes one or more processors configured to provide audio data samples to a sound event classification model. The one or more processors are also configured to determine, based on an output of the sound event classification model responsive to the audio data samples, whether a sound class of with the audio data samples was recognized by the sound event classification model. The one or more processors are further configured to, based on a determination that the sound class was not recognized, determine whether the sound event classification model corresponds to an audio scene associated with the audio data samples. The one or more processors are also configured to, based on a determination that the sound event classification model corresponds to the audio scene associated with the audio data samples, store model update data based on the audio data samples.

13.

发明申请
ENHANCED SPEECH GENERATION 审中-公开

公开(公告)号：US20190251971A1

公开(公告)日：2019-08-15

申请号：US16396311

申请日：2019-04-26

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Shuhua Zhang , Lae-Hoon Kim , Yinyi Guo , Sunkuk Moon

IPC: G10L15/26 , G10L25/48 , G10L13/047 , G10L21/00

CPC classification number: G10L15/26 , G10L13/047 , G10L21/00 , G10L21/003 , G10L25/48

Abstract: In a particular aspect, a speech generator includes a signal input configured to receive a first audio signal. The speech generator also includes at least one speech signal processor configured to generate a second audio signal based on information associated with the first audio signal and based further on automatic speech recognition (ASR) data associated with the first audio signal.

14.

发明授权
Audio processing for an acoustical environment 有权

公开(公告)号：US09881619B2

公开(公告)日：2018-01-30

申请号：US15081664

申请日：2016-03-25

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Wenliang Lu , Lae-Hoon Kim , Yinyi Guo , Shuhua Zhang

IPC: G10L19/02 , G10L19/002 , G10L25/30 , G10L25/48 , G10L21/00

CPC classification number: G10L19/002 , G01S3/803 , G01S3/8083 , G01S5/18 , G10L25/30 , G10L25/48 , H04R3/005 , H04R2499/13

Abstract: An apparatus for detecting a sound in an acoustical environment includes a microphone array configured to detect an audio signal in the acoustical environment. The apparatus also includes a processor configured to determine an angular location of a sound source of the audio signal. The angular location is relative to the microphone array. The processor is also configured to determine at least one reverberation characteristic of the audio signal. The processor is further configured to determine a distance, relative to the microphone array, of the sound source along an axis associated with the angular location based on the at least one reverberation characteristic.

15.

发明申请
SUPPRESSING OR REDUCING EFFECTS OF WIND TURBULENCE 有权

公开(公告)号：US20170353809A1

公开(公告)日：2017-12-07

申请号：US15170771

申请日：2016-06-01

Applicant: QUALCOMM Incorporated

Inventor： Shuhua Zhang , Erik Visser , Lae-Hoon Kim , Raghuveer Peri , Yinyi Guo

IPC: H04R29/00 , H04R3/00

CPC classification number: H04R29/008 , G10L21/0208 , G10L21/0316 , G10L2021/02161 , H04R3/005 , H04R29/004 , H04R2410/07

Abstract: A method of operation of a device includes receiving an input signal at the device. The input signal is generated using at least one microphone. The input signal includes a first signal component having a first amount of wind turbulence noise and a second signal component having a second amount of wind turbulence noise that is greater than the first amount of wind turbulence noise. The method further includes generating, based on the input signal, an output signal at the device. The output signal includes the first signal component and a third signal component that replaces the second signal component. A first frequency response of the input signal corresponds to a second frequency response of the output signal.

16.

发明申请
AUDIO PROCESSING FOR AN ACOUSTICAL ENVIRONMENT 有权

公开(公告)号：US20170278519A1

公开(公告)日：2017-09-28

申请号：US15081664

申请日：2016-03-25

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Wenliang Lu , Lae-Hoon Kim , Yinyi Guo , Shuhua Zhang

IPC: G10L19/002 , G10L25/48 , G10L25/30

CPC classification number: G10L19/002 , G01S3/803 , G01S3/8083 , G01S5/18 , G10L25/30 , G10L25/48 , H04R3/005 , H04R2499/13

Abstract: An apparatus for detecting a sound in an acoustical environment includes a microphone array configured to detect an audio signal in the acoustical environment. The apparatus also includes a processor configured to determine an angular location of a sound source of the audio signal. The angular location is relative to the microphone array. The processor is also configured to determine at least one reverberation characteristic of the audio signal. The processor is further configured to determine a distance, relative to the microphone array, of the sound source along an axis associated with the angular location based on the at least one reverberation characteristic.

17.

发明申请
LIVENESS DETERMINATION BASED ON SENSOR SIGNALS 有权

公开(公告)号：US20170220786A1

公开(公告)日：2017-08-03

申请号：US15013889

申请日：2016-02-02

Applicant: QUALCOMM Incorporated

Inventor： Yinyi Guo , Minho Jin , JunCheol Cho , Yongwoo Cho , Lae-Hoon Kim , Erik Visser , Shuhua Zhang

IPC: G06F21/32 , G06F21/36

CPC classification number: G06F21/32 , G06F21/316 , G06F21/36 , G06F2221/2133 , H04L63/0861 , H04W12/06

Abstract: A device includes a memory and a processor. The memory is configured to store a threshold. The processor is configured to authenticate a user based on authentication data. The processor is also configured to, in response to determining that the user is authenticated, generate a correlation score indicating a correlation between a first signal received from a first sensor and a second signal received from a second sensor. The processor is also configured to determine liveness of the user based on a comparison of the correlation score and the threshold.

18.

发明申请
SYSTEMS AND METHODS FOR SPEAKER DICTIONARY BASED SPEECH MODELING 有权
Title translation: 基于语音基础的语音建模系统与方法

公开(公告)号：US20150243284A1

公开(公告)日：2015-08-27

申请号：US14629109

申请日：2015-02-23

Applicant: QUALCOMM Incorporated

Inventor： Yinyi Guo , Juhan Nam , Erik Visser , Shuhua Zhang , Lae-Hoon Kim

IPC: G10L15/20 , G10L15/06

CPC classification number: G10L15/20 , G10L15/06 , G10L21/0208 , G10L21/028

Abstract: A method for speech modeling by an electronic device is described. The method includes obtaining a real-time noise reference based on a noisy speech signal. The method also includes obtaining a real-time noise dictionary based on the real-time noise reference. The method further includes obtaining a first speech dictionary and a second speech dictionary. The method additionally includes reducing residual noise based on the real-time noise dictionary and the first speech dictionary to produce a residual noise-suppressed speech signal at a first modeling stage. The method also includes generating a reconstructed speech signal based on the residual noise-suppressed speech signal and the second speech dictionary at a second modeling stage.

Abstract translation: 描述了一种由电子设备进行语音建模的方法。该方法包括基于噪声语音信号获得实时噪声参考。该方法还包括基于实时噪声参考获得实时噪声字典。该方法还包括获得第一语音词典和第二语音词典。该方法还包括基于实时噪声字典和第一语音字典减少残留噪声，以在第一建模阶段产生残留噪声抑制语音信号。该方法还包括在第二建模阶段基于剩余噪声抑制语音信号和第二语音词典生成重构语音信号。

19.

发明申请
SYSTEMS AND METHODS FOR AUDIO SIGNAL PROCESSING 审中-公开
Title translation: 用于音频信号处理的系统和方法

公开(公告)号：US20130282372A1

公开(公告)日：2013-10-24

申请号：US13828158

申请日：2013-03-14

Applicant: QUALCOMM INCORPORATED

Inventor： Erik Visser , Lae-Hoon Kim , Yinyi Guo , Juhan Nam

IPC: G10L15/20

CPC classification number: G10L21/0208 , G10L15/20 , G10L21/0316 , G10L25/93 , G10L2021/02165

Abstract: A method for detecting voice activity by an electronic device is described. The method includes detecting near end speech based on a near end voiced speech detector and at least one single channel voice activity detector. The near end voiced speech detector is associated with a harmonic statistic based on a speech pitch histogram.

Abstract translation: 描述了一种用于由电子设备检测语音活动的方法。该方法包括基于近端浊音语音检测器和至少一个单声道语音活动检测器检测近端语音。近端浊音语音检测器与基于语音音调直方图的谐波统计量相关联。

20.

发明授权
Sound event detection learning 有权

公开(公告)号：US11664044B2

公开(公告)日：2023-05-30

申请号：US17102797

申请日：2020-11-24

Applicant: QUALCOMM Incorporated

Inventor： Fatemeh Saki , Yinyi Guo , Erik Visser , Eunjeong Koh

IPC: G10L25/78 , G10L25/30 , G10L25/24 , H04R5/033 , B60R11/02 , G06N3/084 , G06N3/045

CPC classification number: G10L25/78 , B60R11/0247 , G06N3/045 , G06N3/084 , G10L25/24 , G10L25/30 , H04R5/033 , H04R2499/13

Abstract: A device includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound classes. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound classes. A second count of classes of the second set of sound classes is greater than a first count of classes of the first set of sound classes. The processor is also configured to provide the first output to a neural adapter to generate a third output corresponding to the second set of sound classes. The processor is further configured to provide the second output and the third output to a merger adapter to generate sound event identification data based on the audio data samples.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification