-
公开(公告)号:US12244994B2
公开(公告)日:2025-03-04
申请号:US17814660
申请日:2022-07-25
Applicant: QUALCOMM Incorporated
Inventor: Erik Visser , Fatemeh Saki , Yinyi Guo , Lae-Hoon Kim , Rogerio Guedes Alves , Hannes Pessentheiner
Abstract: A first device includes a memory configured to store instructions and one or more processors configured to receive audio signals from multiple microphones. The one or more processors are configured to process the audio signals to generate direction-of-arrival information corresponding to one or more sources of sound represented in one or more of the audio signals. The one or more processors are also configured to and send, to a second device, data based on the direction-of-arrival information and a class or embedding associated with the direction-of-arrival information.
-
公开(公告)号:US11862189B2
公开(公告)日:2024-01-02
申请号:US16837420
申请日:2020-04-01
Applicant: QUALCOMM Incorporated
Inventor: Prajakt Kulkarni , Yinyi Guo , Erik Visser
IPC: G10L25/78 , G10L15/16 , H04W52/02 , G06F18/211 , G06F18/241
CPC classification number: G10L25/78 , G06F18/211 , G06F18/241 , G10L15/16 , H04W52/0229 , H04W52/0261
Abstract: A device to perform target sound detection includes one or more processors. The one or more processors include a buffer configured to store audio data and a target sound detector. The target sound detector includes a first stage and a second stage. The first stage includes a binary target sound classifier configured to process the audio data. The first stage is configured to activate the second stage in response to detection of a target sound. The second stage is configured to receive the audio data from the buffer in response to the detection of the target sound.
-
公开(公告)号:US11290518B2
公开(公告)日:2022-03-29
申请号:US15717027
申请日:2017-09-27
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Yinyi Guo
Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.
-
公开(公告)号:US10720165B2
公开(公告)日:2020-07-21
申请号:US15413110
申请日:2017-01-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Erik Visser
IPC: G10L17/20 , G10L17/24 , G10L17/04 , G10L25/21 , G10L17/06 , G10L25/24 , G10L21/0272 , G10L21/0208
Abstract: A method of authenticating a user based on voice recognition of a keyword includes generating, at a processor, clean speech statistics. The clean speech statistics are generated from an audio recording of the keyword spoken by the user during an enrollment phase. The method further includes separating speech data and noise data from noisy input speech using the clean speech statistics during an authentication phase. The method also includes authenticating the user by comparing the speech data to the clean speech statistics or by comparing the noisy input speech to noisy speech statistics. The noisy speech statistics are based at least in part on the noise data.
-
公开(公告)号:US20190355351A1
公开(公告)日:2019-11-21
申请号:US15982851
申请日:2018-05-17
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Yinyi Guo , Ravi Choudhary , Sunkuk Moon , Erik Visser , Fatemeh Saki
IPC: G10L15/22 , G06F3/16 , G10L15/18 , G10L25/63 , G06F3/0484
Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.
-
36.
公开(公告)号:US20190098070A1
公开(公告)日:2019-03-28
申请号:US15717027
申请日:2017-09-27
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Yinyi Guo
Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.
-
公开(公告)号:US20180211671A1
公开(公告)日:2018-07-26
申请号:US15413110
申请日:2017-01-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Erik Visser
Abstract: A method of authenticating a user based on voice recognition of a keyword includes generating, at a processor, clean speech statistics. The clean speech statistics are generated from an audio recording of the keyword spoken by the user during an enrollment phase. The method further includes separating speech data and noise data from noisy input speech using the clean speech statistics during an authentication phase. The method also includes authenticating the user by comparing the speech data to the clean speech statistics or by comparing the noisy input speech to noisy speech statistics. The noisy speech statistics are based at least in part on the noise data.
-
公开(公告)号:US10013975B2
公开(公告)日:2018-07-03
申请号:US14629109
申请日:2015-02-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Juhan Nam , Erik Visser , Shuhua Zhang , Lae-Hoon Kim
IPC: G10L21/02 , G10L15/20 , G10L15/06 , G10L21/0208 , G10L21/028
CPC classification number: G10L15/20 , G10L15/06 , G10L21/0208 , G10L21/028
Abstract: A method for speech modeling by an electronic device is described. The method includes obtaining a real-time noise reference based on a noisy speech signal. The method also includes obtaining a real-time noise dictionary based on the real-time noise reference. The method further includes obtaining a first speech dictionary and a second speech dictionary. The method additionally includes reducing residual noise based on the real-time noise dictionary and the first speech dictionary to produce a residual noise-suppressed speech signal at a first modeling stage. The method also includes generating a reconstructed speech signal based on the residual noise-suppressed speech signal and the second speech dictionary at a second modeling stage.
-
公开(公告)号:US20160254007A1
公开(公告)日:2016-09-01
申请号:US14634637
申请日:2015-02-27
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Shuhua Zhang , Erik Visser , Lae-Hoon Kim , Sanghyun Chi
IPC: G10L21/0208 , G10L21/034 , H03G5/00 , H03G5/16
CPC classification number: G10L21/0208 , G10L21/0232 , G10L21/034 , H03G5/005 , H03G5/165
Abstract: A method for speech restoration by an electronic device is described. The method includes obtaining a noisy speech signal. The method also includes suppressing noise in the noisy speech signal to produce a noise-suppressed speech signal. The noise-suppressed speech signal has a bandwidth that includes at least three subbands. The method further includes iteratively restoring each of the at least three subbands. Each of the at least three subbands is restored based on all previously restored subbands of the at least three subbands.
Abstract translation: 描述了一种通过电子设备进行语音恢复的方法。 该方法包括获得噪声语音信号。 该方法还包括抑制噪声语音信号中的噪声以产生噪声抑制语音信号。 噪声抑制语音信号具有包括至少三个子带的带宽。 该方法还包括迭代地恢复至少三个子带中的每一个。 所述至少三个子带中的每一个基于所述至少三个子带的所有先前恢复的子带被恢复。
-
公开(公告)号:US09305567B2
公开(公告)日:2016-04-05
申请号:US13827894
申请日:2013-03-14
Applicant: QUALCOMM Incorporated
Inventor: Erik Visser , Lae-Hoon Kim , Jongwon Shin , Yinyi Guo , Sang-Uk Ryu , Andre Gustavo P. Schevciw
IPC: G10L21/0208 , G10L15/20 , G10L21/0316 , G10L25/93 , G10L21/0216
CPC classification number: G10L21/0208 , G10L15/20 , G10L21/0316 , G10L25/93 , G10L2021/02165
Abstract: A method for signal level matching by an electronic device is described. The method includes capturing a plurality of audio signals from a plurality of microphones. The method also includes determining a difference signal based on an inter-microphone subtraction. The difference signal includes multiple harmonics. The method also includes determining whether a harmonicity of the difference signal exceeds a harmonicity threshold. The method also includes preserving the harmonics to determine an envelope. The method further applies the envelope to a noise-suppressed signal.
Abstract translation: 描述了一种由电子设备进行信号电平匹配的方法。 该方法包括从多个麦克风中捕获多个音频信号。 该方法还包括基于麦克风间减法确定差分信号。 差分信号包括多个谐波。 该方法还包括确定差分信号的谐波是否超过谐波阈值。 该方法还包括保存谐波以确定信封。 该方法进一步将包络应用于噪声抑制信号。
-
-
-
-
-
-
-
-
-