-
Publication No.: US11783809B2
Publication date: 2023-10-10
Application No.: US17308593
Filing date: 2021-05-05
Applicant: QUALCOMM Incorporated
Inventor: Taher Shahbazi Mirzahasanloo , Rogerio Guedes Alves , Erik Visser , Lae-Hoon Kim
Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including first audio data corresponding to a first output of a first microphone and second audio data corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a dynamic classifier. The dynamic classifier is configured to generate a classification output corresponding to the audio data. The one or more processors are further configured to execute the instructions to determine, at least partially based on the classification output, whether the audio data corresponds to user voice activity.
-
Publication No.: US11700484B2
Publication date: 2023-07-11
Application No.: US17650595
Filing date: 2022-02-10
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Sunkuk Moon , Erik Visser , Prajakt Kulkarni
IPC: H04R3/00 , G10L21/02 , H04R5/04 , G06N20/00 , H04L65/60 , H04L65/80 , G06F18/21 , G06V10/82 , G06V20/20
CPC classification number: H04R3/005 , G06F18/217 , G06N20/00 , G06V10/82 , G06V20/20 , G10L21/02 , H04L65/60 , H04L65/80 , H04R5/04 , H04R2420/07 , H04R2499/13
Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate a network output. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the network output to be provided as a common input to each of the multiple speech application modules. A first speech application module corresponds to a speaker verifier, and a second speech application module corresponds to a speech recognition network.
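The arrangement in this abstract, one network output shared as a common input by several speech application modules, can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: `shared_network` is a toy per-frame energy computation standing in for the network layers, and the two modules are threshold rules standing in for a speaker verifier and a speech recognition network. The only point shown is that the features are computed once and handed to every module.

```python
from typing import Callable, Dict, List, Sequence

Features = List[float]


def shared_network(audio: Sequence[float], frame: int = 4) -> Features:
    """Hypothetical stand-in for the shared network layers: mean energy per frame."""
    return [
        sum(x * x for x in audio[i:i + frame]) / frame
        for i in range(0, len(audio) - frame + 1, frame)
    ]


def speaker_verifier(features: Features) -> bool:
    """Toy verifier: accepts when the average energy exceeds a made-up threshold."""
    if not features:
        return False
    return sum(features) / len(features) > 0.1


def speech_recognizer(features: Features) -> str:
    """Toy recognizer: marks each frame as speech ('S') or silence ('_')."""
    return "".join("S" if f > 0.1 else "_" for f in features)


def run_pipeline(
    audio: Sequence[float], modules: Dict[str, Callable[[Features], object]]
) -> Dict[str, object]:
    """Compute the shared output once, then pass it to every application module."""
    features = shared_network(audio)
    return {name: module(features) for name, module in modules.items()}


MODULES = {
    "speaker_verifier": speaker_verifier,
    "speech_recognizer": speech_recognizer,
}
result = run_pipeline([0.0, 0.0, 0.0, 0.0, 0.5, 0.5, 0.5, 0.5], MODULES)
```

Adding a further module in this sketch only means adding an entry to `MODULES`; the shared computation is not repeated.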
-
Publication No.: US11290518B2
Publication date: 2022-03-29
Application No.: US15717027
Filing date: 2017-09-27
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Yinyi Guo
Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.
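The sequence in this abstract (select a remote device based on an intention code, transfer the code, receive an acknowledgement, then control the device) can be sketched as follows. This is an illustrative model only: the class names, the `routing` table, and the in-process method call standing in for the wireless transfer are all assumptions, not details from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class RemoteDevice:
    name: str
    received: List[str] = field(default_factory=list)

    def receive(self, intention_code: str) -> bool:
        """Store the code and acknowledge that it arrived."""
        self.received.append(intention_code)
        return True


@dataclass
class CommandDevice:
    # Hypothetical table mapping an intention code to the name of its target device.
    routing: Dict[str, str]
    devices: Dict[str, RemoteDevice]

    def send(self, intention_code: str) -> Optional[str]:
        """Select the target from the code, transfer it, and return the target
        name once the transfer is acknowledged (None if nothing matched)."""
        target = self.routing.get(intention_code)
        if target is None or target not in self.devices:
            return None
        acknowledged = self.devices[target].receive(intention_code)
        return target if acknowledged else None


lamp = RemoteDevice("lamp")
tv = RemoteDevice("tv")
commander = CommandDevice(
    routing={"lights_on": "lamp", "volume_up": "tv"},
    devices={"lamp": lamp, "tv": tv},
)
controlled = commander.send("lights_on")
```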
-
Publication No.: US10964335B2
Publication date: 2021-03-30
Application No.: US15948681
Filing date: 2018-04-09
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Shuhua Zhang , Erik Visser
IPC: H04R3/00 , G10L21/0364 , H04R1/40 , G10L25/84
Abstract: Methods, systems, and devices for auditory enhancement are described. A device may receive a respective auditory signal at each of a set of microphones, where each auditory signal includes a respective representation of a target auditory component and one or more noise artifacts. The device may identify a directionality associated with a source of the target auditory component (e.g., based on an arrangement of the multiple microphones). The device may determine a distribution function for the target auditory component based at least in part on the directionality associated with the source and on the received plurality of auditory signals. The device may generate an estimate of the target auditory component based at least in part on the distribution function and output the estimate of the target auditory component.
-
Publication No.: US20200278832A1
Publication date: 2020-09-03
Application No.: US16800735
Filing date: 2020-02-25
Applicant: QUALCOMM Incorporated
Inventor: Taher Shahbazi Mirzahasanloo , Rogerio Guedes Alves , Lae-Hoon Kim , Erik Visser , Dongmei Wang , Fatemeh Saki
Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.
-
Publication No.: US10720165B2
Publication date: 2020-07-21
Application No.: US15413110
Filing date: 2017-01-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Erik Visser
IPC: G10L17/20 , G10L17/24 , G10L17/04 , G10L25/21 , G10L17/06 , G10L25/24 , G10L21/0272 , G10L21/0208
Abstract: A method of authenticating a user based on voice recognition of a keyword includes generating, at a processor, clean speech statistics. The clean speech statistics are generated from an audio recording of the keyword spoken by the user during an enrollment phase. The method further includes separating speech data and noise data from noisy input speech using the clean speech statistics during an authentication phase. The method also includes authenticating the user by comparing the speech data to the clean speech statistics or by comparing the noisy input speech to noisy speech statistics. The noisy speech statistics are based at least in part on the noise data.
-
Publication No.: US20190355351A1
Publication date: 2019-11-21
Application No.: US15982851
Filing date: 2018-05-17
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Yinyi Guo , Ravi Choudhary , Sunkuk Moon , Erik Visser , Fatemeh Saki
IPC: G10L15/22 , G06F3/16 , G10L15/18 , G10L25/63 , G06F3/0484
Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or more subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.
-
Publication No.: US20190098070A1
Publication date: 2019-03-28
Application No.: US15717027
Filing date: 2017-09-27
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Yinyi Guo
Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.
-
Publication No.: US10051364B2
Publication date: 2018-08-14
Application No.: US14789766
Filing date: 2015-07-01
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Raghuveer Peri , Phuong Lam Ton , Jeremy Patrick Toman , Troy Schultz , Jimeng Zheng
IPC: H04R3/00 , G06F3/0487 , G06F3/0484 , G06F3/16 , H04S7/00 , H04R29/00 , G10L21/0208
Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.
-
Publication No.: US20180211671A1
Publication date: 2018-07-26
Application No.: US15413110
Filing date: 2017-01-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Erik Visser
Abstract: A method of authenticating a user based on voice recognition of a keyword includes generating, at a processor, clean speech statistics. The clean speech statistics are generated from an audio recording of the keyword spoken by the user during an enrollment phase. The method further includes separating speech data and noise data from noisy input speech using the clean speech statistics during an authentication phase. The method also includes authenticating the user by comparing the speech data to the clean speech statistics or by comparing the noisy input speech to noisy speech statistics. The noisy speech statistics are based at least in part on the noise data.