-
公开(公告)号:US20200278832A1
公开(公告)日:2020-09-03
申请号:US16800735
申请日:2020-02-25
Applicant: QUALCOMM Incorporated
Inventor: Taher Shahbazi Mirzahasanloo , Rogerio Guedes Alves , Lae-Hoon Kim , Erik Visser , Dongmei Wang , Fatemeh Saki
Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.
-
公开(公告)号:US20190355351A1
公开(公告)日:2019-11-21
申请号:US15982851
申请日:2018-05-17
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Yinyi Guo , Ravi Choudhary , Sunkuk Moon , Erik Visser , Fatemeh Saki
IPC: G10L15/22 , G06F3/16 , G10L15/18 , G10L25/63 , G06F3/0484
Abstract: A device includes a memory configured to store a user experience evaluation unit. A processor is configured to receive a first user input corresponding to a user command to initiate a particular task, the first user input received via a first sensor. The processor is configured to, after receiving the first user input, receive one or more subsequent user inputs, the one or subsequent user inputs including a second user input received via a second sensor. The processor is configured to initiate a remedial action in response to determining, based on the user experience evaluation unit, that the one or more subsequent user inputs correspond to a negative user experience.
-
133.
公开(公告)号:US20190098070A1
公开(公告)日:2019-03-28
申请号:US15717027
申请日:2017-09-27
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Yinyi Guo
Abstract: Various embodiments provide systems and methods which disclose a command device which can be used to establish a wireless connection, through one or more wireless channels, between the command device and a remote device. An intention code may be generated, prior to, or after, the establishment of the wireless connection, and the remote device may be selected based on the intention code. The command device may initiate a wireless transfer, through one or more wireless channels of the established wireless connection, of an intention code, and receive acknowledgement that the intention code was successfully transferred to the remote device. The command device may then control the remote device, based on the intention code sent to the remote device, through the one or more wireless channels of the established wireless connection between the command device and the remote device.
-
公开(公告)号:US10051364B2
公开(公告)日:2018-08-14
申请号:US14789766
申请日:2015-07-01
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Raghuveer Peri , Phuong Lam Ton , Jeremy Patrick Toman , Troy Schultz , Jimeng Zheng
IPC: H04R3/00 , G06F3/0487 , G06F3/0484 , G06F3/16 , H04S7/00 , H04R29/00 , G10L21/0208
Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.
-
公开(公告)号:US10013975B2
公开(公告)日:2018-07-03
申请号:US14629109
申请日:2015-02-23
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Juhan Nam , Erik Visser , Shuhua Zhang , Lae-Hoon Kim
IPC: G10L21/02 , G10L15/20 , G10L15/06 , G10L21/0208 , G10L21/028
CPC classification number: G10L15/20 , G10L15/06 , G10L21/0208 , G10L21/028
Abstract: A method for speech modeling by an electronic device is described. The method includes obtaining a real-time noise reference based on a noisy speech signal. The method also includes obtaining a real-time noise dictionary based on the real-time noise reference. The method further includes obtaining a first speech dictionary and a second speech dictionary. The method additionally includes reducing residual noise based on the real-time noise dictionary and the first speech dictionary to produce a residual noise-suppressed speech signal at a first modeling stage. The method also includes generating a reconstructed speech signal based on the residual noise-suppressed speech signal and the second speech dictionary at a second modeling stage.
-
公开(公告)号:US09947334B2
公开(公告)日:2018-04-17
申请号:US14808870
申请日:2015-07-24
Applicant: QUALCOMM Incorporated
Inventor: Samir Kumar Gupta , Asif Iqbal Mohammad , Erik Visser , Lae-Hoon Kim , Shaun William Van Dyken
IPC: G10L21/0208 , G10K11/175 , H04M9/08 , H04R27/00 , G10L21/0216
CPC classification number: G10L21/0208 , G10K11/175 , G10L2021/02082 , G10L2021/02166 , H04M9/082 , H04R27/00 , H04R2499/13
Abstract: A multichannel acoustic system (MAS) comprises an arrangement of microphones and loudspeakers and a multichannel acoustic processor (MAP) to together enhance conversational speech between two or more persons in a shared acoustic space such as an automobile. The enhancements are achieved by receiving sound signals substantially originating from relatively near sound sources; filtering the sound signals to cancel at least one echo signal detected for at least one microphone from among the plurality of microphones; filtering the sound signals received by the plurality of microphones to cancel at least one feedback signal detected for at least one microphone from among the plurality of microphones; and reproducing the filtered sound signals for each microphone from among the plurality of microphones on a subset of loudspeakers corresponding that are relatively far from the source microphone.
-
公开(公告)号:US09936290B2
公开(公告)日:2018-04-03
申请号:US14156292
申请日:2014-01-15
Applicant: QUALCOMM Incorporated
Inventor: Asif Iqbal Mohammad , Lae-Hoon Kim , Ian Ernan Liu , Erik Visser
Abstract: A method for multi-channel echo cancellation and noise suppression is described. One of multiple echo estimates is selected for non-linear echo cancellation. Echo notch masking is performed on a noise-suppressed signal based on an echo direction of arrival (DOA) to produce an echo-suppressed signal. Non-linear echo cancellation is performed on the echo-suppressed signal based, at least in part, on the selected echo estimate.
-
公开(公告)号:US20180033428A1
公开(公告)日:2018-02-01
申请号:US15387411
申请日:2016-12-21
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Asif Mohammad , Ian Ernan Liu , Ye Jiang
IPC: G10L15/20 , G10L21/028 , H04R1/40 , G10L21/0232 , G10L15/08 , G10L15/22
CPC classification number: G10L15/20 , G10L15/08 , G10L15/22 , G10L21/0208 , G10L21/0232 , G10L21/028 , G10L2015/088 , G10L2015/223 , G10L2021/02166 , H04R1/406 , H04R3/005
Abstract: An apparatus includes multiple microphones to generate audio signals based on sound of a far-field acoustic environment. The apparatus also includes a signal processing system to process the audio signals to generate at least one processed audio signal. The signal processing system is configured to update one or more processing parameters while operating in a first operational mode and is configured to use a static version of the one or more processing parameters while operating in the second operational mode. The apparatus further includes a keyword detection system to perform keyword detection based on the at least one processed audio signal to determine whether the sound includes an utterance corresponding to a keyword and, based on a result of the keyword detection, to send a control signal to the signal processing system to change an operational mode of the signal processing system.
-
公开(公告)号:US20170339491A1
公开(公告)日:2017-11-23
申请号:US15158505
申请日:2016-05-18
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Hyun Jin Park , Erik Visser , Raghuveer Peri
Abstract: A headset device includes a first earpiece configured to receive a reference sound and to generate a first reference audio signal based on the reference sound. The headset device further includes a second earpiece configured to receive the reference sound and to generate a second reference audio signal based on the reference sound. The headset device further includes a controller coupled to the first earpiece and to the second earpiece. The controller is configured to generate a first signal and a second signal based on a phase relationship between the first reference audio signal and the second reference audio signal. The controller is further configured to output the first signal to the first earpiece and output the second signal to the second earpiece.
-
公开(公告)号:US09746916B2
公开(公告)日:2017-08-29
申请号:US13674789
申请日:2012-11-12
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Jongwon Shin , Erik Visser
IPC: G06F3/01 , H04M3/56 , G10L25/48 , G04G21/00 , H04N7/15 , G01S3/808 , G10L17/00 , G10L21/0216 , H04R29/00 , H04S7/00 , G06F1/16 , H04R1/40 , H04R3/00
CPC classification number: G06F3/013 , G01S3/8083 , G04G21/00 , G06F1/1613 , G06F3/011 , G10L17/00 , G10L25/48 , G10L2021/02166 , H04M3/568 , H04N7/15 , H04R1/406 , H04R3/005 , H04R29/005 , H04S7/304
Abstract: Disclosed is an application interface that takes into account the user's gaze direction relative to who is speaking in an interactive multi-participant environment where audio-based contextual information and/or visual-based semantic information is being presented. Among these various implementations, two different types of microphone array devices (MADs) may be used. The first type of MAD is a steerable microphone array (a.k.a. a steerable array) which is worn by a user in a known orientation with regard to the user's eyes, and wherein multiple users may each wear a steerable array. The second type of MAD is a fixed-location microphone array (a.k.a. a fixed array) which is placed in the same acoustic space as the users (one or more of which are using steerable arrays).
-
-
-
-
-
-
-
-
-