-
Publication No.: US20200278832A1
Publication date: 2020-09-03
Application No.: US16800735
Application date: 2020-02-25
Applicant: QUALCOMM Incorporated
Inventor: Taher Shahbazi Mirzahasanloo , Rogerio Guedes Alves , Lae-Hoon Kim , Erik Visser , Dongmei Wang , Fatemeh Saki
Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.
-
Publication No.: US20250077177A1
Publication date: 2025-03-06
Application No.: US18953658
Application date: 2024-11-20
Applicant: QUALCOMM Incorporated
Inventor: Taher Shahbazi Mirzahasanloo , Rogerio Guedes Alves , Lae-Hoon Kim , Erik Visser , Dongmei Wang , Fatemeh Saki
Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.
-
Publication No.: US11589153B2
Publication date: 2023-02-21
Application No.: US17201998
Application date: 2021-03-15
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves
Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.
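As a rough illustration of the final filter-and-mix step described in this abstract, the sketch below assumes the external and self-voice signals have already been separated and that the SVAD decision is available as a boolean. The function names, the FIR coefficients, and the choice to pass only the filtered external signal when no self-voice is detected are all hypothetical; the abstract does not specify any of them.

```python
def fir_filter(signal, coeffs):
    """Apply a causal FIR filter (direct-form convolution, same length as input)."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, c in enumerate(coeffs):
            if n - k >= 0:
                acc += c * signal[n - k]
        out.append(acc)
    return out


def mix_filtered(external, self_voice, external_coeffs, self_voice_coeffs,
                 self_voice_active):
    """Filter each separated signal with its own filter, then sum them.

    `self_voice_active` stands in for the SVAD decision (assumed input).
    When it is False, only the filtered external signal is returned
    (an assumption made for this sketch).
    """
    filtered_external = fir_filter(external, external_coeffs)
    if not self_voice_active:
        return filtered_external
    filtered_self = fir_filter(self_voice, self_voice_coeffs)
    return [a + b for a, b in zip(filtered_external, filtered_self)]


# Toy signals and coefficients, chosen only to make the arithmetic easy to follow.
external = [1.0, 0.0, 0.0]
self_voice = [0.0, 1.0, 0.0]
mixed = mix_filtered(external, self_voice, [0.5], [1.0, 0.5], True)
passthrough = mix_filtered(external, self_voice, [0.5], [1.0, 0.5], False)
```

Using two separate coefficient sets mirrors the abstract's "first filter" and "second filter"; how those filters would be designed so that the output sounds natural is outside the scope of this sketch.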
-
Publication No.: US11212637B2
Publication date: 2021-12-28
Application No.: US15951907
Application date: 2018-04-12
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Lae-Hoon Kim , Dongmei Wang , Erik Visser
Abstract: An apparatus includes a processor configured to receive one or more media signals associated with a scene. The processor is also configured to identify a spatial location in the scene for each source of the one or more media signals. The processor is further configured to identify audio content for each media signal of the one or more media signals. The processor is also configured to determine one or more candidate spatial locations in the scene based on the identified spatial locations. The processor is further configured to generate audio to playback as virtual sounds that originate from the one or more candidate spatial locations.
-
Publication No.: US20210151064A1
Publication date: 2021-05-20
Application No.: US16685987
Application date: 2019-11-15
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon KIM , Dongmei Wang , Cheng-Yu Hung , Erik Visser
Abstract: A device includes one or more processors configured to perform signal processing including a linear transformation and a non-linear transformation of an input signal to generate a reference target signal. The reference target signal has a linear component associated with the linear transformation and a non-linear component associated with the non-linear transformation. The one or more processors are also configured to perform linear filtering of the input signal by controlling adaptation of the linear filtering to generate an output signal that substantially matches the linear component of the reference target signal.
-
Publication No.: US10957334B2
Publication date: 2021-03-23
Application No.: US16224022
Application date: 2018-12-18
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Sharon Kaziunas , Anne Katrin Konertz , Erik Visser , Cheng-Yu Hung , Shuhua Zhang , Fatemeh Saki , Dongmei Wang
IPC: H04R3/00 , G10L21/0216 , H04R3/04 , H04R5/033 , H04R5/04
Abstract: Methods, systems, computer-readable media, and apparatuses for signal enhancement are presented. One example of such an apparatus includes a receiver configured to produce a remote speech signal from information carried by a wireless signal; a signal canceller configured to perform a signal cancellation operation on a local speech signal to generate a room response; and a filter configured to filter the remote speech signal according to the room response to produce a filtered speech signal. In this example, the signal cancellation operation is based on the remote speech signal as a reference signal.
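One way to picture the signal canceller in this abstract is as an adaptive filter that uses the remote speech as its reference and tries to cancel it from the local signal; the converged filter weights then act as an estimate of the room response. The sketch below uses a normalized LMS (NLMS) update, which is an assumption made here for illustration and is not stated in the abstract. The three-tap "room", the step size, and all function names are likewise hypothetical.

```python
import random


def fir_filter(signal, response):
    """Apply a causal FIR response to a signal (same length as input)."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(response):
            if n - k >= 0:
                acc += h * signal[n - k]
        out.append(acc)
    return out


def nlms_estimate_response(reference, observed, taps, step=0.5, eps=1e-8):
    """Estimate FIR weights so that the filtered reference cancels `observed`."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference sample first
    for x, d in zip(reference, observed):
        history = [x] + history[:-1]
        predicted = sum(w * u for w, u in zip(weights, history))
        error = d - predicted  # residual left after cancellation
        norm = sum(u * u for u in history) + eps
        scale = step * error / norm
        weights = [w + scale * u for w, u in zip(weights, history)]
    return weights


rng = random.Random(0)
remote = [rng.uniform(-1.0, 1.0) for _ in range(2000)]  # stand-in for remote speech
true_room = [0.6, 0.3, -0.1]            # hypothetical room response
local = fir_filter(remote, true_room)   # local pickup of the same talker, noise-free

estimated_room = nlms_estimate_response(remote, local, taps=3)
filtered_remote = fir_filter(remote, estimated_room)  # remote speech shaped by the room
```

In this noise-free toy setup the estimate converges to the simulated response; with real speech and background noise the convergence would be slower and only approximate.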
-
Publication No.: US12069425B2
Publication date: 2024-08-20
Application No.: US18349920
Application date: 2023-07-10
Applicant: Qualcomm Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves
CPC classification number: H04R1/1083 , H04R1/1075 , H04R2420/07 , H04R2460/01 , H04R2460/13
Abstract: A wearable device may include a processor configured to detect a self-voice signal, based on one or more transducers. The processor may be configured to separate the self-voice signal from a background signal in an external audio signal based on using a multi-microphone speech generative network. The processor may also be configured to apply a first filter to an external audio signal, detected by at least one external microphone on the wearable device, during a listen through operation based on an activation of the audio zoom feature to generate a first listen-through signal that includes the external audio signal. The processor may be configured to produce an output audio signal that is based on at least the first listen-through signal that includes the external signal, and is based on the detected self-voice signal.
-
Publication No.: US11804233B2
Publication date: 2023-10-31
Application No.: US16685987
Application date: 2019-11-15
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Cheng-Yu Hung , Erik Visser
IPC: G10L21/02 , G06T5/10 , G10L15/16 , G10L15/22 , G10L21/0232 , G10L21/0364 , G10L25/78 , G06V40/16 , G06V10/764 , G06V10/82 , G06V10/30 , G10L21/0208 , H04B1/06
CPC classification number: G10L21/02 , G06T5/10 , G06V10/30 , G06V10/764 , G06V10/82 , G06V40/16 , G06V40/161 , G10L15/16 , G10L15/22 , G10L21/0232 , G10L21/0364 , G10L25/78 , G10L2021/02082 , H04B1/06
Abstract: A device includes one or more processors configured to perform signal processing including a linear transformation and a non-linear transformation of an input signal to generate a reference target signal. The reference target signal has a linear component associated with the linear transformation and a non-linear component associated with the non-linear transformation. The one or more processors are also configured to perform linear filtering of the input signal by controlling adaptation of the linear filtering to generate an output signal that substantially matches the linear component of the reference target signal.
-
Publication No.: US11743631B2
Publication date: 2023-08-29
Application No.: US18063493
Application date: 2022-12-08
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves
CPC classification number: H04R1/1083 , H04R1/1075 , H04R2420/07 , H04R2460/01 , H04R2460/13
Abstract: A wearable device may include a processor configured to perform active noise cancelation (ANC) applied to an input audio signal received by at least one microphone, and detect a self-voice signal, based on one or more transducers. The processor may also be configured to apply a first filter to an external audio signal, detected by at least one external microphone on the wearable device, during a listen through operation based on an activation of the audio zoom feature to generate a first listen-through signal that includes the external audio signal. The processor may also be configured to after the activation of the audio zoom feature terminate a second filter that provides low frequency compensation. The processor may be configured to produce an output audio signal that is based on at least the first listen-through signal that includes the external signal, and is based on the detected self-voice signal.
-
Publication No.: US10681452B1
Publication date: 2020-06-09
Application No.: US16285923
Application date: 2019-02-26
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves
Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.