-
公开(公告)号:US12002455B2
公开(公告)日:2024-06-04
申请号:US17383284
申请日:2021-07-22
Applicant: QUALCOMM Incorporated
Inventor: Arvind Krishna Sridhar , Ravi Choudhary , Lae-Hoon Kim , Erik Visser
IPC: G10L15/16 , G06V30/262 , G10L15/18 , G10L15/22 , G06N3/0455 , G10L15/183
CPC classification number: G10L15/1815 , G06V30/274 , G10L15/1822 , G10L15/22 , G06N3/0455 , G10L15/16 , G10L15/183
Abstract: A device includes a memory configured to store instructions. The device also includes one or more processors configured to execute the instructions to provide context and one or more items of interest corresponding to the context to a dependency network encoder to generate a semantic-based representation of the context. The one or more processors are also configured to provide the context to a data dependent encoder to generate a context-based representation. The one or more processors are further configured to combine the semantic-based representation and the context-based representation to generate a semantically-augmented representation of the context.
-
公开(公告)号:US11805360B2
公开(公告)日:2023-10-31
申请号:US17382166
申请日:2021-07-21
Applicant: QUALCOMM Incorporated
Inventor: Vahid Montazeri , Van Nguyen , Hannes Pessentheiner , Lae-Hoon Kim , Erik Visser , Rogerio Guedes Alves
CPC classification number: H04R3/04 , G06N3/08 , H04R3/005 , H04R5/033 , H04R5/04 , H04S1/007 , H04S7/307 , H04R2420/07
Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including a first audio frame corresponding to a first output of a first microphone and a second audio frame corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a first noise-suppression network and a second noise-suppression network. The first noise-suppression network is configured to generate a first noise-suppressed audio frame and the second noise-suppression network is configured to generate a second noise-suppressed audio frame. The one or more processors are further configured to execute the instructions to provide the noise-suppressed audio frames to an attention-pooling network. The attention-pooling network is configured to generate an output noise-suppressed audio frame.
-
公开(公告)号:US10951975B2
公开(公告)日:2021-03-16
申请号:US16896010
申请日:2020-06-08
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves
Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.
-
公开(公告)号:US10909988B2
公开(公告)日:2021-02-02
申请号:US16140227
申请日:2018-09-24
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Erik Visser , Phuong Lam Ton , Jeremy Patrick Toman , Jeffrey Clinton Shaw
IPC: G06F3/0484 , G10L17/00 , H04R3/00 , G01S3/80 , G01S5/18 , G01S15/87 , H04R1/08 , G01B21/00 , G06F3/16 , G06F16/432 , G06F3/0481 , G06F3/0488 , H04S7/00 , G01S3/808 , G06F1/16 , G10L21/0216 , G01S15/86
Abstract: An electronic device includes a display, wherein the display is configured to present a user interface, wherein the user interface comprises a coordinate system. The coordinate system corresponds to physical coordinates. The display is configured to present a sector selection feature that allows selection of at least one sector of the coordinate system. The at least one sector corresponds to captured audio from multiple microphones. The sector selection may also include an audio signal indicator. The electronic device includes operation circuitry coupled to the display. The operation circuitry is configured to perform an audio operation on the captured audio corresponding to the audio signal indicator based on the sector selection.
-
公开(公告)号:US10854214B2
公开(公告)日:2020-12-01
申请号:US16370812
申请日:2019-03-29
Applicant: QUALCOMM Incorporated
Inventor: Rogerio Guedes Alves , Taher Shahbazi Mirzahasanloo , Erik Visser , Lae-Hoon Kim , Fatemeh Saki , Dongmei Wang
IPC: H04R1/00 , H04R3/04 , G10L21/0208 , G10L21/02
Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.
-
公开(公告)号:US10798513B2
公开(公告)日:2020-10-06
申请号:US16206680
申请日:2018-11-30
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Dongmei Wang , Erik Visser
Abstract: Methods, systems, and devices for head-related transfer function generation are described. A device may receive a digital representation of a first audio signal associated with a location relative to a subject, and select from a database a first reference head-related transfer function measurement pair corresponding to the location of the first audio signal. The device may then obtain a second head-related transfer function measurement pair by performing a style transfer operation on the selected reference head-related transfer function measurement pair based on a set of head-related transfer function measurement pairs specific to the subject. As a result, the device may output a second audio signal based on the digital representation of the first audio signal and the second head-related transfer function measurement pair.
-
公开(公告)号:US10547947B2
公开(公告)日:2020-01-28
申请号:US15158505
申请日:2016-05-18
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Hyun Jin Park , Erik Visser , Raghuveer Peri
IPC: H04R3/12 , H04R5/02 , H04S7/00 , H03G5/16 , H03G3/30 , H03G3/02 , H04R5/04 , H04R5/033 , G06F3/16 , H04R29/00
Abstract: A headset device includes a first earpiece configured to receive a reference sound and to generate a first reference audio signal based on the reference sound. The headset device further includes a second earpiece configured to receive the reference sound and to generate a second reference audio signal based on the reference sound. The headset device further includes a controller coupled to the first earpiece and to the second earpiece. The controller is configured to generate a first signal and a second signal based on a phase relationship between the first reference audio signal and the second reference audio signal. The controller is further configured to output the first signal to the first earpiece and output the second signal to the second earpiece.
-
公开(公告)号:US10540979B2
公开(公告)日:2020-01-21
申请号:US14688974
申请日:2015-04-16
Applicant: QUALCOMM Incorporated
Inventor: Erik Visser , Lae-Hoon Kim , Minho Jin , Yinyi Guo
Abstract: A device includes a memory, a receiver, a processor, and a display. The memory is configured to store a speaker model. The receiver is configured to receive an input audio signal. The processor is configured to determine a first confidence level associated with a first portion of the input audio signal based on the speaker model. The processor is also configured to determine a second confidence level associated with a second portion of the input audio signal based on the speaker model. The display is configured to present a graphical user interface associated with the first confidence level or associated with the second confidence level.
-
公开(公告)号:US20190320281A1
公开(公告)日:2019-10-17
申请号:US15951907
申请日:2018-04-12
Applicant: QUALCOMM Incorporated
Inventor: Yinyi Guo , Lae-Hoon Kim , Dongmei Wang , Erik Visser
Abstract: An apparatus includes a processor configured to receive one or more media signals associated with a scene. The processor is also configured to identify a spatial location in the scene for each source of the one or more media signals. The processor is further configured to identify audio content for each media signal of the one or more media signals. The processor is also configured to determine one or more candidate spatial locations in the scene based on the identified spatial locations. The processor is further configured to generate audio to playback as virtual sounds that originate from the one or more candidate spatial locations.
-
公开(公告)号:US20190311728A1
公开(公告)日:2019-10-10
申请号:US15948681
申请日:2018-04-09
Applicant: QUALCOMM Incorporated
Inventor: Lae-Hoon Kim , Shuhua Zhang , Erik Visser
Abstract: Methods, systems, and devices for auditory enhancement are described. A device may receive a respective auditory signal at each of a set of microphones, where each auditory signal includes a respective representation of a target auditory component and one or more noise artifacts. The device may identify a directionality associated with a source of the target auditory component (e.g., based on an arrangement of the multiple microphones). The device may determine a distribution function for the target auditory component based at least in part on the directionality associated with the source and on the received plurality of auditory signals. The device may generate an estimate of the target auditory component based at least in part on the distribution function and output the estimate of the target auditory component.
-
-
-
-
-
-
-
-
-