Patent search ap:("QUALCOMM INCORPORATED") AND inv:"Erik Visser" Page 15

141.

发明授权
Mixed adaptive and fixed coefficient neural networks for speech enhancement 有权

公开(公告)号：US11705147B2

公开(公告)日：2023-07-18

申请号：US17243434

申请日：2021-04-28

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Vahid Montazeri , Shuhua Zhang , Lae-Hoon Kim

IPC: G10L21/0208 , G10L25/30 , G06N3/08 , G06N3/044 , G06N3/045

CPC classification number: G10L21/0208 , G06N3/044 , G06N3/045 , G06N3/08 , G10L25/30 , G10L2021/02082

Abstract: Systems, methods and computer-readable media are provided for speech enhancement using a hybrid neural network. An example process can include receiving, by a first neural network portion of the hybrid neural network, audio data and reference data, the audio data including speech data, noise data, and echo data; filtering, by the first neural network portion, a portion of the audio data based on adapted coefficients of the first neural network portion, the portion of the audio data including the noise data and/or echo data; based on the filtering, generating, by the first neural network portion, filtered audio data including the speech data and an unfiltered portion of the noise data and/or echo data; and based on the filtered audio data and the reference data, extracting, by a second neural network portion of the hybrid neural network, the speech data from the filtered audio data.

142.

发明授权
Sound event detection learning 有权

公开(公告)号：US11664044B2

公开(公告)日：2023-05-30

申请号：US17102797

申请日：2020-11-24

Applicant: QUALCOMM Incorporated

Inventor： Fatemeh Saki , Yinyi Guo , Erik Visser , Eunjeong Koh

IPC: G10L25/78 , G10L25/30 , G10L25/24 , H04R5/033 , B60R11/02 , G06N3/084 , G06N3/045

CPC classification number: G10L25/78 , B60R11/0247 , G06N3/045 , G06N3/084 , G10L25/24 , G10L25/30 , H04R5/033 , H04R2499/13

Abstract: A device includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound classes. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound classes. A second count of classes of the second set of sound classes is greater than a first count of classes of the first set of sound classes. The processor is also configured to provide the first output to a neural adapter to generate a third output corresponding to the second set of sound classes. The processor is further configured to provide the second output and the third output to a merger adapter to generate sound event identification data based on the audio data samples.

143.

发明授权
User speech profile management 有权

公开(公告)号：US11626104B2

公开(公告)日：2023-04-11

申请号：US17115158

申请日：2020-12-08

Applicant: QUALCOMM Incorporated

Inventor： Soo Jin Park , Sunkuk Moon , Lae-Hoon Kim , Erik Visser

IPC: G10L17/00 , G10L15/07 , G06F1/3231 , G10L15/04 , G10L15/16

Abstract: A device includes processors configured to determine, in a first power mode, whether an audio stream corresponds to speech of at least two talkers. The processors are configured to, based on determining that the audio stream corresponds to speech of at least two talkers, analyze, in a second power mode, audio feature data of the audio stream to generate a segmentation result. The processors are configured to perform a comparison of a plurality of user speech profiles to an audio feature data set of a plurality of audio feature data sets of a talker-homogenous audio segment to determine whether the audio feature data set matches any of the user speech profiles. The processors are configured to, based on determining that the audio feature data set does not match any of the plurality of user speech profiles, generate a user speech profile based on the plurality of audio feature data sets.

144.

发明授权
Seamless listen-through for a wearable device 有权

公开(公告)号：US11589153B2

公开(公告)日：2023-02-21

申请号：US17201998

申请日：2021-03-15

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Dongmei Wang , Fatemeh Saki , Taher Shahbazi Mirzahasanloo , Erik Visser , Rogerio Guedes Alves

IPC: A61F11/06 , G10K11/16 , H03B29/00 , H04R1/10

Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.

145.

发明申请
AUDIO EVENT DATA PROCESSING 有权

公开(公告)号：US20230035531A1

公开(公告)日：2023-02-02

申请号：US17814665

申请日：2022-07-25

Applicant: QUALCOMM Incorporated

Inventor： Erik Visser , Fatemeh Saki , Yinyi Guo , Lae-Hoon Kim , Rogerio Guedes Alves , Hannes Pessentheiner

IPC: H04R1/10 , H04S3/00 , H04S7/00 , H04R5/027 , H04R3/00

Abstract: A second device includes a memory configured to store instructions and one or more processors configured to receive, from a first device, an indication of an audio class corresponding to an audio event.

146.

发明授权
Active self-voice naturalization using a bone conduction sensor 有权

公开(公告)号：US11533561B2

公开(公告)日：2022-12-20

申请号：US17667779

申请日：2022-02-09

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Rogerio Guedes Alves , Jacob Jon Bean , Erik Visser

IPC: H04R3/04

Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

147.

发明申请
IN-VEHICLE VOICE COMMAND CONTROL 有权

公开(公告)号：US20220277744A1

公开(公告)日：2022-09-01

申请号：US17747631

申请日：2022-05-18

Applicant: QUALCOMM Incorporated

Inventor： Asif Iqbal Mohammad , Sreekanth Narayanaswamy , Rishabh Tyagi , Erik Visser

IPC: G10L15/22 , G10L25/84 , G06F3/16 , H04R1/40 , H04R3/00 , G10K11/178

Abstract: A vehicle includes an interface device, an in-vehicle control unit, a functional unit, and a processing circuitry. The interface device receives a spoken command to identify an in-cabin vehicle zone of two or more in-cabin vehicle zones of the vehicle, and receives background audio data concurrently with a portion of the spoken command. The in-cabin vehicle control unit separates the background audio data from the spoken command, and selects which in-cabin vehicle zone of the two or more in-cabin vehicle zones is identified by the spoken command. The functional unit controls a function within the vehicle. The processing circuitry stores, to a command buffer, data processed from the received spoken command, and controls, based on the data processed from the received spoken command, the functional unit using audio input received from the selected in-cabin vehicle zone.

148.

发明申请
ACTIVE SELF-VOICE NATURALIZATION USING A BONE CONDUCTION SENSOR 有权

公开(公告)号：US20220272451A1

公开(公告)日：2022-08-25

申请号：US17667779

申请日：2022-02-09

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Rogerio Guedes Alves , Jacob Jon Bean , Erik Visser

IPC: H04R3/04

Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

149.

发明授权
Multi-modal user interface 有权

公开(公告)号：US11348581B2

公开(公告)日：2022-05-31

申请号：US16685946

申请日：2019-11-15

Applicant: QUALCOMM Incorporated

Inventor： Ravi Choudhary , Lae-Hoon Kim , Sunkuk Moon , Yinyi Guo , Fatemeh Saki , Erik Visser

IPC: G10L15/22 , G06F3/16 , G06F3/01 , G10L15/26 , G06F3/038 , G06F3/04883 , G06F3/0484 , G06F9/451 , G10L15/20

Abstract: A device for multi-modal user input includes a processor configured to process first data received from a first input device. The first data indicates a first input from a user based on a first input mode. The first input corresponds to a command. The processor is configured to send a feedback message to an output device based on processing the first data. The feedback message instructs the user to provide, based on a second input mode that is different from the first input mode, a second input that identifies a command associated with the first input. The processor is configured to receive second data from a second input device, the second data indicating the second input, and to update a mapping to associate the first input to the command identified by the second input.

150.

发明授权
Shared speech processing network for multiple speech applications 有权

公开(公告)号：US11276415B2

公开(公告)日：2022-03-15

申请号：US16844836

申请日：2020-04-09

Applicant: QUALCOMM Incorporated

Inventor： Lae-Hoon Kim , Sunkuk Moon , Erik Visser , Prajakt Kulkarni

IPC: G10L21/02 , H04R5/04 , H04R3/00 , G06N20/00 , H04L29/06 , G06K9/62 , H04L65/60 , H04L65/80

Abstract: A device to process speech includes a speech processing network that includes an input configured to receive audio data corresponding to audio captured by one or more microphones. The speech processing network also includes one or more network layers configured to process the audio data to generate an output representation of the audio data. The speech processing network includes an output configured to be coupled to multiple speech application modules to enable the output representation to be provided as a common input to each of the multiple speech application modules.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification