-
Publication number: US11646009B1
Publication date: 2023-05-09
Application number: US16902476
Application date: 2020-06-16
Applicant: Amazon Technologies, Inc.
Inventor: Amit Singh Chhetri, Navin Chatlani
IPC: G10L25/84, G10K11/175, G10L15/16
CPC classification number: G10K11/1752, G10L15/16, G10L25/84
Abstract: A device capable of autonomous motion may move in an environment and may receive audio data from a microphone. A model may be trained to process the audio data to determine mask data, which may be used to mask noise in the audio data. Training data for the model may be normalized before training, and different loss functions may be used for different types of training data.
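The masking step described in this abstract can be illustrated with a minimal sketch. The abstract does not say what form the mask data takes, so the example assumes a real-valued gain between 0 and 1 for each time-frequency bin of a magnitude spectrogram; the function name `apply_mask` and the sample values are hypothetical.

```python
def apply_mask(noisy_mag, mask):
    """Scale each time-frequency bin of a magnitude spectrogram by a mask gain.

    noisy_mag and mask are lists of frames; each frame is a list of
    per-frequency values. Gains are clamped to [0, 1] (an assumption,
    not something the abstract specifies).
    """
    if len(noisy_mag) != len(mask):
        raise ValueError("spectrogram and mask must have the same number of frames")
    masked = []
    for frame, gains in zip(noisy_mag, mask):
        if len(frame) != len(gains):
            raise ValueError("each frame and its mask must have the same number of bins")
        masked.append([m * min(max(g, 0.0), 1.0) for m, g in zip(frame, gains)])
    return masked


# Two frames with three frequency bins each (illustrative values only).
noisy = [[1.0, 4.0, 0.5], [2.0, 0.2, 3.0]]
mask = [[1.0, 0.25, 0.0], [0.5, 0.0, 1.0]]
masked = apply_mask(noisy, mask)
```

A gain near 1 keeps a bin that is mostly speech, and a gain near 0 suppresses a bin that is mostly noise. In the patent's setting the mask would come from the trained model rather than being written by hand.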
-
Publication number: US11521635B1
Publication date: 2022-12-06
Application number: US17108718
Application date: 2020-12-01
Applicant: Amazon Technologies, Inc.
Inventor: Amit Singh Chhetri, Navin Chatlani
IPC: G10L15/20, G10L25/84, G10L21/0232, G10L15/06, G10L15/22, G10L15/16, G06N3/08, G06N3/04, G10L25/90, G10L21/0216, G10L21/0208
Abstract: A computing device may receive audio data from a microphone representing audio in an environment of the device, which may correspond to an utterance and noise. A model may be trained to process the audio data to cancel noise from the audio data. The model may include an encoder that includes one or more dense layers, one or more recurrent layers, and a decoder that includes one or more dense layers.
-
Publication number: US10937418B1
Publication date: 2021-03-02
Application number: US16240294
Application date: 2019-01-04
Applicant: Amazon Technologies, Inc.
Inventor: Navin Chatlani, Krishna Kamath Koteshwara, Trausti Thor Kristjansson, Inseok Heo, Robert Ayrapetian
IPC: G10L15/20, G10L21/0232, G10L15/22, G10L21/0208
Abstract: A system configured to improve echo cancellation for nonlinear systems. The system generates reference audio data by isolating portions of microphone audio data that correspond to playback audio data. For example, the system may determine a correlation between the playback audio data and the microphone audio data in individual time-frequency bands in a frequency domain. In some examples, the system may substitute microphone audio data associated with output audio for the playback audio data. The system may generate the reference audio data based on portions of the microphone audio data that have a strong correlation with the playback audio data. The system may generate the reference audio data by selecting these portions of the microphone audio data or by performing beamforming. This results in precise time alignment between the reference audio data and the microphone audio data, improving performance of the echo cancellation.
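The correlation-based selection described in this abstract can be sketched as follows. This is a simplified illustration, not the patented method: it assumes magnitude spectrograms, computes one Pearson correlation per frequency band over a window of frames, and uses a hypothetical threshold of 0.8. The names `band_correlation` and `select_reference` are made up for the example.

```python
import math


def band_correlation(playback, mic):
    """Pearson correlation between playback and microphone magnitudes,
    computed across frames for each frequency band.

    playback and mic are lists of frames; each frame is a list of
    per-band magnitudes. A band with zero variance gets a correlation of 0.0.
    """
    n_frames = len(playback)
    n_bands = len(playback[0])
    corrs = []
    for b in range(n_bands):
        x = [playback[t][b] for t in range(n_frames)]
        y = [mic[t][b] for t in range(n_frames)]
        mean_x = sum(x) / n_frames
        mean_y = sum(y) / n_frames
        cov = sum((a - mean_x) * (c - mean_y) for a, c in zip(x, y))
        var_x = sum((a - mean_x) ** 2 for a in x)
        var_y = sum((c - mean_y) ** 2 for c in y)
        denom = math.sqrt(var_x * var_y)
        corrs.append(cov / denom if denom > 0 else 0.0)
    return corrs


def select_reference(playback, mic, threshold=0.8):
    """Keep microphone bands that correlate strongly with playback; zero the rest."""
    keep = [c >= threshold for c in band_correlation(playback, mic)]
    return [[m if k else 0.0 for m, k in zip(frame, keep)] for frame in mic]


# Four frames, two bands. Band 0 of the microphone follows the playback;
# band 1 does not (illustrative values only).
playback = [[1.0, 1.0], [2.0, 3.0], [3.0, 2.0], [4.0, 5.0]]
mic = [[2.5, 5.0], [4.5, 1.0], [6.5, 4.0], [8.5, 2.0]]
corrs = band_correlation(playback, mic)
reference = select_reference(playback, mic)
```

Because the reference is built from the microphone signal itself, it is already time-aligned with the signal the echo canceller processes, which is the benefit the abstract points to.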
-
Publication number: US11854564B1
Publication date: 2023-12-26
Application number: US16902731
Application date: 2020-06-16
Applicant: Amazon Technologies, Inc.
Inventor: Navin Chatlani, Amit Singh Chhetri
Abstract: A device capable of autonomous motion may move in an environment and may receive audio data from a microphone. A model may be trained to process the audio data to suppress noise from the audio data. The model may include an encoder that includes one or more convolutional layers, one or more recurrent layers, and a decoder that includes one or more convolutional layers.
-
Publication number: US11425494B1
Publication date: 2022-08-23
Application number: US16439139
Application date: 2019-06-12
Applicant: Amazon Technologies, Inc.
Inventor: Navin Chatlani, Amit Singh Chhetri
IPC: G10L21/02, H04R1/40, G06F3/16, G05D1/00, G05D1/02, G10L15/22, G10L15/20, G10L21/0208, G10L25/78, G10L21/0216
Abstract: A device capable of motion includes a beamformer for determining audio data corresponding to one or more directions. The beamformer includes a target beamformer that boosts audio from a target direction and a null beamformer that suppresses audio from that direction. When the device outputs sound while moving, the target and null beamformers capture and compensate for Doppler effects in output audio that reflects from nearby surfaces back to the device.
-
Publication number: US11159878B1
Publication date: 2021-10-26
Application number: US16541943
Application date: 2019-08-15
Applicant: Amazon Technologies, Inc.
Inventor: Navin Chatlani, Amit Chhetri, Ananth Raghavendra, Srivatsan Kandadai
Abstract: A device capable of moving a component of the device is capable of determining audio data corresponding to a direction relative to the device (“beamforming”). When the component moves, the device determines a new position of the component and selects one of a set of beamforming filter coefficients corresponding to the position. Using the filter coefficients, the device determines the directional audio data.
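The position-dependent coefficient selection described in this abstract can be sketched as a table lookup. The example assumes the component's position is a rotation angle in degrees and that each stored entry holds one single-tap weight per microphone; both are simplifications, and the names `select_coefficients`, `weighted_sum`, and `FILTER_BANK` are hypothetical.

```python
def select_coefficients(position_deg, filter_bank):
    """Return (stored_position, coefficients) for the stored position
    closest to position_deg, using circular distance in degrees."""
    def circular_distance(a, b):
        d = abs(a - b) % 360.0
        return min(d, 360.0 - d)

    nearest = min(filter_bank, key=lambda p: circular_distance(p, position_deg))
    return nearest, filter_bank[nearest]


def weighted_sum(mic_signals, weights):
    """Combine microphone channels sample by sample: y[n] = sum_m w[m] * x[m][n]."""
    n_samples = len(mic_signals[0])
    return [
        sum(w * channel[n] for w, channel in zip(weights, mic_signals))
        for n in range(n_samples)
    ]


# Hypothetical precomputed weights for four component positions, two microphones.
FILTER_BANK = {
    0.0: [0.5, 0.5],
    90.0: [1.0, 0.0],
    180.0: [0.5, -0.5],
    270.0: [0.0, 1.0],
}

position, weights = select_coefficients(350.0, FILTER_BANK)  # nearest stored position is 0.0
directional = weighted_sum([[1.0, 2.0], [3.0, 4.0]], weights)
```

Precomputing the coefficients for a fixed set of positions means the device only needs a lookup when the component moves, rather than redesigning the beamformer at run time. A practical beamformer would typically store multi-tap or per-frequency filters for each microphone instead of single weights.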
-