Abstract:
In a speech-based system, a wake word or other trigger expression is used to preface user speech that is intended as a command. The system receives multiple directional audio signals, each of which emphasizes sound from a different direction. The signals are monitored and analyzed to detect the directions of interfering audio sources such as televisions or other types of electronic audio players. The directional signal having the strongest presence of speech is selected to be monitored for the trigger expression. If that signal corresponds to the direction of an interfering audio source, a stricter standard is used to detect the trigger expression. In addition, the directional audio signal having the second strongest presence of speech may also be monitored for the trigger expression.
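A minimal Python sketch of the selection logic this abstract describes. The inputs are hypothetical: a per-beam speech-presence score and a set of beam indices previously flagged as pointing at interfering sources; the threshold values are illustrative and not taken from the patent.

    BASE_THRESHOLD = 0.5      # normal wake-word confidence threshold
    STRICT_THRESHOLD = 0.8    # stricter threshold for beams aimed at interferers

    def beams_to_monitor(speech_scores, interferer_beams):
        """Return (beam_index, wake_threshold) pairs to monitor.

        speech_scores    -- speech-presence score per directional signal
        interferer_beams -- indices of beams aimed at interfering sources
        """
        ranked = sorted(range(len(speech_scores)),
                        key=lambda i: speech_scores[i], reverse=True)
        primary = ranked[0]
        monitored = [(primary,
                      STRICT_THRESHOLD if primary in interferer_beams
                      else BASE_THRESHOLD)]
        # Also monitor the beam with the second strongest speech presence
        # when the primary beam points at an interfering source.
        if primary in interferer_beams and len(ranked) > 1:
            secondary = ranked[1]
            monitored.append((secondary,
                              STRICT_THRESHOLD if secondary in interferer_beams
                              else BASE_THRESHOLD))
        return monitored

    # Example: beam 2 has the strongest speech but points at a television.
    print(beams_to_monitor([0.2, 0.4, 0.9, 0.6], interferer_beams={2}))
    # -> [(2, 0.8), (3, 0.5)]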
Abstract:
Features are disclosed for filtering portions of an output audio signal in order to improve automatic speech recognition on an input signal that may include a representation of the output signal. A signal that includes audio content can be received, and a frequency or band of frequencies can be selected to be filtered from that signal. The selected band may correspond to a frequency band that is desirable for speech recognition, so that the presented output does not mask the user's speech within that band. An input signal can then be obtained comprising audio data corresponding to a user utterance and presentation of the filtered output signal. Automatic speech recognition can be performed on the input signal. In some cases, an acoustic model trained for use with such frequency band filtering may be used to perform the speech recognition.
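As a rough illustration of the output-side filtering step, the sketch below removes a band from playback audio with a Butterworth band-stop filter (SciPy). The 300-3000 Hz band edges are an assumption chosen because that range carries much of the speech energy; the abstract does not name specific frequencies.

    import numpy as np
    from scipy.signal import butter, lfilter

    def notch_output_for_asr(output_audio, sample_rate, band=(300.0, 3000.0)):
        """Remove a frequency band from playback audio so that, in the
        microphone signal, that band is dominated by the user's utterance."""
        nyquist = sample_rate / 2.0
        b, a = butter(4, [band[0] / nyquist, band[1] / nyquist],
                      btype="bandstop")
        return lfilter(b, a, output_audio)

    # Example: filter one second of white-noise "playback" at 16 kHz.
    fs = 16000
    filtered = notch_output_for_asr(np.random.randn(fs), fs)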
Abstract:
Embodiments of systems and methods are described for determining weighting coefficients based at least in part on using convex optimization, subject to one or more constraints, to approximate a three-dimensional beampattern. In some implementations, the approximated three-dimensional beampattern comprises a main lobe that includes a look direction for which waveforms detected by a sensor array, such as a microphone array, are not suppressed, and a side lobe that includes other directions for which detected waveforms are suppressed. The one or more constraints can include a constraint that suppression of waveforms received by the sensor array from the side lobe is greater than a threshold. In some implementations, the threshold can depend on at least one of an angular direction of the waveform and a frequency of the waveform.
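A small CVXPY sketch in the spirit of this abstract: beamformer weights are found by convex optimization subject to a distortionless constraint in the look direction and a suppression constraint elsewhere. To stay short it uses a uniform linear array at a single frequency rather than a full three-dimensional beampattern, and every geometry and threshold value is an illustrative assumption.

    import numpy as np
    import cvxpy as cp

    M, d = 8, 0.042                   # mics and spacing (m), ~half wave at 4 kHz
    k = 2 * np.pi * 4000.0 / 343.0    # wavenumber at the design frequency

    def steering(theta):
        """Plane-wave steering vector for arrival angle theta (radians)."""
        return np.exp(1j * k * d * np.arange(M) * np.cos(theta))

    look = np.pi / 2                                      # broadside look direction
    sidelobes = np.deg2rad(np.r_[0:60:13j, 120:180:13j])  # directions to suppress

    w = cp.Variable(M, complex=True)
    constraints = [cp.conj(w) @ steering(look) == 1]      # no suppression at look
    # Require at least ~20 dB suppression (|response| <= 0.1) in the side lobes.
    constraints += [cp.abs(cp.conj(w) @ steering(t)) <= 0.1 for t in sidelobes]
    prob = cp.Problem(cp.Minimize(cp.norm(w, 2)), constraints)  # convex program
    prob.solve()
    print(prob.status, np.round(prob.value, 3))

An angle- or frequency-dependent threshold, as the abstract mentions, would simply replace the constant 0.1 with a function of the direction t and the design frequency.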
Abstract:
An echo path change detector may be used to control the rate of adaptation in an acoustic echo canceller. When an echo path change is declared, the rate of adaptation may be increased. However, an echo path change should not be declared in the presence of double talk, because rapid adaptation during double talk is undesirable. Accordingly, various features are disclosed for detecting echo path changes while avoiding the declaration of such changes in the presence of double talk.
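One common way to build such a detector is with a shadow (background) filter that adapts quickly alongside the foreground canceller. The sketch below shows only the control decision, with a hypothetical interface: ERLE estimates for both filters and a double-talk flag (e.g., from a Geigel or coherence-based detector) are assumed to be computed elsewhere, and all values are illustrative.

    FAST_STEP, SLOW_STEP = 0.5, 0.05   # hypothetical NLMS step sizes

    def choose_step_size(foreground_erle_db, shadow_erle_db, double_talk):
        """Raise the adaptation rate only on a declared echo path change,
        and never declare a change while double talk is detected."""
        path_change = (not double_talk and
                       shadow_erle_db > foreground_erle_db + 6.0)
        return FAST_STEP if path_change else SLOW_STEP

    print(choose_step_size(20.0, 5.0, double_talk=False))  # 0.05: no change
    print(choose_step_size(5.0, 15.0, double_talk=False))  # 0.5: change declared
    print(choose_step_size(5.0, 15.0, double_talk=True))   # 0.05: double talk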
Abstract:
Embodiments of systems and methods are described for determining which of a plurality of beamformed audio signals to select for signal processing. In some embodiments, a plurality of audio input signals are received from a microphone array comprising a plurality of microphones. A plurality of beamformed audio signals are determined based on the plurality of audio input signals, each beamformed audio signal corresponding to a different direction. A plurality of signal features may be determined for each beamformed audio signal. A smoothed feature may be determined for each beamformed audio signal based on at least a portion of the plurality of signal features. The beamformed audio signal corresponding to the maximum smoothed feature may be selected for further processing.
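A minimal sketch of the selection step, assuming frame energy as the per-beam signal feature (a stand-in for whatever features an implementation actually computes) and exponential averaging as the smoothing; the smoothing constant is illustrative.

    import numpy as np

    ALPHA = 0.9  # exponential smoothing factor

    def select_beam(beam_frames, smoothed):
        """beam_frames: (n_beams, frame_len) beamformed samples for one frame.
        smoothed: (n_beams,) running smoothed features, updated in place."""
        features = np.mean(beam_frames ** 2, axis=1)   # per-beam frame energy
        smoothed[:] = ALPHA * smoothed + (1 - ALPHA) * features
        return int(np.argmax(smoothed))                # beam with max feature

    n_beams, frame_len = 6, 160
    smoothed = np.zeros(n_beams)
    for _ in range(10):                     # simulate ten frames
        frames = np.random.randn(n_beams, frame_len)
        frames[2] *= 3.0                    # beam 2 carries the loudest source
        best = select_beam(frames, smoothed)
    print("selected beam:", best)           # -> 2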
Abstract:
A system is described that improves audio processing by performing dereverberation and noise reduction during a communication session. In some examples, the system may include a deep neural network (DNN) configured to perform speech enhancement, located after an Acoustic Echo Cancellation (AEC) component. For example, the DNN may process isolated audio data output by the AEC component to jointly mitigate additive noise and reverberation. In other examples, the system may include a DNN configured to perform acoustic interference cancellation, which may jointly mitigate additive noise, reverberation, and residual echo, removing the need for separate residual echo suppression processing. The DNN is configured to process complex-valued spectrograms corresponding to the isolated audio data and/or to estimated echo data generated by the AEC component.
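The PyTorch sketch below illustrates one plausible shape for such a stage: the complex spectrograms of the AEC output and the estimated echo enter as real/imaginary channels, and the network predicts a complex ratio mask applied to the AEC output. The layer sizes and mask formulation are assumptions for illustration, not the architecture from the abstract.

    import torch
    import torch.nn as nn

    class EnhancementDNN(nn.Module):
        def __init__(self, n_inputs=2, n_hidden=32):
            super().__init__()
            # n_inputs complex spectrograms -> 2 * n_inputs real channels
            self.net = nn.Sequential(
                nn.Conv2d(2 * n_inputs, n_hidden, kernel_size=3, padding=1),
                nn.ReLU(),
                nn.Conv2d(n_hidden, 2, kernel_size=3, padding=1),  # mask re/im
            )

        def forward(self, specs):
            """specs: complex tensor (batch, n_inputs, freq, time)."""
            x = torch.cat([specs.real, specs.imag], dim=1)
            m = self.net(x)
            mask = torch.complex(m[:, 0], m[:, 1])
            return mask * specs[:, 0]       # mask the AEC output spectrogram

    # Example: AEC output plus estimated echo, 257 bins x 100 frames.
    specs = torch.randn(1, 2, 257, 100, dtype=torch.cfloat)
    print(EnhancementDNN()(specs).shape)    # torch.Size([1, 257, 100])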
Abstract:
The systems, devices, and processes described herein may identify a beam of a voice-controlled device that is directed toward a reflective surface, such as a wall. The beams may be created by a beamformer. An acoustic echo canceller (AEC) may create filter coefficients for a reference sound. The filter coefficients may be analyzed to identify beams that include multiple peaks, which may indicate the presence of one or more reflective surfaces. Using the amplitudes of the peaks and the time delay between them, the device may determine that it is close to a reflective surface in the direction of the beam.
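A minimal sketch of the peak analysis, using SciPy's peak finder on one beam's AEC filter taps (an estimate of that beam's echo path impulse response). The relative-height cutoff and the factor of two in the distance estimate (out-and-back travel of the reflection) are illustrative assumptions.

    import numpy as np
    from scipy.signal import find_peaks

    SPEED_OF_SOUND = 343.0  # m/s

    def reflection_distance(aec_taps, sample_rate, min_ratio=0.3):
        """Estimated distance (m) to a reflective surface for this beam,
        or None if the filter shows only a single dominant peak."""
        mags = np.abs(aec_taps)
        peaks, _ = find_peaks(mags, height=min_ratio * mags.max())
        if len(peaks) < 2:
            return None
        delay = peaks[1] - peaks[0]        # direct path -> first reflection
        return SPEED_OF_SOUND * delay / (2 * sample_rate)

    # Example: synthetic filter with a direct path and one reflection.
    taps = np.zeros(512)
    taps[40], taps[133] = 1.0, 0.5         # reflection 93 samples later
    print(reflection_distance(taps, 16000))  # ~1.0 m to the wall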
Abstract:
A wearable computer is configured to use beamforming techniques to isolate a user's speech from extraneous audio signals occurring within a physical environment. A microphone array of the wearable computer may generate audio signal data from an utterance spoken by the user. One or more motion sensors of the wearable computer may generate motion data from movement of the wearable computer. This motion data may be used to determine a direction vector pointing from the wearable computer to the user's mouth, and a beampattern may be defined with a beampattern direction in substantial alignment with the determined direction vector, focusing the microphone array on the user's mouth for speech isolation.
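Once a device-to-mouth direction vector is available, steering the beampattern can be as simple as computing per-microphone delays for a delay-and-sum beamformer, as in the sketch below. The vector is assumed to have already been estimated from the motion data; deriving it from raw IMU samples is outside this sketch, and the geometry is illustrative.

    import numpy as np

    SPEED_OF_SOUND = 343.0  # m/s

    def steering_delays(mic_positions, direction):
        """Per-microphone delays (s) aligning a plane wave arriving from
        `direction` (unit vector from the device toward the mouth)."""
        direction = np.asarray(direction, dtype=float)
        direction /= np.linalg.norm(direction)
        delays = mic_positions @ direction / SPEED_OF_SOUND
        return delays - delays.min()       # shift so all delays are >= 0

    # Example: four mics on a 4 cm square; mouth up and to the left.
    mics = np.array([[0.0, 0.0, 0.0], [0.04, 0.0, 0.0],
                     [0.0, 0.04, 0.0], [0.04, 0.04, 0.0]])
    print(steering_delays(mics, direction=[-0.5, 0.7, 0.5]))

Summing the microphone channels after applying these delays (with fractional-sample interpolation in practice) yields a beampattern aimed at the mouth.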
Abstract:
Sound is banked laterally over an array of microphones arranged on a rear surface of a device. Sound enters a duct behind the device from different directions via inlets along the sides of the device, and the duct directs the sound waves across the microphone array. An effective direction from which the banked sounds originated, relative to a front of the device, is determined. Based on the determined effective direction, the device applies spatial filtering to isolate received sound waves, selectively increasing the signal-to-noise ratio of sound from a selected source and at least partially occluding sounds from other sources.
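Estimating the effective direction typically starts from inter-microphone time differences. The sketch below computes the time difference of arrival between two duct microphones with GCC-PHAT; mapping that delay to an arrival angle depends on the duct and inlet geometry and is omitted. The example signals are synthetic.

    import numpy as np

    def gcc_phat_delay(sig_a, sig_b, sample_rate):
        """Delay (s) by which sig_b lags sig_a, estimated via GCC-PHAT."""
        n = len(sig_a) + len(sig_b)
        A = np.fft.rfft(sig_a, n)
        B = np.fft.rfft(sig_b, n)
        cross = np.conj(A) * B
        cross /= np.abs(cross) + 1e-12     # phase transform weighting
        cc = np.fft.irfft(cross, n)
        shift = int(np.argmax(np.abs(cc)))
        if shift > n // 2:                 # wrap negative lags
            shift -= n
        return shift / sample_rate

    # Example: the second mic receives the same burst 5 samples later.
    fs, burst = 16000, np.random.randn(1024)
    a = np.concatenate([burst, np.zeros(5)])
    b = np.concatenate([np.zeros(5), burst])
    print(gcc_phat_delay(a, b, fs) * fs)   # ~5 samples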