-
公开(公告)号:US12126171B1
公开(公告)日:2024-10-22
申请号:US17116487
申请日:2020-12-09
Applicant: Amazon Technologies, Inc.
Inventor: Nayana Teja Chiluvuri , Chang Hwa Rob Yang , Xunwei Yu , Kenneth Lawrence Staton , Cameron Dean Whitehouse , Matthew L. Chaboud , Sajesh Kumar Saran , Pushkaraksha Gejji , Krzysztof Marcin Walczak
CPC classification number: H02J3/14 , G06Q50/06 , H02J13/00017
Abstract: Described is an apparatus to concurrently measure multiple input voltages at a high sampling data rate, such as at least two mega-samples per second. The apparatus may include a plurality of voltage data acquisition components that concurrently sample different input voltages and produce respective voltage data samples. Each of the plurality of voltage data acquisition components may be directly coupled to a field programmable gate array that receives the voltage data samples, packetizes those voltage data samples, and provide the packetized voltage data samples to a system on chip.
-
2.
公开(公告)号:US10546581B1
公开(公告)日:2020-01-28
申请号:US15699586
申请日:2017-09-08
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji
IPC: G10L15/22 , G10L21/0208 , G10L21/0232 , G10K11/16 , G10L15/20 , G10L25/84
Abstract: An echo cancellation system that synchronizes output audio data with input audio data in a heterogeneous system. The system may append a most recent outgoing audio data frame to an incoming audio data frame to synchronize a receiving buffer with a transmission buffer. By synchronizing the receiving buffer with the transmission buffer, the incoming audio data frames may be associated with corresponding outgoing audio data frames. After synchronizing the incoming audio data frames and the outgoing audio data frames, the system may perform Acoustic Echo Cancellation by removing the outgoing audio data frames from the incoming audio data frames.
-
公开(公告)号:US11056098B1
公开(公告)日:2021-07-06
申请号:US16455381
申请日:2019-06-27
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji
Abstract: Embodiments describe a method for speech endpoint detection including receiving identification data for a first state associated with a first frame of speech data from a WFST language model, determining that the first frame of the speech data includes silence data, incrementing a silence counter associated with the first state, copying a value of the silence counter of the first state to a corresponding silence counter field in a second state associated with the first state in an active state list, and determining that the value of the silence counter for the first state is above a silence threshold. The method further includes, determining that an endpoint of the speech has occurred in response to determining that the silence counter is above the silence threshold, and outputting text data representing a plurality of words determined from the speech data that was received prior to the endpoint.
-
公开(公告)号:US10284695B1
公开(公告)日:2019-05-07
申请号:US15912331
申请日:2018-03-05
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji , Prachi Avinash Pant , Matthew Carter , Wilfrido Loor Canizares , Jack Shih-Chieh Wei
Abstract: This disclosure describes voice-enabled modular devices that act as functional, voice-controlled endpoints at which users may provide voice commands. The voice-enabled modular devices may include a power unit which may be installed into a wall permanently to serve as a power source, and a front-panel module which detachably couples to the power unit via a hardware interface. The hardware interface may allow the front-panel module and power unit to communicate with each other, and may further allow the power unit to provide power to the front-panel module. The power unit of the voice-enabled modular device described herein may be configured to provide power to multiple front-panel modules having different capabilities, and potentially different power requirements, thereby supporting multiple front-panel modules to increase the functionality of the voice-enabled modular devices.
-
公开(公告)号:US11676585B1
公开(公告)日:2023-06-13
申请号:US17401141
申请日:2021-08-12
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji
IPC: G10L15/00 , G10L15/18 , G10L15/02 , G10L15/14 , G10L15/19 , G10L15/16 , G10L15/193 , G06F40/289 , G10L15/08
CPC classification number: G10L15/1822 , G10L15/02 , G10L15/14 , G10L15/19 , G06F40/289 , G10L15/083 , G10L15/16 , G10L15/18 , G10L15/193
Abstract: Embodiments describe a method for decoding speech including receiving speech input at an audio input device, generating speech data that is a digital representation of the speech input; extracting acoustic features of the speech data, assigning acoustic scores to the acoustic features, receiving data representing the acoustic features and the acoustic scores, decoding the data representing the acoustic features into a word, having a word score, by referencing a WFST language model, modifying the word score into a new word score based on a personalized grammar model stored in the external memory device, the processor is separate from and external to the WFST accelerator, and determining an intent represented by a plurality of words outputted by the WFST accelerator, where the plurality of words include the word and the new word score.
-
6.
公开(公告)号:US09812146B1
公开(公告)日:2017-11-07
申请号:US15044495
申请日:2016-02-16
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji , Arvind Mandhani
IPC: H04B3/20 , G10L21/02 , G10L21/0208 , G10L15/22 , G10L21/0216 , H04B3/23 , H04M9/08
CPC classification number: G10L21/0205 , G10L21/0208 , G10L2015/223 , G10L2021/02082 , G10L2021/02163 , H04B3/23 , H04M9/082
Abstract: An echo cancellation system that synchronizes output audio data with input audio data in a heterogeneous system. The system may increment a counter as outgoing audio frames are sent to a digital-to-analog converter in a speaker. As incoming audio frames are received by an analog-to-digital converter in a microphone, the system may copy contents of the counter into the incoming audio frames. Based on the contents of the counter, the incoming audio frames may be associated with corresponding outgoing audio frames. After synchronizing the incoming audio frames and the outgoing audio frames, the system may perform Acoustic Echo Cancellation by removing the outgoing audio frames from the incoming audio frames.
-
7.
公开(公告)号:US09424456B1
公开(公告)日:2016-08-23
申请号:US14748480
申请日:2015-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Krishna Kamath Koteshwara , Pushkaraksha Gejji
IPC: G06K9/00
CPC classification number: G06K9/0002 , G06K9/00093 , G06K9/001
Abstract: An ultrasonic finger print authentication system that generates a three-dimensional representation of a fingerprint. High frequency sound waves are used as a medium for imaging finer details of the patterns on a fingertip, including the ridge and valley formations. Multiple frequencies and beam-forming are used to quickly map the ridges. Acoustic resonance is used to determine the depths of the intervening valleys.
Abstract translation: 一种生成指纹的三维表示的超声波指纹识别系统。 使用高频声波作为用于在指尖上形成更细微图案的介质,包括脊和谷地层。 多个频率和波束形成用于快速映射脊。 声共振用于确定中间山谷的深度。
-
公开(公告)号:US11727917B1
公开(公告)日:2023-08-15
申请号:US17328114
申请日:2021-05-24
Applicant: Amazon Technologies, Inc.
Inventor: Pushkaraksha Gejji
CPC classification number: G10L15/05 , G10L25/87 , G10L15/01 , G10L2025/783
Abstract: Embodiments describe a method for speech endpoint detection including receiving identification data for a first state associated with a first frame of speech data from a WFST language model, determining that the first frame of the speech data includes silence data, incrementing a silence counter associated with the first state, copying a value of the silence counter of the first state to a corresponding silence counter field in a second state associated with the first state in an active state list, and determining that the value of the silence counter for the first state is above a silence threshold. The method further includes, determining that an endpoint of the speech has occurred in response to determining that the silence counter is above the silence threshold, and outputting text data representing a plurality of words determined from the speech data that was received prior to the endpoint.
-
公开(公告)号:US10726835B2
公开(公告)日:2020-07-28
申请号:US15389818
申请日:2016-12-23
Applicant: Amazon Technologies, Inc.
Inventor: Albert John Yu Sam Chua , Pushkaraksha Gejji , Wilfrido Loor Canizares , Adam Kenneth Cybart , Aleksandar Pance , Marc Rene Walliser
Abstract: A modular controller may be mounted in an opening, such as a standard single wide or double wide electrical junction box, in a wall or other surface. The modular controller may include a power module and a front module. The power module may be mounted in the opening of the surface, and may be configured to provide electrical power to the front module. The front module may be detachably coupleable to the power module. The front module may be configured to receive audio commands, gesture commands, and/or presence input corresponding to a desired action, and may cause the action to be performed by a device of the front module and/or an external device. The front module may include various devices (e.g., components) capable of providing various functionalities, and may be selected for coupling to a power module in a particular location based at least in part on the functionalities.
-
公开(公告)号:US10334702B1
公开(公告)日:2019-06-25
申请号:US15633693
申请日:2017-06-26
Applicant: Amazon Technologies, Inc.
Inventor: Mitchell Theodore Orysh , Pushkaraksha Gejji
IPC: G05F1/00 , H05B37/02 , H05B39/04 , H05B41/36 , H02J13/00 , H03K17/96 , G10L15/22 , G10L15/30 , H04L29/06 , H04L9/14
Abstract: This disclosure describes, in part, voice-controlled light dimmers that act as voice-controlled endpoints at which users may provide voice commands. These light dimmers include a front panel module coupled to a power module using a hardware interface. The front panel module may receive input from a user indicating commands for controlling appliances, and send communications to the power module using the hardware interface to control the appliances. In some examples, the communications involve encrypted data sent using an inter-integrated circuit (I2C) protocol using the hardware interface to an electrically isolated power module. The power provided to the appliances may be controlled by the power module of the voice-controlled light dimmer.
-
-
-
-
-
-
-
-
-