-
公开(公告)号:US09858927B2
公开(公告)日:2018-01-02
申请号:US15083902
申请日:2016-03-29
Applicant: Amazon Technologies, Inc.
Inventor: Robert Williams , Steven Todd Rabuchin , Gregory Michael Hart
IPC: H03G3/20 , H04R29/00 , H04B3/00 , H04R27/00 , H04R3/00 , G10L15/22 , G10L13/02 , G06F17/28 , H04R3/12 , G06F3/16 , G06F17/30
CPC classification number: G10L15/22 , G06F3/165 , G06F17/28 , G06F17/30749 , G10L13/02 , G10L21/0364 , G10L2015/223 , H04R3/12 , H04R2420/07 , H04R2430/01
Abstract: A system that is capable of controlling multiple entertainment systems and/or speakers using voice commands. The system receives voice commands and may determine audio sources and speakers indicated by the voice commands. The system may generate audio data from the audio sources and may send the audio data to the speakers using multiple interfaces. For example, the system may send the audio data directly to the speakers using a network address, may send the audio data to the speakers via a voice-enabled device or may send the audio data to the speakers via a speaker controller. The system may generate output zones including multiple speakers and may associate input devices with speakers within the output zones. For example, the system may receive a voice command from an input device in an output zone and may reduce output audio generated by speakers in the output zone.
-
公开(公告)号:US09640179B1
公开(公告)日:2017-05-02
申请号:US13928751
申请日:2013-06-27
Applicant: Amazon Technologies, Inc.
Inventor: Gregory Michael Hart , Kavitha Velusamy , William Spencer Worley, III
IPC: G10L15/20
CPC classification number: G10L15/20 , G10L21/0208 , G10L2021/02082 , H04R3/005 , H04R2420/09
Abstract: Techniques for tailoring beamforming techniques to environments such that processing resources may be devoted to a portion of an audio signal corresponding to a lobe of a beampattern that is most likely to contain user speech. The techniques take into account both acoustic characteristics of an environment and heuristics regarding lobes that have previously been found to include user speech.
-
公开(公告)号:US09570071B1
公开(公告)日:2017-02-14
申请号:US14828263
申请日:2015-08-17
Applicant: Amazon Technologies, Inc.
Inventor: Gregory Michael Hart , Jeffrey P. Bezos
IPC: G10L15/00 , G10L15/20 , G10L21/0272 , G10L21/0216 , G10L21/0208
CPC classification number: G10L15/20 , G10L15/30 , G10L19/00 , G10L21/00 , G10L21/0216 , G10L21/0272 , G10L2021/02082 , G10L2021/02166
Abstract: A voice interaction architecture that compiles multiple audio signals captured at different locations within an environment, determines a time offset between a primary audio signal and other captured audio signals and identifies differences between the primary signal and the other signal(s). Thereafter, the architecture may provide the primary audio signal, an indication of the determined time offset(s) and the identified differences to remote computing resources for further processing. For instance, the architecture may send this information to a network-accessible distributed computing platform that performs beamforming and/or automatic speech recognition (ASR) on the received audio. The distributed computing platform may in turn determine a response to provide based upon the beamforming and/or ASR.
Abstract translation: 编码在环境内的不同位置处捕获的多个音频信号的语音交互架构确定主音频信号和其它捕获的音频信号之间的时间偏移,并识别主信号与其它信号之间的差异。 此后,架构可以提供主音频信号,所确定的时间偏移的指示和所识别的与远程计算资源的差异以进一步处理。 例如,架构可以将该信息发送到在所接收的音频上执行波束成形和/或自动语音识别(ASR)的网络可访问的分布式计算平台。 分布式计算平台可以依次确定基于波束形成和/或ASR提供的响应。
-
公开(公告)号:US20240005918A1
公开(公告)日:2024-01-04
申请号:US18367779
申请日:2023-09-13
Applicant: Amazon Technologies, Inc.
Inventor: John Daniel Thimsen , Gregory Michael Hart , Ryan Paul Thomas
IPC: G10L15/20
CPC classification number: G10L15/20
Abstract: An audio controlled assistant captures environmental noise and converts the environmental noise into audio signals. The audio signals are provided to a system which analyzes the audio signals for a plurality of audio prompts, which have been customized for the acoustic environment surrounding the audio controlled assistant by an acoustic modeling system. The system configured to detect the presence of an audio prompt in the audio signals and transmit instructions associated with the detected audio prompt to at least one of the audio controlled assistant or one or more cloud based services, in response.
-
公开(公告)号:US11862153B1
公开(公告)日:2024-01-02
申请号:US16578838
申请日:2019-09-23
Applicant: Amazon Technologies, Inc.
Inventor: John Daniel Thimsen , Gregory Michael Hart , Ryan Paul Thomas
CPC classification number: G10L15/20
Abstract: An audio controlled assistant captures environmental noise and converts the environmental noise into audio signals. The audio signals are provided to a system which analyzes the audio signals for a plurality of audio prompts, which have been customized for the acoustic environment surrounding the audio controlled assistant by an acoustic modeling system. The system configured to detect the presence of an audio prompt in the audio signals and transmit instructions associated with the detected audio prompt to at least one of the audio controlled assistant or one or more cloud based services, in response.
-
公开(公告)号:US11657812B2
公开(公告)日:2023-05-23
申请号:US17030445
申请日:2020-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
IPC: G10L15/00 , G10L15/22 , G06F3/16 , G10L15/26 , G06F16/683 , H04L51/226 , H04L67/63 , G10L13/08 , G10L15/02 , G10L15/18 , G10L17/06 , G10L17/22 , G10L17/00 , H04L51/02
CPC classification number: G10L15/22 , G06F3/167 , G06F16/685 , G10L13/08 , G10L15/02 , G10L15/1815 , G10L15/26 , G10L17/06 , G10L17/22 , H04L51/226 , H04L67/63 , G10L17/00 , G10L2015/223 , H04L51/02
Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
-
公开(公告)号:US20210110823A1
公开(公告)日:2021-04-15
申请号:US17030445
申请日:2020-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
IPC: G10L15/22 , G06F3/16 , G10L15/26 , G06F16/683 , H04L12/58 , H04L29/08 , G10L13/08 , G10L15/02 , G10L15/18 , G10L17/06 , G10L17/22
Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
-
公开(公告)号:US10674114B1
公开(公告)日:2020-06-02
申请号:US15723982
申请日:2017-10-03
Applicant: Amazon Technologies, Inc.
Inventor: Michael Douglas McQueen , Meng Li , Eric Alan Breitbard , Robert Steven Murdock , Julien George Beguin , Gregory Michael Hart , David A. Limp , Scott Ian Blanksteen
Abstract: A video display hub is mounted in a common household area such as a kitchen or family room. During times that have been designated as being available for communications, devices in first and second households exchange and display blurred video, allowing users in each household to see vague shapes and movements of the other household. Upon noticing activity, a user in the first household may initiate a video conversation, causing the video from the first household to be unblurred and causing unobscured voice to be transmitted to the second household. A user in the second household may respond by allowing the video conversation to be fully enabled, allowing the video from the second household to be unblurred and unobscured voice to be transmitted back to the first household.
-
公开(公告)号:US10354621B1
公开(公告)日:2019-07-16
申请号:US15910650
申请日:2018-03-02
Applicant: Amazon Technologies, Inc.
Inventor: Michael Douglas McQueen , Meng Li , Eric Alan Breitbard , Robert Steven Murdock , Julien George Beguin , Gregory Michael Hart , Scott Ian Blanksteen
Abstract: A video display hub is mounted in a common household area such as a kitchen or family room. The display hub is configured to display various types of information for users in the area, such as weather, traffic updates, schedules, notes, messages, lists, news, etc. When the user is at a distance from the display hub, information is presented at a relatively low density, with a low level of granularity and detail in conjunction with large fonts, graphics, and icons. When the user is close to the display hub, information is presented at a relatively high density, with a high level of granularity and detail in conjunction with small fonts, graphics, and icons.
-
公开(公告)号:USD841644S1
公开(公告)日:2019-02-26
申请号:US29629112
申请日:2017-12-11
Applicant: Amazon Technologies, Inc.
-
-
-
-
-
-
-
-
-