-
公开(公告)号:US11232808B2
公开(公告)日:2022-01-25
申请号:US16394717
申请日:2019-04-25
Applicant: Amazon Technologies, Inc.
Inventor: Zhaoqing Ma , Tony Roy Hardie , Christo Frank Devaraj
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
-
公开(公告)号:US11062711B2
公开(公告)日:2021-07-13
申请号:US16693784
申请日:2019-11-25
Applicant: Amazon Technologies, Inc.
Inventor: Tapas Kanti Roy , Brian Oliver , Christo Frank Devaraj
Abstract: Systems and methods for establishing communication connections using speech, such as establishing calls between speech-controlled devices, are described. A first speech-controlled device receives a communication request in the form of audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient, a subject for the call, and a device associated with the recipient. The server then sends a message indicating the communication request and audio data corresponding to the communication topic to the recipient's speech-controlled device. The recipient device outputs audio to the recipient requesting whether the recipient accepts the communication request. The recipient audibly refuses or accepts the communication request, and the recipient's speech-controlled device sends an indication of the recipient's audible decision to the server. If the recipient accepted the communication request, the server causes a communication connection be established between the two speech-controlled devices.
-
公开(公告)号:US20210074275A1
公开(公告)日:2021-03-11
申请号:US17067315
申请日:2020-10-09
Applicant: Amazon Technologies, Inc.
Inventor: Neil Christopher Fritz , Lakshya Bhagat , Scott Southwood , Katelyn Doran , Brett Lounsbury , Christo Frank Devaraj
Abstract: Audio data, corresponding to an utterance spoken by a person within a detection range of a voice communications device, can include an audio message portion. The audio data can be captured and analyzed to determine the intent to send a message. Based at least in part upon that intent, a remaining portion of the audio data can be analyzed to determine the intended message target or recipient, as well as the portion corresponding to the actual message payload. Once determined, the audio file can be trimmed to the message payload, and the message payload of the audio data can be delivered as an audio message to the target recipient.
-
公开(公告)号:US10916243B2
公开(公告)日:2021-02-09
申请号:US15390944
申请日:2016-12-27
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Venkata Krishnan Ramamoorthy , Gregory Michael Hart , Samuel Scott Gigliotti , Scott Southwood , Ran Mokady , Hale Sostock , Roman Yusufov
Abstract: Methods and systems for facilitating communications between shared electronic devices are described herein. In some embodiments, a group account may be assigned to a shared electronic device. The group account may include one or more user accounts, where individuals associated with those user accounts may interact with the shared electronic device, and also may form a part of the group account. When a message is sent from one shared electronic device to another personal device or shared electronic device, the message may be indicated as being sent from the group account, as if the shared electronic device corresponds to its own separate account. In some embodiments, speaker identification processing may be employed to determine a speaker of the message and, if the speaker is able to be identified, the message may be sent from the corresponding speaker's user account instead of the shared electronic device's corresponding group account.
-
公开(公告)号:US20190318758A1
公开(公告)日:2019-10-17
申请号:US16394717
申请日:2019-04-25
Applicant: Amazon Technologies, Inc.
Inventor: Zhaoqing Ma , Tony Roy Hardie , Christo Frank Devaraj
Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.
-
公开(公告)号:US10425780B1
公开(公告)日:2019-09-24
申请号:US15902762
申请日:2018-02-22
Applicant: Amazon Technologies, Inc.
Abstract: A system that determines that devices are co-located in an acoustic region and selects a single device to which to send incoming notifications for the acoustic region. The system may group devices into separate acoustic regions based on selection data that selects between similar audio data received from multiple devices. The system may select the best device for each acoustic region based on a frequency that the device was selected previously, input/output capabilities of the device, a proximity to a user, or the like. The system may send a notification to a single device in each of the acoustic regions so that a user receives a single notification instead of multiple unsynchronized notifications. The system may also determine that acoustic regions are associated with different locations and select acoustic regions to which to send a notification based on location.
-
公开(公告)号:US10186266B1
公开(公告)日:2019-01-22
申请号:US15392810
申请日:2016-12-28
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
-
公开(公告)号:US10157614B1
公开(公告)日:2018-12-18
申请号:US15392827
申请日:2016-12-28
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
IPC: G10L15/00 , G10L15/22 , H04L29/08 , H04L12/58 , G10L15/02 , G10L15/18 , G10L13/08 , G10L17/22 , G10L17/06 , G06F17/30
Abstract: Methods and systems for redirecting messages based on contextual information associated with the messages are described herein. In some embodiments, a first individual may speak an utterance including a message, where the utterance indicates a first recipient for the message. Audio data representing the utterance may be provided to a speech-processing system, which may performed automatic speech recognition processing, natural language understanding processing, and contextual recognition processing to the audio data. In some embodiments, the contextual recognition processing may determine that the message may be intended for a second recipient. If so, the speech-processing system may cause the message to be redirected to the second recipient, such that the second recipient may receive the message as opposed to the first recipient.
-
公开(公告)号:US20180061402A1
公开(公告)日:2018-03-01
申请号:US15254359
申请日:2016-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Manish Kumar Dalmia , Tony Roy Hardie , Ran Mokady , Nick Ciubotariu , Sandra Lemon
CPC classification number: G10L15/22 , G06F17/2765 , G06F17/278 , G10L13/00 , G10L15/1822 , G10L17/22 , G10L17/24 , G10L2015/225 , G10L2015/227 , H04L67/306
Abstract: Systems, methods, and devices for escalating voice-based interactions via speech-controlled devices are described. Speech-controlled devices capture audio, including wakeword portions and payload portions, for sending to a server to relay messages between speech-controlled devices. In response to determining the occurrence of an escalation event, such as repeated messages between the same two devices, the system may automatically change a mode of a speech-controlled device, such as no longer requiring a wakeword, no longer requiring an indication of a desired recipient, or automatically connecting two speech-controlled devices in a voice-chat mode. In response to determining the occurrence of further escalation events, the system may initiate a real-time call between the speech-controlled devices.
-
公开(公告)号:US11968271B2
公开(公告)日:2024-04-23
申请号:US18089974
申请日:2022-12-28
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Christopher Geiger Parker , Sumedha Arvind Kshirsagar , James Alexander Stanton , Aaron Takayanagi Barnet , Venkatesh Kancharla , Gregory Michael Hart
IPC: H04B1/38 , G06Q10/10 , H04L67/306 , H04M1/72433 , H04M3/42 , H04M3/533 , G10L15/26 , H04W4/80
CPC classification number: H04L67/306 , G06Q10/10 , H04M1/72433 , H04M3/42238 , H04M3/53366 , G10L15/26 , H04M2203/556 , H04M2250/74 , H04W4/80
Abstract: Systems and methods for sender profile and/or recipient profile disambiguation and/or confirmation are disclosed. In instances where a sender profile is not indicated by a user sending a communication from a communal device, heuristic data may be utilized to infer the sender profile. Similar heuristic data may also be used when selection of the sender profile is associated with a low confidence level. Heuristic data may also be used to infer the recipient profile when the user does not indicate the recipient profile or when selection of the recipient profile is associated with a low confidence. Various confirmations may result from the sender and recipient profile disambiguation.
-
-
-
-
-
-
-
-
-