-
公开(公告)号:US11546434B2
公开(公告)日:2023-01-03
申请号:US17101662
申请日:2020-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Christopher Geiger Parker , Sumedha Arvind Kshirsagar , James Alexander Stanton , Aaron Takayanagi Barnet , Venkatesh Kancharla , Gregory Michael Hart
IPC: H04B1/38 , H04L67/306 , G06Q10/10 , H04M3/42 , H04M3/533 , H04M1/72433 , H04W4/80 , G10L15/26
Abstract: Systems and methods for sender profile and/or recipient profile disambiguation and/or confirmation are disclosed. In instances where a sender profile is not indicated by a user sending a communication from a communal device, heuristic data may be utilized to infer the sender profile. Similar heuristic data may also be used when selection of the sender profile is associated with a low confidence level. Heuristic data may also be used to infer the recipient profile when the user does not indicate the recipient profile or when selection of the recipient profile is associated with a low confidence. Various confirmations may result from the sender and recipient profile disambiguation.
-
公开(公告)号:US10482884B1
公开(公告)日:2019-11-19
申请号:US15663514
申请日:2017-07-28
Applicant: Amazon Technologies, Inc.
Inventor: Jeff Bradley Beal , Kevin Robert Charter , Ajay Gopalakrishnan , Sumedha Arvind Kshirsagar , Nishant Kumar
Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform multiple actions corresponding to this intent. The platform may select a target action to perform, and may engage in a back-and-forth dialog to obtain information for completing the target action. The action may include streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user.
-
公开(公告)号:US20190156830A1
公开(公告)日:2019-05-23
申请号:US16251901
申请日:2019-01-18
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
Abstract: Methods and systems for providing message playback using a shared electronic device is described herein. In response to receiving a request to output messages, a speech-processing system may determine a group account associated with a requesting device, and may determine messages stored by a message data store for the group account. Speaker identification processing may also be performed to determine a speaker of the request. A user account associated with the speaker, and messages stored for the user account, may be determined. A summary response indicating the user account's messages and the group account's message may then be generated such that the user account messages are identified prior to the group account's messages. The messages may then be analyzed to determine an appropriate voice user interface for the requester such that the playback of the messages using a shared electronic device is more natural and conversational.
-
公开(公告)号:US10186267B1
公开(公告)日:2019-01-22
申请号:US15392844
申请日:2016-12-28
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , Brian Oliver , Sumedha Arvind Kshirsagar , Gregory Michael Hart , Ran Mokady
IPC: G10L15/00 , G10L15/22 , H04L29/08 , H04L12/58 , G10L15/18 , G10L17/02 , G10L17/22 , G10L13/08 , G06F17/30 , G10L15/08
Abstract: Methods and systems for prioritizing messages for playback are described herein. In some embodiments, a request for messages to be output may be received by a speech-processing system. The speech-processing system may include a message database that includes messages received for a speaker of the request's user account and/or a group account associated with a shared electronic device that the request was received from. One or more prioritization rules may be applied to the messages to order the messages for playback in order to provide an optimal voice user interface for the requesting individual. For instance, messages received for the user account may be prioritized over messages received for the group account, messages received from a similar sender or a high priority sender may be prioritized over other messages, and messages that are indicating as being urgent may be prioritized over messages that are indicated as being non-urgent.
-
公开(公告)号:US20180309866A1
公开(公告)日:2018-10-25
申请号:US15616713
申请日:2017-06-07
Applicant: Amazon Technologies, Inc.
Inventor: Christo Frank Devaraj , James Alexander Stanton , Sumedha Arvind Kshirsagar , Christopher Geiger Parker , Aaron Takayanagi Barnet , Venkatesh Kancharla , Gregory Michael Hart
CPC classification number: H04M1/72547 , G10L13/00 , G10L15/02 , G10L15/1815 , G10L15/22 , H04L67/306 , H04W4/80
Abstract: Systems and methods for sender profile and/or recipient profile disambiguation and/or confirmation are disclosed. In instances where a sender profile is not indicated by a user sending a communication from a communal device, heuristic data may be utilized to infer the sender profile. Similar heuristic data may also be used when selection of the sender profile is associated with a low confidence level. Heuristic data may also be used to infer the recipient profile when the user does not indicate the recipient profile or when selection of the recipient profile is associated with a low confidence. Various confirmations may result from the sender and recipient profile disambiguation.
-
公开(公告)号:US09405741B1
公开(公告)日:2016-08-02
申请号:US14223648
申请日:2014-03-24
Applicant: Amazon Technologies, Inc.
Inventor: Thomas Schaaf , Sumedha Arvind Kshirsagar , Roger Alix-Gaudreau , Remus Razvan Mois , Rafal Kuklinski , Derek Christopher Murman
CPC classification number: G06F17/27 , G06F17/274 , G10L13/00 , G10L15/08 , G10L15/22
Abstract: Features are disclosed for recognizing inappropriate content in an output. The offensive content may be generated as a result of a speech processing error. A system may identify the inappropriate elements of a generated output and select among different appropriate alternatives. The system may be adjusted based on certain user characteristics. The system may be localized based on language and cultural features. The system may modify the generated output based on characteristics such as the tolerance threshold of known persons in the proximity of the system. The tolerance threshold may further be used to personalize and modify available content. Models used by the system may be further trained using input from a user.
Abstract translation: 公开了用于识别输出中的不适当内容的特征。 可能由于语音处理错误而产生令人反感的内容。 系统可以识别生成的输出的不适当元素,并在不同的适当替代方案中进行选择。 可以基于某些用户特征来调整系统。 该系统可以基于语言和文化特征进行本地化。 系统可以基于诸如在系统附近的已知人员的容许阈值的特性来修改生成的输出。 公差阈值还可用于个性化和修改可用内容。 可以使用来自用户的输入来进一步训练系统使用的模型。
-
公开(公告)号:US12288561B1
公开(公告)日:2025-04-29
申请号:US18407271
申请日:2024-01-08
Applicant: Amazon Technologies, Inc.
Inventor: Jeff Bradley Beal , Kevin Robert Charter , Ajay Gopalakrishnan , Sumedha Arvind Kshirsagar , Nishant Kumar
Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform multiple actions corresponding to this intent. The platform may select a target action to perform, and may engage in a back-and-forth dialog to obtain information for completing the target action. The action may include streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user.
-
公开(公告)号:US20230410816A1
公开(公告)日:2023-12-21
申请号:US18341224
申请日:2023-06-26
Applicant: Amazon Technologies, Inc.
Inventor: Nishant Kumar , David Robert Thomas , Sumedha Arvind Kshirsagar , Vikas Jain , Jeff Bradley Beal , Ajay Gopalakrishnan , Shishir Sridhar Bharathi
IPC: G10L17/00 , G10L15/22 , G10L15/183 , G10L15/18
CPC classification number: G10L17/00 , G10L15/22 , G10L15/183 , G10L15/18 , G10L2015/228 , G10L2015/223
Abstract: Features are disclosed for performing functions in response to user requests based on contextual data regarding prior user requests. Users may engage in conversations with a computing device in order to initiate some function or obtain some information. A dialog manager may manage the conversations and store contextual data regarding one or more of the conversations. Processing and responding to subsequent conversations may benefit from the previously stored contextual data by, e.g., reducing the amount of information that a user must provide if the user has already provided the information in the context of a prior conversation. Additional information associated with performing functions responsive to user requests may be shared among applications, further improving efficiency and enhancing the user experience.
-
公开(公告)号:US11495224B2
公开(公告)日:2022-11-08
申请号:US16878315
申请日:2020-05-19
Applicant: Amazon Technologies, Inc.
Inventor: Someshwaran Elangovan , Aparna Nandyal , Venkatesh Kancharla , Arun Rajendran , Sumedha Arvind Kshirsagar , Christopher Geiger Parker
Abstract: Methods and systems for performing contact resolution are described herein. When initiating a communications session using a voice activated electronic device, a contact name may be resolved to determine an appropriate contact with which the communications session may be directed to. Contacts from an individual's contact list may be queried to determine a listing of probable contacts associated with the contact name, and contact identifiers associated with the contact may be determined. Using one or more rules for disambiguating between similar contact names, a single contact may be identified, and a communications session with that contact may be initiated.
-
公开(公告)号:US11094320B1
公开(公告)日:2021-08-17
申请号:US14579699
申请日:2014-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Vikas Jain , Shishir Sridhar Bharathi , Giuseppe Pino Di Fabbrizio , Ling Hu , Sumedha Arvind Kshirsagar , Shamitha Somashekar , John Daniel Thimsen , Tudor Toma
IPC: G06F3/0481 , G10L15/08 , G10L15/22
Abstract: Dialog visualizations are created to enable analysis of interactions between a user and a speech recognition system used to implement user commands. Spoken commands from the user may be classified, along with system responses to the spoken commands, to enable aggregation of communication exchanges that form dialog. This data may then be used to create a dialog visualization. The dialog visualization may enable an analyst to visually explore different branches of the interactions represented in the dialog visualization. The dialog visualization may show a trajectory of the dialog, which may be explored in an interactive manner by the analyst.
-
-
-
-
-
-
-
-
-