-
公开(公告)号:US20220093101A1
公开(公告)日:2022-03-24
申请号:US17112520
申请日:2020-12-04
Applicant: Amazon Technologies, Inc.
Inventor: Prakash Krishnan , Arindam Mandal , Siddhartha Reddy Jonnalagadda , Nikko Strom , Ariya Rastrow , Ying Shi , David Chi-Wai Tang , Nishtha Gupta , Aaron Challenner , Bonan Zheng , Angeliki Metallinou , Vincent Auvray , Minmin Shen
Abstract: A system that is capable of resolving anaphora using timing data received by a local device. A local device outputs audio representing a list of entries. The audio may represent synthesized speech of the list of entries. A user can interrupt the device to select an entry in the list, such as by saying “that one.” The local device can determine an offset time representing the time between when audio playback began and when the user interrupted. The local device sends the offset time and audio data representing the utterance to a speech processing system which can then use the offset time and stored data to identify which entry on the list was most recently output by the local device when the user interrupted. The system can then resolve anaphora to match that entry and can perform additional processing based on the referred to item.
-
公开(公告)号:US11200885B1
公开(公告)日:2021-12-14
申请号:US16219228
申请日:2018-12-13
Applicant: Amazon Technologies, Inc.
Inventor: Arindam Mandal , Nikko Strom , Angeliki Metallinou , Tagyoung Chung , Dilek Hakkani-Tur , Suranjit Adhikari , Sridhar Yadav Manoharan , Ankita De , Qing Liu , Raefer Christopher Gabriel , Rohit Prasad
IPC: G10L15/22 , G10L21/00 , G10L15/06 , G10L15/18 , G06F16/332
Abstract: A dialog manager receives text data corresponding to a dialog with a user. Entities represented in the text data are identified. Context data relating to the dialog is maintained, which may include prior dialog, prior API calls, user profile information, or other data. Using the text data and the context data, an N-best list of one or more dialog models is selected to process the text data. After processing the text data, the outputs of the N-best models are ranked and a top-scoring output is selected. The top-scoring output may be an API call and/or an audio prompt.
-
公开(公告)号:US20170278514A1
公开(公告)日:2017-09-28
申请号:US15196540
申请日:2016-06-29
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Lambert Mathias , Thomas Kollar , Arindam Mandal , Angeliki Metallinou
CPC classification number: G10L15/22 , G06F17/277 , G06F17/279 , G06F17/30637 , G06F17/30654 , G06F17/30705 , G10L15/02 , G10L15/142 , G10L15/1815 , G10L15/26 , G10L2015/223
Abstract: A system capable of performing natural language understanding (NLU) without the concept of a domain that influences NLU results. The present system uses a hierarchical organizations of intents/commands and entity types, and trained models associated with those hierarchies, so that commands and entity types may be determined for incoming text queries without necessarily determining a domain for the incoming text. The system thus operates in a domain agnostic manner, in a departure from multi-domain architecture NLU processing where a system determines NLU results for multiple domains simultaneously and then ranks them to determine which to select as the result.
-
-