-
公开(公告)号:US10726831B2
公开(公告)日:2020-07-28
申请号:US14283017
申请日:2014-05-20
Applicant: Amazon Technologies, Inc.
Inventor: Giuseppe Di Fabbrizio , Shishir Sridhar Bharathi , Ying Shi , Lambert Mathias
Abstract: Features are disclosed for processing and interpreting natural language, such as interpretations of user utterances, in multi-turn dialog interactions. Context information regarding interpretations of user utterances and system responses to the user utterances can be maintained. Subsequent user utterances can be interpreted using the context information, rather than being interpreted without context. In some cases, interpretations of subsequent user utterances can be merged with interpretations of prior user utterances using a rule-based framework. Rules may be defined to determine which interpretations may be merged and under what circumstances they may be merged.
-
公开(公告)号:US10304444B2
公开(公告)日:2019-05-28
申请号:US15196540
申请日:2016-06-29
Applicant: AMAZON TECHNOLOGIES, INC.
Inventor: Lambert Mathias , Thomas Kollar , Arindam Mandal , Angeliki Metallinou
IPC: G06F17/20 , G10L15/22 , G10L15/26 , G10L15/02 , G10L15/18 , G10L15/14 , G06F16/35 , G06F16/332 , G06F17/27
Abstract: A system capable of performing natural language understanding (NLU) without the concept of a domain that influences NLU results. The present system uses a hierarchical organizations of intents/commands and entity types, and trained models associated with those hierarchies, so that commands and entity types may be determined for incoming text queries without necessarily determining a domain for the incoming text. The system thus operates in a domain agnostic manner, in a departure from multi-domain architecture NLU processing where a system determines NLU results for multiple domains simultaneously and then ranks them to determine which to select as the result.
-
公开(公告)号:US09754589B2
公开(公告)日:2017-09-05
申请号:US15256176
申请日:2016-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC classification number: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US20170116985A1
公开(公告)日:2017-04-27
申请号:US15256176
申请日:2016-09-02
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC classification number: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US20150302002A1
公开(公告)日:2015-10-22
申请号:US14754598
申请日:2015-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
IPC: G06F17/28
CPC classification number: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
Abstract translation: 公开了用于处理关于多个主题或域的用户话语的特征,并且从用于响应于话语或以其他方式采取行动的特定域中选择可能的结果。 用户话语可以通过自动语音识别(“ASR”)模块进行转录,并且可以将结果提供给多域自然语言理解(“NLU”)引擎。 多域NLU引擎可以处理多个单个域中的转录,而不是在单个域中处理转录。 在一些情况下,转录可以在多个单独的结构域中并行或基本上同时进行。 此外,可以基于先前的用户交互和其他数据生成提示。 ASR模块,多域NLU引擎和口语处理系统的其他组件可以使用提示来更有效地处理输入或更准确地生成输出。
-
公开(公告)号:US09070366B1
公开(公告)日:2015-06-30
申请号:US13720909
申请日:2012-12-19
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC classification number: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
Abstract translation: 公开了用于处理关于多个主题或域的用户话语的特征,并且从用于响应于话语或以其他方式采取行动的特定域中选择可能的结果。 用户话语可以通过自动语音识别(“ASR”)模块进行转录,并且可以将结果提供给多域自然语言理解(“NLU”)引擎。 多域NLU引擎可以处理多个单个域中的转录,而不是在单个域中处理转录。 在一些情况下,转录可以在多个单独的结构域中并行或基本上同时进行。 此外,可以基于先前的用户交互和其他数据生成提示。 ASR模块,多域NLU引擎和口语处理系统的其他组件可以使用提示来更有效地处理输入或更准确地生成输出。
-
公开(公告)号:US11189277B2
公开(公告)日:2021-11-30
申请号:US16275074
申请日:2019-02-13
Applicant: Amazon Technologies, Inc.
Inventor: Imre Attila Kiss , Arthur Richard Toth , Lambert Mathias
IPC: G10L15/22 , G10L17/00 , G06F40/284 , G10L15/26 , G06F40/295
Abstract: In speech processing systems personalization is added in the Natural Language Understanding (NLU) processor by incorporating external knowledge sources of user information to improve entity recognition performance of the speech processing system. Personalization in the NLU is effected by incorporating one or more dictionaries of entries, or gazetteers, with information personal to a respective user, that provide the user's information to permit disambiguation of semantic interpretation for input utterances to improve quality of speech processing results.
-
公开(公告)号:US11176936B2
公开(公告)日:2021-11-16
申请号:US16400905
申请日:2019-05-01
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
IPC: G10L15/22 , G10L15/26 , G06F40/35 , G06F40/40 , G06F40/56 , G06F40/284 , G06F40/295 , G10L13/08
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
-
公开(公告)号:US10224030B1
公开(公告)日:2019-03-05
申请号:US13828063
申请日:2013-03-14
Applicant: Amazon Technologies, Inc.
Inventor: Imre Attila Kiss , Arthur Richard Toth , Lambert Mathias
Abstract: In speech processing systems personalization is added in the Natural Language Understanding (NLU) processor by incorporating external knowledge sources of user information to improve entity recognition performance of the speech processing system. Personalization in the NLU is effected by incorporating one or more dictionaries of entries, or gazetteers, with information personal to a respective user, that provide the user's information to permit disambiguation of semantic interpretation for input utterances to improve quality of speech processing results.
-
公开(公告)号:US09436678B2
公开(公告)日:2016-09-06
申请号:US14754598
申请日:2015-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Lambert Mathias , Ying Shi , Imre Attila Kiss , Ryan Paul Thomas , Frederic Johan Georges Deramat
CPC classification number: G10L15/22 , G06F17/277 , G06F17/278 , G06F17/279 , G06F17/28 , G06F17/2881 , G10L13/08 , G10L15/26
Abstract: Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
Abstract translation: 公开了用于处理关于多个主题或域的用户话语的特征,并且从用于响应于话语或以其他方式采取行动的特定域中选择可能的结果。 用户话语可以通过自动语音识别(“ASR”)模块进行转录,并且可以将结果提供给多域自然语言理解(“NLU”)引擎。 多域NLU引擎可以处理多个单个域中的转录,而不是在单个域中处理转录。 在一些情况下,转录可以在多个单独的结构域中并行或基本同时地进行处理。 此外,可以基于先前的用户交互和其他数据生成提示。 ASR模块,多域NLU引擎和口语处理系统的其他组件可以使用提示来更有效地处理输入或更准确地生成输出。
-
-
-
-
-
-
-
-
-