-
Publication number: US20180012594A1
Publication date: 2018-01-11
Application number: US15205505
Filing date: 2016-07-08
Applicant: Google Inc.
Inventor: Behshad Behzadi, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
CPC classification number: G10L15/183, G06F17/279, G06F17/30684, G10L15/02, G10L15/14, G10L15/1815, G10L15/26
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting follow-up queries to an initial transcription of an utterance. In some implementations, one or more follow-up queries that are pre-associated with a transcription of an initial utterance of a user are identified. A new or modified language model in which a respective probability associated with one or more of the follow-up queries is increased with respect to an initial language model is obtained. Subsequent audio data corresponding to a subsequent utterance of the user is then received. The subsequent audio data is processed using the new or modified language model to generate a transcription of the subsequent utterance. The transcription of the subsequent utterance is then provided for output to the user.
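The biasing step in this abstract can be illustrated with a minimal Python sketch. It assumes a toy language model that maps whole queries to probabilities and a recognizer that returns (text, acoustic log-score) pairs; the function names, the boost factor, and the sample data are all hypothetical and are not taken from the patent.

```python
import math

def bias_language_model(initial_lm, follow_ups, boost=5.0):
    """Return a new query -> probability table in which the given
    follow-up queries are boosted relative to the initial model."""
    floor = min(initial_lm.values())
    weighted = dict(initial_lm)
    for query in follow_ups:
        weighted[query] = weighted.get(query, floor) * boost
    total = sum(weighted.values())
    # Renormalize so the table is still a probability distribution.
    return {query: w / total for query, w in weighted.items()}

def transcribe(candidates, lm):
    """Pick the candidate with the best acoustic log-score plus LM log-probability."""
    floor = min(lm.values())
    best = max(candidates, key=lambda c: c[1] + math.log(lm.get(c[0], floor)))
    return best[0]

# Hypothetical data: the initial utterance was "how tall is the eiffel tower",
# and "how tall is it" is pre-associated with it as a likely follow-up.
initial_lm = {"what time is it": 0.5, "what's the weather": 0.3, "how tall is it": 0.2}
biased_lm = bias_language_model(initial_lm, ["how tall is it"])

# Acoustic hypotheses for the subsequent utterance (text, log-score).
candidates = [("what time is it", -1.0), ("how tall is it", -1.2)]
print(transcribe(candidates, initial_lm))  # what time is it
print(transcribe(candidates, biased_lm))   # how tall is it
```

With the initial model the slightly better acoustic match wins; after boosting, the predicted follow-up query is selected instead.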
-
Publication number: US09865260B1
Publication date: 2018-01-09
Application number: US15585363
Filing date: 2017-05-03
Applicant: Google Inc.
Inventor: Vladimir Vuskovic, Stephan Wenger, Zineb Ait Bahajji, Martin Baeuml, Alexandru Dovlecel, Gleb Skobeltsyn
CPC classification number: G06F17/278, G06F17/279, G06F17/2881, G10L15/22
Abstract: Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s) related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
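As a rough illustration of the selection step described in this abstract, the Python sketch below looks up an entity mentioned in a dialog, picks the fact with the highest interest measure, and emits it only if it clears a threshold. The in-memory fact table, the interest values, the threshold, and the function name are assumptions made for the example; the patent does not specify them.

```python
# Hypothetical entity database: entity -> list of (fact, measure of potential interest).
ENTITY_FACTS = {
    "mozart": [
        ("Mozart composed his first symphony at age eight.", 0.9),
        ("Mozart was born in Salzburg.", 0.6),
    ],
}

def unsolicited_content(dialog_turns, min_interest=0.7):
    """Return one unsolicited sentence about an entity mentioned in the
    dialog, or None if no fact is interesting enough."""
    text = " ".join(dialog_turns).lower()
    for entity, facts in ENTITY_FACTS.items():
        if entity in text:
            fact, interest = max(facts, key=lambda item: item[1])
            if interest >= min_interest:
                return "By the way: " + fact
    return None

print(unsolicited_content(["Play some Mozart"]))
```

A real system would use entity linking rather than substring matching; the substring test is only a stand-in.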
-
Publication number: US20170301352A1
Publication date: 2017-10-19
Application number: US15637526
Filing date: 2017-06-29
Applicant: Google Inc.
Inventor: Trevor D. Strohman, Johan Schalkwyk, Gleb Skobeltsyn
CPC classification number: G10L15/32, G10L15/02, G10L15/183, G10L15/19, G10L15/22, G10L15/26, G10L25/51, G10L2015/025
Abstract: Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.
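The expand-and-select flow in this abstract might look like the following Python sketch. It assumes a crude hand-written phonetic key (not a real phonetic algorithm), a toy external language model given as a term -> probability table, and a simple log-probability sum for selection; none of these details come from the patent.

```python
import math
from itertools import product

def phonetic_key(word):
    """Crude illustrative key: keep the first letter, drop later vowels
    and h/w/y, and collapse repeated letters."""
    word = word.lower()
    key = word[0]
    for ch in word[1:]:
        if ch in "aeiouyhw":
            continue
        if ch != key[-1]:
            key += ch
    return key

def expand(transcription, external_lm):
    """Generate candidates by swapping in phonetically similar terms
    from the external model. The initial transcription comes first."""
    options = []
    for term in transcription.split():
        similar = [t for t in external_lm
                   if t != term and phonetic_key(t) == phonetic_key(term)]
        options.append([term] + similar)
    return [" ".join(combo) for combo in product(*options)]

def select(candidates, external_lm, floor=1e-6):
    """Choose the candidate with the highest summed log-probability."""
    def score(candidate):
        return sum(math.log(external_lm.get(t, floor)) for t in candidate.split())
    return max(candidates, key=score)

# Hypothetical external model, e.g. built from a user's contact list.
external_lm = {"call": 0.4, "john": 0.3, "smith": 0.3}
candidates = expand("call jon smyth", external_lm)
print(select(candidates, external_lm))  # call john smith
```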
-
Publication number: US20170076723A1
Publication date: 2017-03-16
Application number: US15359284
Filing date: 2016-11-22
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Evgeny A. Cherepanov, Behshad Behzadi
IPC: G10L15/22, G10L15/187, G10L15/01
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30663, G06F17/30746, G10L15/01, G10L15/08, G10L15/187, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.
-
Publication number: US09773499B2
Publication date: 2017-09-26
Application number: US14639199
Filing date: 2015-03-05
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Behshad Behzadi
IPC: G10L15/187, G10L15/26, G10L15/18, G10L15/22, G10L15/08
CPC classification number: G10L15/187, G10L15/1815, G10L15/26, G10L2015/088, G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing names of entities in speech. In one aspect, a method includes actions of receiving an utterance that includes (i) a first term that indicates a particular entity type, and (ii) a second term that indicates an entity name. Additional actions include obtaining a phonetic representation of the second term and determining that the phonetic representation of the second term matches a particular phonetic representation of a particular canonical name of a set of canonical names associated with a particular entity. Further actions include outputting a reference name associated with the particular entity as a transcription of the second term.
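The matching step in this abstract can be sketched in Python as below, assuming the utterance has already been split into an entity-type term and a name term. The phonetic key is a deliberately crude stand-in for a real phonetic representation, and the entity table and function names are hypothetical.

```python
def word_key(word):
    """Crude illustrative phonetic key for one word: keep the first letter,
    drop later vowels and h/w/y, and collapse repeated letters."""
    key = word[0]
    for ch in word[1:]:
        if ch in "aeiouyhw":
            continue
        if ch != key[-1]:
            key += ch
    return key

def phonetic_key(text):
    """Phonetic key of a possibly multi-word name."""
    return " ".join(word_key(w) for w in text.lower().split())

# Hypothetical data: entity type -> entities, each with canonical names
# and the reference name to output as the transcription.
ENTITIES = {
    "singer": [
        {"reference": "Sade Adu", "canonical": ["sade", "sade adu"]},
        {"reference": "Adele", "canonical": ["adele"]},
    ],
}

def transcribe_name(type_term, name_term):
    """Return the reference name whose canonical name matches the spoken
    name phonetically, or the raw term if nothing matches."""
    heard = phonetic_key(name_term)
    for entity in ENTITIES.get(type_term.lower(), []):
        if any(phonetic_key(name) == heard for name in entity["canonical"]):
            return entity["reference"]
    return name_term

print(transcribe_name("singer", "shade"))  # Sade Adu
```

Restricting the search to entities of the spoken type keeps the phonetic match from colliding with similar-sounding names of other types.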
-
Publication number: US20170270928A1
Publication date: 2017-09-21
Application number: US15614239
Filing date: 2017-06-05
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Evgeny A. Cherepanov, Behshad Behzadi
IPC: G10L15/22, G10L15/187, G10L15/01
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30663, G06F17/30746, G10L15/01, G10L15/08, G10L15/187, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.
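One way to picture the correction flow in this abstract is the Python sketch below. It assumes a regular expression as the correction trigger, single-word substitution as the candidate generator, and string similarity from the standard library's `difflib.SequenceMatcher` as the score; the trigger pattern, threshold, and function name are illustrative assumptions, not the patented method.

```python
import re
from difflib import SequenceMatcher

# Hypothetical trigger: "no I said X", "I meant X", and similar phrasings.
CORRECTION = re.compile(r"^(?:no,? )?i (?:said|meant) (?P<fix>.+)$")

def correct(first_output, second_output, threshold=0.6):
    """Return a corrected version of first_output, or None if the second
    query is not a correction or no candidate scores high enough."""
    match = CORRECTION.match(second_output.lower())
    if not match:
        return None
    fix = match.group("fix")
    words = first_output.split()
    scored = []
    for i, word in enumerate(words):
        # Candidate correction: replace word i with the corrected term.
        candidate = " ".join(words[:i] + [fix] + words[i + 1:])
        score = SequenceMatcher(None, word.lower(), fix).ratio()
        scored.append((score, candidate))
    best_score, best = max(scored)
    return best if best_score >= threshold else None

print(correct("pictures of the eiffel tour", "no I said tower"))
# pictures of the eiffel tower
```

The threshold keeps the system from forcing a substitution when the spoken correction does not resemble any word in the first recognition output.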
-
Publication number: US09576578B1
Publication date: 2017-02-21
Application number: US14824902
Filing date: 2015-08-12
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Alexandru Ovidiu Dovlecel, Carl-Anton Ingmarsson, Martin Baeuml, Behshad Behzadi, Dmitry Osmakov
CPC classification number: G10L15/26, G06F17/30684, G06F17/30746, G10L15/22, G10L2015/226
Abstract: Methods, including computer programs encoded on a computer storage medium, for collaborative language model biasing. In one aspect, a method includes: obtaining (i) one or more initial candidate transcriptions, and (ii) one or more terms that are associated with a context; selecting one or more of the terms that are associated with the context, and that (i) do not occur in the candidate transcriptions, and (ii) are indicated as phonetically similar to one or more terms that do occur in the initial candidate transcriptions; generating one or more additional candidate transcriptions based on the (i) initial candidate transcriptions, and (ii) the selected terms; and providing the one or more additional candidate transcriptions to an automated speech recognizer.
-
Publication number: US09514743B2
Publication date: 2016-12-06
Application number: US14812811
Filing date: 2015-07-29
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Evgeny A. Cherepanov, Behshad Behzadi
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30663, G06F17/30746, G10L15/01, G10L15/08, G10L15/187, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.
-
Publication number: US09672824B2
Publication date: 2017-06-06
Application number: US15359284
Filing date: 2016-11-22
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Evgeny A. Cherepanov, Behshad Behzadi
IPC: G10L15/00, G10L15/22, G10L15/01, G10L15/187
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30663, G06F17/30746, G10L15/01, G10L15/08, G10L15/187, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.
-
Publication number: US20160063994A1
Publication date: 2016-03-03
Application number: US14812811
Filing date: 2015-07-29
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn, Evgeny A. Cherepanov, Behshad Behzadi
CPC classification number: G10L15/22, G06F17/271, G06F17/2765, G06F17/30654, G06F17/30663, G06F17/30746, G10L15/01, G10L15/08, G10L15/187, G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.