-
公开(公告)号:US20150371632A1
公开(公告)日:2015-12-24
申请号:US14639199
申请日:2015-03-05
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn , Behshad Behzadi
IPC: G10L15/187
CPC classification number: G10L15/187 , G10L15/1815 , G10L15/26 , G10L2015/088 , G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing names of entities in speech. In one aspect, a method includes actions of receiving an utterance that includes (i) a first term that indicates a particular entity type, and (ii) a second term that indicates an entity name. Additional actions include obtaining a phonetic representation of the second term and determining that the phonetic representation of the second term matches a particular phonetic representation of a particular canonical name of a set of canonical names associated with a particular entity. Further actions include outputting a reference name associated with the particular entity as a transcription of the second term.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于识别语音中实体的名称。 一方面,一种方法包括接收包括(i)指示特定实体类型的第一项和(ii)指示实体名称的第二项的话语的动作。 附加动作包括获得第二项的语音表示,并且确定第二项的语音表示符合与特定实体相关联的一组规范名称的特定规范名称的特定语音表示。 进一步的动作包括输出与特定实体相关联的参考名称作为第二项的转录。
-
公开(公告)号:US09875738B2
公开(公告)日:2018-01-23
申请号:US15614239
申请日:2017-06-05
Applicant: Google Inc.
Inventor: Gleb Skobeltsyn , Evgeny A. Cherepanov , Behshad Behzadi
IPC: G10L15/00 , G10L15/22 , G10L15/187 , G10L15/01
CPC classification number: G10L15/22 , G06F17/271 , G06F17/2765 , G06F17/30654 , G06F17/30663 , G06F17/30746 , G10L15/01 , G10L15/08 , G10L15/187 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice query; generating a first recognition output; receiving a second voice query; determining from a recognition of the second voice query that the second voice query triggers a correction request; using the first recognition output and the second recognition to determine a plurality of candidate corrections; scoring each candidate correction; and generating a corrected recognition output for a particular candidate correction having a score that satisfies a threshold value.
-
公开(公告)号:US20170229124A1
公开(公告)日:2017-08-10
申请号:US15016609
申请日:2016-02-05
Applicant: Google Inc.
Inventor: Trevor D. Strohman , Johan Schalkwyk , Gleb Skobeltsyn
IPC: G10L15/32 , G10L15/02 , G10L25/51 , G10L15/26 , G10L15/183
CPC classification number: G10L15/32 , G10L15/02 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/26 , G10L25/51 , G10L2015/025
Abstract: Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.
-
公开(公告)号:US09971758B1
公开(公告)日:2018-05-15
申请号:US14989621
申请日:2016-01-06
Applicant: Google Inc.
Inventor: Evgeny A. Cherepanov , Gleb Skobeltsyn , Jakob Nicolaus Foerster , Petar Aleksic , Assaf Avner Hurwitz Michaely
IPC: G10L15/26 , G06F17/27 , G10L15/32 , G10L15/197 , G10L15/187 , G10L15/08
CPC classification number: G06F17/273 , G10L15/187 , G10L15/197 , G10L15/22 , G10L15/26 , G10L15/32 , G10L2015/086
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for natural language processing. One of the methods includes receiving a first voice input from a user device; generating a first recognition output; receiving a user selection of one or more terms in the first recognition output; receiving a second voice input spelling a correction of the user selection; determining a corrected recognition output for the selected portion; and providing a second recognition output that merges the first recognition output and the corrected recognition output.
-
公开(公告)号:US20180033426A1
公开(公告)日:2018-02-01
申请号:US15224104
申请日:2016-07-29
Applicant: Google Inc.
Inventor: Olga Kapralova , Evgeny A. Cherepanov , Dmitry Osmakov , Martin Baeuml , Gleb Skobeltsyn
CPC classification number: G10L15/063 , G10L15/01 , G10L15/06 , G10L15/10 , G10L15/22 , G10L15/32 , G10L2015/0635 , G10L2015/0638
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.
-
-
-
-