Patent search ap:("Google Inc.") AND inv:"Trevor D. Strohman" Page 1

1.

发明申请
RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES 审中-公开

公开(公告)号：US20170301352A1

公开(公告)日：2017-10-19

申请号：US15637526

申请日：2017-06-29

Applicant: Google Inc.

Inventor： Trevor D. Strohman , Johan Schalkwyk , Gleb Skobeltsyn

IPC: G10L15/32 , G10L15/22 , G10L15/19 , G10L25/51 , G10L15/02

CPC classification number: G10L15/32 , G10L15/02 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/26 , G10L25/51 , G10L2015/025

Abstract: Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.

2.

发明授权
Discovery of problematic pronunciations for automatic speech recognition systems 有权
Title translation: 发现自动语音识别系统的有问题的发音

公开(公告)号：US08959020B1

公开(公告)日：2015-02-17

申请号：US13853150

申请日：2013-03-29

Applicant: Google Inc.

Inventor： Brian Strope , Francoise Beaufays , Trevor D. Strohman

IPC: G10L15/18 , G06F17/24

CPC classification number: G10L15/187

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for discovery of problematic pronunciations for automatic speech recognition systems. One of the methods includes determining a frequency of occurrences of one or more n-grams in transcribed text and a frequency of occurrences of the n-grams in typed text and classifying a system pronunciation of a word included in the n-grams as correct or incorrect based on the frequencies. The n-grams may comprise one or more words and at least one of the words is classified as incorrect based on the frequencies. The frequencies of the specific n-grams may be determined across a domain using one or more n-grams that typically appear adjacent to the specific n-grams.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于发现用于自动语音识别系统的有问题的发音。其中一种方法包括确定转录文本中一个或多个n克的出现频率和类型文本中出现的n-gram的频率，并将包含在n-gram中的单词的系统发音分类为正确或基于频率不正确。 n克可以包括一个或多个单词，并且基于频率将这些单词中的至少一个分类为不正确的。可以使用通常出现在特定n-gram附近的一个或多个n克来跨域确定特定n克的频率。

3.

发明授权
Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores 有权

公开(公告)号：US09741339B2

公开(公告)日：2017-08-22

申请号：US13930495

申请日：2013-06-28

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman

IPC: G10L15/00 , G09B5/00 , G10L15/14 , G10L15/18 , G10L13/08 , G10L15/06 , G09B17/00

CPC classification number: G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample, wherein the said score for the particular term is obtained by using a minimum of individual scores of phonemes comprising the term. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

4.

发明申请
RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES 审中-公开

公开(公告)号：US20170229124A1

公开(公告)日：2017-08-10

申请号：US15016609

申请日：2016-02-05

Applicant: Google Inc.

Inventor： Trevor D. Strohman , Johan Schalkwyk , Gleb Skobeltsyn

IPC: G10L15/32 , G10L15/02 , G10L25/51 , G10L15/26 , G10L15/183

CPC classification number: G10L15/32 , G10L15/02 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/26 , G10L25/51 , G10L2015/025

Abstract: Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.

5.

发明申请
DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING 有权
Title translation: 数据驱动公开学习与CROWD采购

公开(公告)号：US20150006178A1

公开(公告)日：2015-01-01

申请号：US13930495

申请日：2013-06-28

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman

IPC: G10L15/18

CPC classification number: G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于确定特定术语的发音。方法，系统和装置包括获得与特定术语相对应的语音样本的动作，并获得特定术语的候选发音。进一步的动作包括针对特定术语的每个候选发音和对应于特定术语的语音样本生成反映候选发音和音频样本之间的相似程度的分数。附加动作包括聚合每个候选发音的分数，并且基于候选发音的聚合分数，将特定术语的一个或多个候选发音添加到发音词典。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification