Training an automatic speech recognition system using compressed word frequencies

Invention Grant

US08543398B1 Training an automatic speech recognition system using compressed word frequencies (in force)

Patent Title: Training an automatic speech recognition system using compressed word frequencies
Application No.: US13666223

Application Date: 2012-11-01
Publication No.: US08543398B1

Publication Date: 2013-09-24
Inventor: Brian Strope, Mitchel Weintraub
Applicant: Google Inc.
Applicant Address: US CA Mountain View
Assignee: Google Inc.
Current Assignee: Google Inc.
Current Assignee Address: US CA Mountain View
Agency: McDonnell Boehnen Hulbert & Berghoff LLP
Main IPC: G10L15/06
IPC: G10L15/06

Abstract:

Respective word frequencies may be determined from a corpus of utterance-to-text-string mappings that contain associations between audio utterances and a respective text string transcription of each audio utterance. Respective compressed word frequencies may be obtained based on the respective word frequencies such that the distribution of the respective compressed word frequencies has a lower variance than the distribution of the respective word frequencies. Sample utterance-to-text-string mappings may be selected from the corpus of utterance-to-text-string mappings based on the compressed word frequencies. An automatic speech recognition (ASR) system may be trained with the sample utterance-to-text-string mappings.
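
The steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not state which compression function is used or how samples are selected, so everything specific here is an assumption made for illustration: the log(1 + count) compression, the weighting scheme, and the function names `word_frequencies`, `compress_frequencies`, and `select_samples` are all hypothetical. The corpus is modeled as a list of (utterance identifier, transcription) pairs.

```python
import math
import random
from collections import Counter
from statistics import pvariance


def word_frequencies(corpus):
    """Count how often each word occurs across all transcriptions."""
    counts = Counter()
    for _utterance, text in corpus:
        counts.update(text.lower().split())
    return counts


def compress_frequencies(freqs):
    """Assumed compression: log(1 + count). The patent abstract only requires
    that the compressed distribution have lower variance than the raw one."""
    return {word: math.log1p(count) for word, count in freqs.items()}


def select_samples(corpus, freqs, compressed, k, seed=0):
    """Draw k mappings (with replacement), weighting each mapping by the mean
    compressed-to-raw frequency ratio of its words (an assumed scheme)."""
    rng = random.Random(seed)
    weights = []
    for _utterance, text in corpus:
        words = text.lower().split()
        if not words:
            weights.append(0.0)  # empty transcriptions are never favored
            continue
        # The ratio log(1 + c) / c shrinks as c grows, so rare words weigh more.
        weights.append(sum(compressed[w] / freqs[w] for w in words) / len(words))
    return rng.choices(corpus, weights=weights, k=k)


# Hypothetical toy corpus; the file names stand in for audio utterances.
corpus = [
    ("utt1.wav", "call the office"),
    ("utt2.wav", "call the the doctor"),
    ("utt3.wav", "navigate to the airport"),
    ("utt4.wav", "call home"),
]
freqs = word_frequencies(corpus)
compressed = compress_frequencies(freqs)
print("raw variance:", pvariance(list(freqs.values())))
print("compressed variance:", pvariance(list(compressed.values())))
training_set = select_samples(corpus, freqs, compressed, k=3)
print(training_set)
```

Because log(1 + count) changes more slowly than the count itself for every count of 1 or more, the compressed values are always less spread out than the raw counts unless all counts are equal. The selected mappings would then be passed to whatever ASR training procedure is in use, which this sketch does not model.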

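IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	. Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)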