发明授权
- 专利标题: Training an automatic speech recognition system using compressed word frequencies
-
申请号: US13666223申请日: 2012-11-01
-
公开(公告)号: US08543398B1公开(公告)日: 2013-09-24
- 发明人: Brian Strope , Mitchel Weintraub
- 申请人: Google Inc.
- 申请人地址: US CA Mountain View
- 专利权人: Google Inc.
- 当前专利权人: Google Inc.
- 当前专利权人地址: US CA Mountain View
- 代理机构: McDonnell Boehnen Hulbert & Berghoff LLP
- 主分类号: G10L15/06
- IPC分类号: G10L15/06
摘要:
Respective word frequencies may be determined from a corpus of utterance-to-text-string mappings that contain associations between audio utterances and a respective text string transcription of each audio utterance. Respective compressed word frequencies may be obtained based on the respective word frequencies such that the distribution of the respective compressed word frequencies has a lower variance than the distribution of the respective word frequencies. Sample utterance-to-text-string mappings may be selected from the corpus of utterance-to-text-string mappings based on the compressed word frequencies. An automatic speech recognition (ASR) system may be trained with the sample utterance-to-text-string mappings.
信息查询