Discriminative Training of Document Transcription System

发明申请

US20160078861A1 Discriminative Training of Document Transcription System 有权

请登陆查看更多内容

专利标题： Discriminative Training of Document Transcription System
申请号： US14942349

申请日： 2015-11-16
公开(公告)号： US20160078861A1

公开(公告)日： 2016-03-17
发明人: Lambert Mathias , Girija Yegnanarayanan , Juergen Fritsch
申请人： MModal IP LLC
申请人地址： US TN Franklin
专利权人： MMODAL IP LLC
当前专利权人： MMODAL IP LLC
当前专利权人地址： US TN Franklin
主分类号： G10L15/06
IPC分类号： G10L15/06 ; G06F17/28 ; G10L15/02 ; G06F17/27

Discriminative Training of Document Transcription System

摘要：

A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

公开/授权文献

US09520124B2 Discriminative training of document transcription system 公开/授权日：2016-12-13

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）