发明授权
- 专利标题: Amalgamating multimedia transcripts for closed captioning from a plurality of text to speech conversions
- 专利标题(中): 从多个文本到语音转换合并用于隐藏字幕的多媒体抄本
-
申请号: US12890744申请日: 2010-09-27
-
公开(公告)号: US09332319B2公开(公告)日: 2016-05-03
- 发明人: Johney Tsai , Matthew Miller , David Strong
- 申请人: Johney Tsai , Matthew Miller , David Strong
- 申请人地址: US PA Blue Bell
- 专利权人: Unisys Corporation
- 当前专利权人: Unisys Corporation
- 当前专利权人地址: US PA Blue Bell
- 代理商 Richard J. Gregson
- 主分类号: G10L15/32
- IPC分类号: G10L15/32 ; G10L15/183 ; G10L15/26 ; H04N21/488
摘要:
Methods and systems for converting speech to text are disclosed. One method includes analyzing multimedia content to determine the presence of closed captioning data. The method includes, upon detecting closed captioning data, indexing the closed captioning data as associated with the multimedia content. The method also includes, upon failure to detect closed captioning data in the multimedia content, extracting audio data from multimedia content, the audio data including speech data, performing a plurality of speech to text conversions on the speech data to create a plurality of transcripts of the speech data, selecting text from one or more of the plurality of transcripts to form an amalgamated transcript, and indexing the amalgamated transcript as associated with the multimedia content.
公开/授权文献
信息查询