发明授权
- 专利标题: Input-feeding architecture for attention based end-to-end speech recognition
-
申请号: US16160352申请日: 2018-10-15
-
公开(公告)号: US10672382B2公开(公告)日: 2020-06-02
- 发明人: Chao Weng , Jia Cui , Guangsen Wang , Jun Wang , Chengzhu Yu , Dan Su , Dong Yu
- 申请人: TENCENT AMERICA LLC
- 申请人地址: US CA Palo Alto
- 专利权人: TENCENT AMERICA LLC
- 当前专利权人: TENCENT AMERICA LLC
- 当前专利权人地址: US CA Palo Alto
- 代理机构: Sughrue Mion, PLLC
- 主分类号: G10L15/06
- IPC分类号: G10L15/06 ; G10L15/14 ; G10L15/183 ; G10L15/22
摘要:
Methods and apparatuses are provided for performing end-to-end speech recognition training performed by at least one processor. The method includes receiving, by the at least one processor, one or more input speech frames, generating, by the at least one processor, a sequence of encoder hidden states by transforming the input speech frames, computing, by the at least one processor, attention weights based on each of the sequence of encoder hidden states and a current decoder hidden state, performing, by the at least one processor, a decoding operation based on a previous embedded label prediction information and a previous attentional hidden state information generated based on the attention weights; and generating a current embedded label prediction information based on a result of the decoding operation and the attention weights.
公开/授权文献
信息查询