- 专利标题: Efficient streaming non-recurrent on-device end-to-end model
-
申请号: US17316198申请日: 2021-05-10
-
公开(公告)号: US11715458B2公开(公告)日: 2023-08-01
- 发明人: Tara Sainath , Arun Narayanan , Rami Botros , Yanzhang He , Ehsan Variani , Cyril Allauzen , David Rybach , Ruoming Pang , Trevor Strohman
- 申请人: Google LLC
- 申请人地址: US CA Mountain View
- 专利权人: Google LLC
- 当前专利权人: Google LLC
- 当前专利权人地址: US CA Mountain View
- 代理机构: Honigman LLP
- 代理商 Brett A. Krueger; Grant Griffith
- 主分类号: G10L15/00
- IPC分类号: G10L15/00 ; G10L15/06 ; G10L15/02 ; G10L15/22 ; G10L15/30
摘要:
An ASR model includes a first encoder configured to receive a sequence of acoustic frames and generate a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The ASR model also includes a second encoder configured to receive the first higher order feature representation generated by the first encoder at each of the plurality of output steps and generate a second higher order feature representation for a corresponding first higher order feature frame. The ASR model also includes a decoder configured to receive the second higher order feature representation generated by the second encoder at each of the plurality of output steps and generate a first probability distribution over possible speech recognition hypothesis. The ASR model also includes a language model configured to receive the first probability distribution over possible speech hypothesis and generate a rescored probability distribution.
公开/授权文献
信息查询