Generating representations of acoustic sequences

发明授权

US10535338B2 Generating representations of acoustic sequences 有权

请登陆查看更多内容

专利标题： Generating representations of acoustic sequences
申请号： US16179801

申请日： 2018-11-02
公开(公告)号： US10535338B2

公开(公告)日： 2020-01-14
发明人: Hasim Sak , Andrew W. Senior
申请人： Google LLC
申请人地址： US CA Mountain View
专利权人： Google LLC
当前专利权人： Google LLC
当前专利权人地址： US CA Mountain View
代理机构： Honigman LLP
代理商 Brett A. Krueger
主分类号： G10L15/00
IPC分类号： G10L15/00 ; G10L15/16 ; G10L15/02 ; G10L15/14

Generating representations of acoustic sequences

摘要：

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating representation of acoustic sequences. One of the methods includes: receiving an acoustic sequence, the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; processing the acoustic feature representation at an initial time step using an acoustic modeling neural network; for each subsequent time step of the plurality of time steps: receiving an output generated by the acoustic modeling neural network for a preceding time step, generating a modified input from the output generated by the acoustic modeling neural network for the preceding time step and the acoustic representation for the time step, and processing the modified input using the acoustic modeling neural network to generate an output for the time step; and generating a phoneme representation for the utterance from the outputs for each of the time steps.

公开/授权文献

US20190139536A1 GENERATING REPRESENTATIONS OF ACOUSTIC SEQUENCES 公开/授权日：2019-05-09

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）