Deliberation Model-Based Two-Pass End-To-End Speech Recognition

    Publication No.: US20210225369A1

    Publication Date: 2021-07-22

    Application No.: US17149018

    Filing Date: 2021-01-14

    Applicant: Google LLC

    Abstract: A method of performing speech recognition using a two-pass deliberation architecture includes receiving a first-pass hypothesis and an encoded acoustic frame and encoding the first-pass hypothesis at a hypothesis encoder. The first-pass hypothesis is generated by a recurrent neural network (RNN) decoder model for the encoded acoustic frame. The method also includes generating, using a first attention mechanism attending to the encoded acoustic frame, a first context vector, and generating, using a second attention mechanism attending to the encoded first-pass hypothesis, a second context vector. The method also includes decoding the first context vector and the second context vector at a context vector decoder to form a second-pass hypothesis.
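
The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the function names (`attend`, `encode_hypothesis`, `deliberation_step`), the embedding-lookup hypothesis encoder, the dot-product attention, the single shared query vector, the one-step linear decoder, and the random weights are all assumptions chosen to keep the example small.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys):
    """Dot-product attention: returns the attention-weighted sum of `keys`."""
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    weights = softmax(scores)
    dim = len(keys[0])
    return [sum(w * key[i] for w, key in zip(weights, keys)) for i in range(dim)]


def encode_hypothesis(token_ids, embedding):
    """Stand-in hypothesis encoder (assumption): a plain embedding lookup."""
    return [embedding[t] for t in token_ids]


def deliberation_step(query, acoustic_frames, token_ids, embedding, out_proj):
    """One decoding step of the second pass.

    Attends separately to the encoded acoustic frames and to the encoded
    first-pass hypothesis, concatenates the two context vectors, and
    projects them to vocabulary logits.
    """
    encoded_hyp = encode_hypothesis(token_ids, embedding)
    acoustic_ctx = attend(query, acoustic_frames)  # first context vector
    hyp_ctx = attend(query, encoded_hyp)           # second context vector
    combined = acoustic_ctx + hyp_ctx              # list concatenation
    logits = [sum(w * x for w, x in zip(row, combined)) for row in out_proj]
    token = max(range(len(logits)), key=logits.__getitem__)
    return token, acoustic_ctx, hyp_ctx


def rand_vec(n):
    """Random vector used only as placeholder data and weights."""
    return [random.uniform(-1.0, 1.0) for _ in range(n)]


random.seed(0)
DIM, VOCAB = 4, 6                                     # hypothetical sizes
embedding = [rand_vec(DIM) for _ in range(VOCAB)]     # hypothetical weights
out_proj = [rand_vec(2 * DIM) for _ in range(VOCAB)]  # hypothetical weights
acoustic_frames = [rand_vec(DIM) for _ in range(5)]   # placeholder encoder output
first_pass = [2, 4, 1]                                # placeholder first-pass token ids
query = rand_vec(DIM)                                 # placeholder decoder state

token, acoustic_ctx, hyp_ctx = deliberation_step(
    query, acoustic_frames, first_pass, embedding, out_proj
)
print("second-pass token id:", token)
```

In a real system the second pass would run this step repeatedly, feeding each emitted token back into the decoder state to build the full second-pass hypothesis; the sketch shows only a single step to make the two attention paths visible.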
