Invention Grant
- Patent Title: Enhanced multi-channel acoustic models
-
Application No.: US17303822Application Date: 2021-06-08
-
Publication No.: US11783849B2Publication Date: 2023-10-10
- Inventor: Ehsan Variani , Kevin William Wilson , Ron J. Weiss , Tara N. Sainath , Arun Narayanan
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent Brett A. Krueger; Grant J. Griffith
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G10L25/30 ; G10L21/028 ; G10L21/0388 ; G10L19/008 ; G10L15/20 ; G10L21/0208 ; G10L21/0216

Abstract:
This specification describes computer-implemented methods and systems. One method includes receiving, by a neural network of a speech recognition system, first data representing a first raw audio signal and second data representing a second raw audio signal. The first raw audio signal and the second raw audio signal describe audio occurring at a same period of time. The method further includes generating, by a spatial filtering layer of the neural network, a spatial filtered output using the first data and the second data, and generating, by a spectral filtering layer of the neural network, a spectral filtered output using the spatial filtered output. Generating the spectral filtered output comprises processing frequency-domain data representing the spatial filtered output. The method still further includes processing, by one or more additional layers of the neural network, the spectral filtered output to predict sub-word units encoded in both the first raw audio signal and the second raw audio signal.
Public/Granted literature
- US20210295859A1 ENHANCED MULTI-CHANNEL ACOUSTIC MODELS Public/Granted day:2021-09-23
Information query