MULTI-DIALECT AND MULTILINGUAL SPEECH RECOGNITION

Invention Publication

US20240161732A1 MULTI-DIALECT AND MULTILINGUAL SPEECH RECOGNITION Under Examination - Published

Patent Title: MULTI-DIALECT AND MULTILINGUAL SPEECH RECOGNITION
Application No.: US18418246

Application Date: 2024-01-20
Publication No.: US20240161732A1

Publication Date: 2024-05-16
Inventor: Zhifeng Chen, Bo Li, Eugene Weinstein, Yonghui Wu, Pedro J. Moreno Mengibar, Ron J. Weiss, Khe Chai Sim, Tara N. Sainath, Patrick An Phu Nguyen
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G10L15/00
IPC: G10L15/00; G10L15/07; G10L15/16

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer-readable media, for speech recognition using multi-dialect and multilingual models. In some implementations, audio data indicating audio characteristics of an utterance is received. Input features determined based on the audio data are provided to a speech recognition model that has been trained to output scores indicating the likelihood of linguistic units for each of multiple different languages or dialects. The speech recognition model can be one that has been trained using cluster adaptive training. Output that the speech recognition model generated in response to receiving the input features is then received, and a transcription of the utterance, generated based on that output, is provided.
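
The flow described in the abstract (input features in, per-unit scores out, transcription produced from the scores) can be illustrated with a minimal Python sketch. This is not the claimed implementation, which this record does not show. It assumes a toy linear scoring layer, a three-symbol unit inventory, and made-up dialect names and weights; every name below (UNITS, BASES, DIALECT_WEIGHTS, recognize) is hypothetical. The only idea borrowed from cluster adaptive training is its general principle that model parameters are formed as a weighted combination of shared cluster bases.

```python
import math

# Hypothetical inventory of linguistic units (illustration only).
UNITS = ["a", "b", "c"]

# Two hypothetical cluster bases; each is a (units x feature_dim) matrix.
BASES = [
    [[2.0, 0.0], [0.0, 2.0], [-1.0, -1.0]],
    [[0.0, 2.0], [2.0, 0.0], [-1.0, -1.0]],
]

# Hypothetical per-dialect interpolation weights over the cluster bases.
DIALECT_WEIGHTS = {"dialect_1": [1.0, 0.0], "dialect_2": [0.0, 1.0]}


def interpolate_bases(bases, weights):
    """Form one parameter matrix as a weighted sum of the cluster bases."""
    rows, cols = len(bases[0]), len(bases[0][0])
    return [
        [sum(w * b[r][c] for w, b in zip(weights, bases)) for c in range(cols)]
        for r in range(rows)
    ]


def softmax(values):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def score_frames(features, matrix):
    """Return, for every feature frame, one probability per linguistic unit."""
    scores = []
    for frame in features:
        logits = [sum(w * x for w, x in zip(row, frame)) for row in matrix]
        scores.append(softmax(logits))
    return scores


def recognize(features, dialect):
    """Score the frames for one dialect and emit the best unit per frame."""
    matrix = interpolate_bases(BASES, DIALECT_WEIGHTS[dialect])
    scores = score_frames(features, matrix)
    return "".join(UNITS[s.index(max(s))] for s in scores)


# Three toy feature frames standing in for features derived from audio data.
FEATURES = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
```

With these toy values, recognize(FEATURES, "dialect_1") returns "aab" and recognize(FEATURES, "dialect_2") returns "bba": the same input features yield different unit scores once the shared bases are combined with different dialect weights. A practical system would replace the per-frame argmax with a proper decoder.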

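IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)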