END-TO-END TEXT-TO-SPEECH CONVERSION

Invention Application

US20200098350A1 END-TO-END TEXT-TO-SPEECH CONVERSION Under examination (published)

Patent Title: END-TO-END TEXT-TO-SPEECH CONVERSION
Application No.: US16696101

Application Date: 2019-11-26
Publication No.: US20200098350A1

Publication Date: 2020-03-26
Inventor: Samuel Bengio, Yuxuan Wang, Zongheng Yang, Zhifeng Chen, Yonghui Wu, Ioannis Agiomyrgiannakis, Ron J. Weiss, Navdeep Jaitly, Ryan M. Rifkin, Robert Andrew James Clark, Quoc V. Le, Russell J. Ryan, Ying Xiao
Applicant: Google LLC
Priority: GR20170100126 20170329
Main IPC: G10L13/08
IPC: G10L13/08; G10L15/16; G06N3/08; G06N3/04; G10L13/04; G10L25/30; G10L25/18

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
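
The data flow described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the class names `ToySeq2SeqRNN` and `TextToSpeechSubsystem`, the sizes `N_BINS` and `HIDDEN`, the random untrained weights, and the fixed `frames_per_char` output length are all assumptions made for illustration. It shows only the two roles the abstract names: a subsystem that receives characters and forwards them, and a recurrent sequence-to-sequence network that returns a spectrogram (a list of frames, each a list of non-negative magnitudes).

```python
import math
import random

N_BINS = 8    # hypothetical number of frequency bins per spectrogram frame
HIDDEN = 16   # hypothetical recurrent state size


def _matrix(rows, cols, rng):
    """Small random weight matrix standing in for trained parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def _matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in m]


class ToySeq2SeqRNN:
    """Untrained stand-in for the sequence-to-sequence recurrent network."""

    def __init__(self, vocab, seed=0):
        rng = random.Random(seed)
        self.char_to_id = {c: i for i, c in enumerate(vocab)}
        self.embed = _matrix(len(vocab), HIDDEN, rng)
        self.w_enc = _matrix(HIDDEN, HIDDEN, rng)
        self.w_dec = _matrix(HIDDEN, HIDDEN, rng)
        self.w_out = _matrix(N_BINS, HIDDEN, rng)

    def generate_spectrogram(self, chars, frames_per_char=2):
        # Encoder: fold the character sequence into one recurrent state.
        h = [0.0] * HIDDEN
        for c in chars:
            x = self.embed[self.char_to_id[c]]
            h = [math.tanh(a + b) for a, b in zip(_matvec(self.w_enc, h), x)]
        # Decoder: unroll the state into spectrogram frames.
        # Assumption: a fixed number of frames per input character.
        frames = []
        for _ in range(frames_per_char * len(chars)):
            h = [math.tanh(a) for a in _matvec(self.w_dec, h)]
            frames.append([abs(v) for v in _matvec(self.w_out, h)])
        return frames


class TextToSpeechSubsystem:
    """Receives text and provides its characters as input to the network."""

    def __init__(self, network):
        self.network = network

    def convert(self, text):
        return self.network.generate_spectrogram(list(text))


network = ToySeq2SeqRNN(vocab="abcdefghijklmnopqrstuvwxyz ")
spectrogram = TextToSpeechSubsystem(network).convert("hello world")
print(len(spectrogram), len(spectrogram[0]))  # prints "22 8"
```

Because the weights are random, the output values carry no acoustic meaning; only the interface (characters in, spectrogram frames out) mirrors the abstract.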

Published/Granted Documents

US11107457B2 End-to-end text-to-speech conversion Publication/Grant date: 2021-08-31

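IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/08	.Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination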