Phonemes And Graphemes for Neural Text-to-Speech

Invention Application

US20220310059A1 Phonemes And Graphemes for Neural Text-to-Speech 有权

Please log in to see more content

Patent Title: Phonemes And Graphemes for Neural Text-to-Speech
Application No.: US17643684

Application Date: 2021-12-10
Publication No.: US20220310059A1

Publication Date: 2022-09-29
Inventor: Ye Jia , Byungha Chun , Yu Zhang , Jonathan Shen , Yonghui Wu
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G10L13/08
IPC: G10L13/08 ; G06F40/279 ; G06F40/263 ; G06N3/08

Phonemes And Graphemes for Neural Text-to-Speech

Abstract:

A method includes receiving a text input including a sequence of words represented as an input encoder embedding. The input encoder embedding includes a plurality of tokens, with the plurality of tokens including a first set of grapheme tokens representing the text input as respective graphemes and a second set of phoneme tokens representing the text input as respective phonemes. The method also includes, for each respective phoneme token of the second set of phoneme tokens: identifying a respective word of the sequence of words corresponding to the respective phoneme token and determining a respective grapheme token representing the respective word of the sequence of words corresponding to the respective phoneme token. The method also includes generating an output encoder embedding based on a relationship between each respective phoneme token and the corresponding grapheme token determined to represent a same respective word as the respective phoneme token.

Public/Granted literature

US12020685B2 Phonemes and graphemes for neural text-to-speech Public/Granted day:2024-06-25

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定