Invention Application
- Patent Title: Clockwork Hierarchical Variational Encoder
-
Application No.: US16678981Application Date: 2019-11-08
-
Publication No.: US20200074985A1Publication Date: 2020-03-05
- Inventor: Robert Andrew James Clark , Chun-an Chan , Vincent Ping Leung Wan
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G10L15/22 ; G10L15/16 ; G10L25/18 ; G10L25/24 ; G10L25/21 ; G10L15/02 ; G06N3/04 ; G06N3/08

Abstract:
A method for providing a frame-based mel spectral representation of speech includes receiving a text utterance having at least one word, and selecting a mel spectral embedding for the text utterance. Each word in the text utterance has at least one syllable and each syllable has at least one phoneme. For each phoneme, using the selected mel spectral embedding, the method also includes: predicting a duration of the corresponding phoneme by encoding linguistic features of the corresponding phoneme with a corresponding syllable embedding for the syllable that includes the corresponding phoneme; and generating a plurality of fixed-length predicted mel-frequency spectrogram frames based on the predicted duration for the corresponding phoneme. Each fixed-length predicted mel-frequency spectrogram frame representing mel-spectral information of the corresponding phoneme.
Public/Granted literature
- US11264010B2 Clockwork hierarchical variational encoder Public/Granted day:2022-03-01
Information query