Invention Grant
- Patent Title: Clockwork hierarchal variational encoder
-
Application No.: US17650452Application Date: 2022-02-09
-
Publication No.: US11664011B2Publication Date: 2023-05-30
- Inventor: Robert Andrew James Clark , Chun-an Chan , Vincent Ping Leung Wan
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent Brett A. Krueger; Grant Griffith
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G10L15/22 ; G10L15/16 ; G10L25/18 ; G10L25/24 ; G10L15/02 ; G06N3/084 ; G10L25/21 ; G06N3/044 ; G06N3/045

Abstract:
A method of providing a frame-based mel spectral representation of speech includes receiving a text utterance having at least one word and selecting a mel spectral embedding for the text utterance. Each word has at least one syllable and each syllable has at least one phoneme. For each phoneme, the method further includes using the selected mel spectral embedding to: (i) predict a duration of the corresponding phoneme based on corresponding linguistic features associated with the word that includes the corresponding phoneme and corresponding linguistic features associated with the syllable that includes the corresponding phoneme; and (ii) generate a plurality of fixed-length predicted mel-frequency spectrogram frames based on the predicted duration for the corresponding phoneme. Each fixed-length predicted mel-frequency spectrogram frame represents mel-spectral information of the corresponding phoneme.
Public/Granted literature
- US20220172705A1 Clockwork Hierarchal Variational Encoder Public/Granted day:2022-06-02
Information query