Speech synthesis method and apparatus and computer readable storage medium using the same

Granted invention patent

US11417316B2 Speech synthesis method and apparatus and computer readable storage medium using the same (in force)

Patent title: Speech synthesis method and apparatus and computer readable storage medium using the same
Application number: US17115729

Filing date: 2020-12-08
Publication (announcement) number: US11417316B2

Publication (announcement) date: 2022-08-16
Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
Applicant: UBTECH ROBOTICS CORP LTD
Applicant address: CN Shenzhen
Patentee: UBTECH ROBOTICS CORP LTD
Current patentee: UBTECH ROBOTICS CORP LTD
Current patentee address: CN Shenzhen
Main classification: G10L13/08
IPC classification: G10L13/08; G10L13/047; G10L25/24

Abstract:

The present disclosure provides a speech synthesis method as well as an apparatus and a computer readable storage medium using the same. The method includes: obtaining a to-be-synthesized text, and extracting to-be-processed Mel spectrum features of the to-be-synthesized text through a preset speech feature extraction algorithm; inputting the to-be-processed Mel spectrum features into a preset ResUnet network model to obtain first intermediate features; performing an average pooling and a first down sampling on the to-be-processed Mel spectrum features to obtain second intermediate features; taking the second intermediate features and the first intermediate features output by the ResUnet network model as an input to perform a deconvolution and a first up sampling so as to obtain target Mel spectrum features corresponding to the to-be-processed Mel spectrum features; and converting the target Mel spectrum features into a target speech corresponding to the to-be-synthesized text.
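The data flow named in the abstract can be sketched as a small shape-level illustration. Everything here is an assumption made for illustration only: the function names are hypothetical, the ResUnet model is replaced by an identity stub, nearest-neighbour up sampling stands in for the learned deconvolution, and the two branches are merged by simple averaging, since the abstract does not specify how they are combined. The conversion of Mel features to speech is not covered.

```python
from typing import List

# Rows are Mel bins, columns are time frames (hypothetical layout).
Matrix = List[List[float]]


def avg_pool_downsample(mel: Matrix, factor: int = 2) -> Matrix:
    """Average pooling with stride `factor` on both axes.

    Pooling and down sampling are fused here; both dimensions of `mel`
    are assumed to be divisible by `factor`.
    """
    rows, cols = len(mel) // factor, len(mel[0]) // factor
    return [
        [
            sum(
                mel[r * factor + i][c * factor + j]
                for i in range(factor)
                for j in range(factor)
            ) / factor ** 2
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def upsample_nearest(feat: Matrix, factor: int = 2) -> Matrix:
    """Nearest-neighbour up sampling; a stand-in for a learned deconvolution."""
    out: Matrix = []
    for row in feat:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


def resunet_stub(mel: Matrix) -> Matrix:
    """Placeholder for the preset ResUnet model: identity, same shape."""
    return [list(row) for row in mel]


def post_process(mel: Matrix, factor: int = 2) -> Matrix:
    """Combine both branches into features at the input resolution."""
    first = resunet_stub(mel)                  # first intermediate features
    second = avg_pool_downsample(mel, factor)  # second intermediate features
    coarse = upsample_nearest(second, factor)  # back to the input resolution
    # Merging by element-wise averaging is an assumption of this sketch.
    return [
        [(a + b) / 2 for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(first, coarse)
    ]
```

In a real system the stub and the up sampling step would be trained network layers; the sketch only shows that the pooled branch is brought back to the resolution of the input Mel features before the target features are produced.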

Publication/grant documents

US20210193113A1 SPEECH SYNTHESIS METHOD AND APPARATUS AND COMPUTER READABLE STORAGE MEDIUM USING THE SAME Publication/grant date: 2021-06-24

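IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/08	.Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination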