- Patent title: Speech synthesis method and apparatus and computer readable storage medium using the same
- Application number: US17115729
- Filing date: 2020-12-08
- Publication number: US11417316B2
- Publication date: 2022-08-16
- Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
- Applicant: UBTECH ROBOTICS CORP LTD
- Applicant address: CN Shenzhen
- Assignee: UBTECH ROBOTICS CORP LTD
- Current assignee: UBTECH ROBOTICS CORP LTD
- Current assignee address: CN Shenzhen
- Main classification: G10L13/08
- IPC classifications: G10L13/08; G10L13/047; G10L25/24
Abstract:
The present disclosure provides a speech synthesis method as well as an apparatus and a computer readable storage medium using the same. The method includes: obtaining a to-be-synthesized text, and extracting to-be-processed Mel spectrum features of the to-be-synthesized text through a preset speech feature extraction algorithm; inputting the to-be-processed Mel spectrum features into a preset ResUnet network model to obtain first intermediate features; performing an average pooling and a first down sampling on the to-be-processed Mel spectrum features to obtain second intermediate features; taking the second intermediate features and the first intermediate features output by the ResUnet network model as an input to perform a deconvolution and a first up sampling so as to obtain target Mel spectrum features corresponding to the to-be-processed Mel spectrum features; and converting the target Mel spectrum features into a target speech corresponding to the to-be-synthesized text.
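The pooling and sampling steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract gives no kernel sizes or strides, so the factor of 2 is an assumption, the toy values are made up, and `upsample_nearest` is a hypothetical fixed stand-in for the learned deconvolution and up-sampling. The ResUnet branch is omitted entirely. Plain Python lists are used to keep the example dependency-free.

```python
from typing import List

Matrix = List[List[float]]  # rows = Mel bands, columns = time frames


def avg_pool_downsample(mel: Matrix, factor: int = 2) -> Matrix:
    """Average non-overlapping factor x factor blocks (pooling + down-sampling).

    Assumption: kernel size == stride == factor; the abstract gives no values.
    Trailing rows/columns that do not fill a whole block are dropped.
    """
    rows = len(mel) // factor
    cols = len(mel[0]) // factor
    out = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            block = [
                mel[r * factor + i][c * factor + j]
                for i in range(factor)
                for j in range(factor)
            ]
            out_row.append(sum(block) / len(block))
        out.append(out_row)
    return out


def upsample_nearest(feat: Matrix, factor: int = 2) -> Matrix:
    """Repeat each value `factor` times along both axes.

    Stand-in only: the abstract describes a deconvolution followed by
    up-sampling, which would use learned weights rather than plain repetition.
    """
    out = []
    for row in feat:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


# Toy 4-band x 4-frame "Mel spectrum" (made-up values).
mel = [
    [1.0, 3.0, 5.0, 7.0],
    [1.0, 3.0, 5.0, 7.0],
    [2.0, 2.0, 4.0, 4.0],
    [2.0, 2.0, 4.0, 4.0],
]
pooled = avg_pool_downsample(mel)    # 2 x 2 "second intermediate features"
restored = upsample_nearest(pooled)  # back to 4 x 4, same shape as the input
```

The point of the sketch is the shape bookkeeping: pooling halves both axes, and the up-sampling stage restores the original Mel-spectrum dimensions so the result can be passed on to a vocoder.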