Speaker speech adaptive training method

Invention publication

Patent title: Speaker speech adaptive training method (original: 一种说话人语音自适应训练方法)
Patent title (English): Speaker speech adaptive training method
Application number: CN201810576452.2

Filing date: 2018-06-06
Publication (announcement) number: CN109036370A

Publication (announcement) date: 2018-12-18
Inventors: 赵峰, 徐海青, 吴立刚, 章爱武, 潘子春, 李葵, 李明, 张引强, 黄影, 陈是同, 徐唯耀, 秦浩, 王文清, 郑娟, 王维佳, 秦婷, 梁翀, 浦正国, 张天奇, 余江斌, 韩涛, 杨维, 张才俊
Applicants: 安徽继远软件有限公司, 国网信息通信产业集团有限公司, 国网安徽省电力有限公司信息通信分公司, 国家电网有限公司
Applicant address: No. 1800 Xiyou Road, High-tech Zone, Hefei, Anhui Province
Patentees: 安徽继远软件有限公司, 国网信息通信产业集团有限公司, 国网安徽省电力有限公司信息通信分公司, 国家电网有限公司
Current patentees: 安徽继远软件有限公司, 国网信息通信产业集团有限公司, 国网安徽省电力有限公司信息通信分公司, 国家电网有限公司
Current patentee address: No. 1800 Xiyou Road, High-tech Zone, Hefei, Anhui Province
Agency: 合肥维可专利代理事务所
Agent: 吴明华
Main classification: G10L13/02
IPC classification: G10L13/02; G10L25/03; G10L25/27

Abstract:

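The invention discloses a speaker speech adaptive training method in the field of speech synthesis. The method comprises: providing training emotional speech data and target-speaker emotional speech data; representing the acoustic parameters, and estimating and modeling their state output distributions and duration distributions; normalizing the differences between the state output distributions of the training speech data models and those of the average voice model, to obtain an average voice model for the target speaker's emotional speech data; and applying a speaker adaptive transformation to the average voice model to obtain a speaker-dependent adapted model. The adapted model obtained with the method exemplified by the invention is used for speech synthesis. It reduces the influence of differences among the speakers in the speech corpus and improves the emotional similarity of the synthesized speech, so that only a small amount of emotional speech data of the kind to be synthesized is needed to synthesize emotional speech with good naturalness, fluency, and emotional similarity.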

Abstract (English):

The invention discloses a speaker speech adaptive training method and belongs to the technical field of speech synthesis. The method comprises: giving training emotional speech data and target speaker emotional speech data; representing the acoustic parameters, and estimating and modeling the state output distribution and the duration distribution of the acoustic parameters; normalizing the difference between the state output distribution of the training speech data model and the state output distribution of the average voice model to obtain the average voice model of the target speaker emotional speech data; and subjecting the average voice model to a speaker adaptive transformation to obtain a speaker-dependent adaptive model. The adaptive model obtained by the method is used for speech synthesis. It can reduce the influence caused by differences among speakers in the speech library and improve the emotional similarity of the synthesized speech, and it can synthesize emotional speech with good naturalness, fluency, and emotional similarity from only a small amount of the emotional corpus to be synthesized.
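The last step of the abstract, transforming an average voice model into a speaker-dependent model from a small amount of target data, can be illustrated with a short sketch. The abstract does not state the form of the transformation, so everything below is an assumption made for illustration: state output distributions are reduced to their mean vectors, the transform is a per-dimension scale and bias fitted by least squares (a much simplified stand-in for the full-matrix, maximum-likelihood linear transforms commonly used in HMM-based synthesis), and the function names `estimate_diagonal_transform` and `adapt_means` are hypothetical.

```python
from typing import List, Tuple

Vector = List[float]


def estimate_diagonal_transform(
    avg_means: List[Vector], target_means: List[Vector]
) -> Tuple[Vector, Vector]:
    """Fit target ≈ scale * avg + bias independently for each dimension.

    avg_means and target_means hold the mean vectors of the same states,
    taken from the average voice model and from the target speaker's data.
    (Assumed simplification: a diagonal least-squares transform.)
    """
    n = len(avg_means)
    dim = len(avg_means[0])
    scale: Vector = []
    bias: Vector = []
    for d in range(dim):
        xs = [m[d] for m in avg_means]
        ys = [m[d] for m in target_means]
        mean_x = sum(xs) / n
        mean_y = sum(ys) / n
        var_x = sum((x - mean_x) ** 2 for x in xs)
        cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        # Fall back to an identity scale if this dimension never varies.
        a = cov_xy / var_x if var_x > 0 else 1.0
        scale.append(a)
        bias.append(mean_y - a * mean_x)
    return scale, bias


def adapt_means(avg_means: List[Vector], scale: Vector, bias: Vector) -> List[Vector]:
    """Apply the shared transform to every state mean of the average voice model."""
    return [[a * x + b for x, a, b in zip(m, scale, bias)] for m in avg_means]


# Toy data: three states observed in the target speaker's small corpus.
observed_avg = [[0.0, 0.0], [1.0, 2.0], [2.0, 4.0]]
observed_target = [[1.0, -1.0], [3.0, 0.0], [5.0, 1.0]]
scale, bias = estimate_diagonal_transform(observed_avg, observed_target)

# A state the target speaker never produced is still adapted,
# because the transform is shared across states.
unseen_adapted = adapt_means([[3.0, 6.0]], scale, bias)
```

Sharing one transform across many states is what lets a small adaptation corpus update the whole model: states with no target data inherit the mapping estimated from the states that were observed.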

Publication/grant documents

CN109036370B Speaker speech adaptive training method. Publication/grant date: 2021-07-20

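IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesizers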