情感语音合成方法和装置

发明授权

CN102005205B 情感语音合成方法和装置失效 - 权利终止

请登陆查看更多内容

专利标题： 情感语音合成方法和装置
专利标题（英）： Emotional speech synthesizing method and device
申请号： CN200910170713.1

申请日： 2009-09-03
公开(公告)号： CN102005205B

公开(公告)日： 2012-10-03
发明人: 栾剑 , 李健
申请人： 株式会社东芝
申请人地址： 日本东京都
专利权人： 株式会社东芝
当前专利权人： 株式会社东芝
当前专利权人地址： 日本东京都
代理机构： 北京市中咨律师事务所
代理商 于静; 刘瑞东
主分类号： G10L13/02
IPC分类号： G10L13/02 ; G10L13/04

摘要：

本发明提供了情感语音合成方法和装置。根据本发明的一个方面，提供了一种情感语音合成方法，包括以下步骤：输入文本句；利用由第一说话人的中立语音库训练获得的中立特征模型，预测上述文本句在上述第一说话人的第一特征空间中的中立特征向量；利用由上述中立语音库和第二说话人的平行语音库训练获得的说话人规整模型，将上述中立特征向量变换为上述第二说话人的第二特征空间中的规整中立特征向量；利用由上述平行语音库训练获得的情感转换模型，将上述规整中立特征向量转换为上述第二特征空间中的规整情感特征向量；利用上述说话人规整模型，将上述规整情感特征向量逆变换为上述第一特征空间中的情感特征向量；以及利用上述第一特征空间中的情感特征向量合成出第一说话人的情感语音。

摘要（英）：

The invention provides an emotional speech synthesizing method and device. According to an aspect of the invention, the emotional speech synthesizing method comprises the following steps: inputting a text sentence; forecasting a neutral characteristic vector of the text sentence in first characteristic space of a first speaker by utilizing a neutral characteristic model acquired by neutral voice database training of the first speaker; converting the neutral characteristic vector into a regularly neutral characteristic vector in second characteristic space of a second speaker by utilizing a speaker regular model acquired by the neutral voice database and parallel voice database training of the second speaker; converting the regularly neutral characteristic vector into a regular emotional characteristic vector in the first characteristic space by utilizing the speaker regular model; and synthesizing the emotional voice of the first speaker by utilizing the emotional characteristic vector in the first characteristic space.

公开/授权文献

CN102005205A 情感语音合成方法和装置公开/授权日：2011-04-06

信息查询

中国专利公布公告 Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备