Real-time accent conversion model

Granted invention patent

US11948550B2 Real-time accent conversion model (in force)

Patent title: Real-time accent conversion model
Application number: US17460145

Filing date: 2021-08-27
Publication number: US11948550B2

Publication date: 2024-04-02
Inventors: Maxim Serebryakov, Shawn Zhang
Applicant: Sanas.ai Inc.
Applicant address: US CA Pleasanton
Patentee: SANAS.AI INC.
Current patentee: SANAS.AI INC.
Current patentee address: US CA Pleasanton
Agency: Troutman Pepper Hamilton Sanders LLP
Main classification: G10L13/02
IPC classifications: G10L13/02; G06N20/20; G10L15/02; G10L25/27

Abstract:

Techniques for real-time accent conversion are described herein. An example computing device receives an indication of a first accent and a second accent. The computing device further receives, via at least one microphone, speech content having the first accent. The computing device is configured to derive, using a first machine-learning algorithm trained with audio data including the first accent, a linguistic representation of the received speech content having the first accent. The computing device is configured to, based on the derived linguistic representation of the received speech content having the first accent, synthesize, using a second machine-learning algorithm trained with (i) audio data comprising the first accent and (ii) audio data including the second accent, audio data representative of the received speech content having the second accent. The computing device is configured to convert the synthesized audio data into a synthesized version of the received speech content having the second accent.
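
The three stages named in the abstract (derive a linguistic representation, synthesize audio data for the second accent, convert that audio data into output speech) can be pictured as a chunk-by-chunk pipeline. The following is only a minimal sketch of that data flow, not the patented implementation: the class `AccentConverter`, its field names, the type aliases, and the three `toy_*` functions are all hypothetical stand-ins, and the toy functions do simple arithmetic in place of trained machine-learning models.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List

# Hypothetical placeholder types; the abstract does not specify data formats.
Samples = List[float]     # one chunk of microphone (or output) samples
Linguistic = List[int]    # stand-in for a linguistic representation
AudioData = List[float]   # stand-in for synthesized audio data


@dataclass
class AccentConverter:
    """Chains the three stages described in the abstract (illustrative only)."""
    first_accent: str
    second_accent: str
    derive: Callable[[Samples], Linguistic]             # first ML algorithm
    synthesize: Callable[[Linguistic, str], AudioData]  # second ML algorithm
    to_speech: Callable[[AudioData], Samples]           # audio data -> speech

    def convert_chunk(self, chunk: Samples) -> Samples:
        linguistic = self.derive(chunk)
        audio_data = self.synthesize(linguistic, self.second_accent)
        return self.to_speech(audio_data)

    def stream(self, chunks: Iterable[Samples]) -> Iterator[Samples]:
        # Processing one chunk at a time is what keeps the pipeline streaming.
        for chunk in chunks:
            yield self.convert_chunk(chunk)


# Toy stand-ins (assumptions): real stages would be trained models.
def toy_derive(chunk: Samples) -> Linguistic:
    return [1 if sample >= 0 else 0 for sample in chunk]


def toy_synthesize(linguistic: Linguistic, target_accent: str) -> AudioData:
    # target_accent is accepted but ignored by this toy function.
    return [float(unit) for unit in linguistic]


def toy_to_speech(audio_data: AudioData) -> Samples:
    return [2.0 * value - 1.0 for value in audio_data]


converter = AccentConverter(
    first_accent="accent-A",   # hypothetical accent labels
    second_accent="accent-B",
    derive=toy_derive,
    synthesize=toy_synthesize,
    to_speech=toy_to_speech,
)
```

Calling `converter.stream(...)` on an iterable of sample chunks yields one converted chunk per input chunk, so each stage can be swapped for a real model without changing the surrounding loop.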

Publication/grant documents

US20220358903A1 Real-Time Accent Conversion Model. Publication/grant date: 2022-11-10

Information lookup

Espacenet

IPC classification:

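G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers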