Patent search: ap:("National Chung Cheng University") AND inv:"Tay Jyi LIN" (page 1)

1.

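Invention application
PERSONALIZED VOICE CONVERSION SYSTEM (legal status: in force)

Publication (announcement) number: US20230026329A1

Publication (announcement) date: 2023-01-26

Application number: US17475903

Filing date: 2021-09-15

Applicant: National Chung Cheng University

Inventors: Tay Jyi LIN, Yu Chia HU, Yi-Hsuan TING, Ching Wei YEH, Jinn-Shyan WANG

IPC classification: G10L15/30, G10L15/22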

Abstract: A personalized voice conversion system includes a cloud server and an intelligent device that communicates with the cloud server. The intelligent device uploads an original voice signal to the cloud server. The cloud server converts the original voice signal into an intelligible voice signal based on an intelligible voice conversion model. The intelligent device downloads and plays the intelligible voice signal. Based on the original voice signal and the corresponding intelligible voice signal, the cloud server and the intelligent device train an off-line voice conversion model, which is provided to the intelligent device. When the intelligent device stops communicating with the cloud server, it converts a new original voice signal into a new intelligible voice signal based on the off-line voice conversion model and plays the new intelligible voice signal.
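The control flow in this abstract (use the cloud model while connected, collect signal pairs, fall back to the off-line model when disconnected) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class `IntelligentDevice`, the toy models, and the representation of a voice signal as a list of floats are all hypothetical, and the training of the off-line model is left out because the abstract does not describe it.

```python
from typing import Callable, List, Optional, Tuple

Signal = List[float]  # assumption: a voice signal is modeled as a list of samples


class IntelligentDevice:
    """Hypothetical device that prefers the cloud model and falls back off-line."""

    def __init__(self,
                 cloud_convert: Optional[Callable[[Signal], Signal]],
                 offline_convert: Callable[[Signal], Signal]) -> None:
        # cloud_convert is None when the device is not communicating with the server
        self.cloud_convert = cloud_convert
        self.offline_convert = offline_convert
        # (original, intelligible) pairs that could later train the off-line model
        self.training_pairs: List[Tuple[Signal, Signal]] = []

    def convert(self, original: Signal) -> Signal:
        if self.cloud_convert is not None:
            # Connected: send the original signal up, get the intelligible one back
            intelligible = self.cloud_convert(original)
            self.training_pairs.append((original, intelligible))
            return intelligible
        # Disconnected: use the off-line model stored on the device
        return self.offline_convert(original)


def toy_cloud_model(signal: Signal) -> Signal:
    """Stand-in for the cloud-side conversion model (illustrative only)."""
    return [2.0 * s for s in signal]


def toy_offline_model(signal: Signal) -> Signal:
    """Stand-in for the off-line conversion model (illustrative only)."""
    return [s + 1.0 for s in signal]
```

In this sketch, setting `cloud_convert` to `None` models the device losing its connection; subsequent calls to `convert` then go through the off-line model only.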

2.

Invention application
DEVICE AND METHOD FOR GENERATING SYNCHRONOUS CORPUS (legal status: in force)

Publication (announcement) number: US20210225384A1

Publication (announcement) date: 2021-07-22

Application number: US16823036

Filing date: 2020-03-18

Applicant: National Chung Cheng University

Inventors: Tay Jyi LIN, Ching Wei YEH, Shun Pu YANG, Chen Zong LIAO

IPC classification: G10L21/02, G10L13/04, G10L15/26, G10L15/187, G10L25/30, G10L25/66

Abstract: A device and a method for generating a synchronous corpus are disclosed. First, script data and a dysarthria voice signal having a dysarthria consonant signal are received, and the position of the dysarthria consonant signal is detected; the script data contain text corresponding to the dysarthria voice signal. Then, normal phoneme data corresponding to the text are looked up, and the text is converted into a normal voice signal based on those normal phoneme data. The dysarthria consonant signal is replaced with the corresponding normal consonant signal of the normal voice signal, based on the positions of the normal consonant signal and the dysarthria consonant signal, thereby synchronously converting the dysarthria voice signal into a synthesized voice signal. The synthesized voice signal and the dysarthria voice signal are provided to train a voice conversion model that retains the timbre of the dysarthria voice and improves communication.
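The position-based consonant replacement described in this abstract can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented method: signals are modeled as lists of samples, consonant positions are given as half-open sample ranges, and the function names `fit_length` and `replace_consonant` are hypothetical. Resampling the normal consonant to the length of the dysarthria consonant is an assumption made here so that the synthesized signal stays time-aligned with the original; the abstract does not specify how alignment is achieved.

```python
from typing import List, Tuple

Signal = List[float]    # assumption: a voice signal is a list of samples
Span = Tuple[int, int]  # assumption: a position is a [start, end) sample range


def fit_length(segment: Signal, length: int) -> Signal:
    """Nearest-index resampling so the segment has exactly `length` samples."""
    if length == 0 or not segment:
        return [0.0] * length
    return [segment[i * len(segment) // length] for i in range(length)]


def replace_consonant(dysarthria: Signal, dys_span: Span,
                      normal: Signal, norm_span: Span) -> Signal:
    """Replace the dysarthria consonant with the normal consonant by position."""
    d_start, d_end = dys_span
    n_start, n_end = norm_span
    normal_consonant = normal[n_start:n_end]
    # Match the length of the replaced span so the output keeps its timing
    patch = fit_length(normal_consonant, d_end - d_start)
    # Everything outside the consonant span is kept from the dysarthria signal
    return dysarthria[:d_start] + patch + dysarthria[d_end:]
```

Because only the consonant span is replaced, the rest of the synthesized signal is taken unchanged from the dysarthria signal, and the two signals have the same length and can be paired sample by sample as training data.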