-
公开(公告)号:US20210280202A1
公开(公告)日:2021-09-09
申请号:US17330126
申请日:2021-05-25
Inventor: Xilei WANG , Wersfu WANG , Tao SUN
IPC: G10L21/013 , G10L13/033 , G10L15/02 , G10L25/30 , G06N3/08
Abstract: The disclosure provides a voice conversion method, a voice conversion apparatus, an electronic device, and a storage medium, related to the field of voice conversion, speech interaction, natural language processing, and deep learning. The method includes: acquiring a source speech of a first user and a reference speech of a second user; extracting first speech content information and a first acoustic feature from the source speech; extracting a second acoustic feature from the reference speech; acquiring a reconstructed third acoustic feature by inputting the first speech content information, the first acoustic feature, and the second acoustic feature into a pre-trained voice conversion model, in which the pre-trained voice conversion model is acquired by training based on speeches of a third user; and synthesizing a target speech based on the third acoustic feature.