Systems and methods of speech generation for target user given limited data

    公开(公告)号:US10418024B1

    公开(公告)日:2019-09-17

    申请号:US16035915

    申请日:2018-07-16

    Abstract: Systems and methods are provided for training an audio generation model for a first person using a first voice audio data and a first text transcript of the first voice audio data. Using a second voice audio data and a second text transcript of the second voice audio data, a plurality of pitch voice audio data for the second person may be generated with different pitches. The audio generation model may be trained for the second person using the generated plurality of pitch voice audio data with the different pitches for the second person. Output voice audio may be generated for the second person using received text and the model trained with the generated plurality of pitch voice audio data.

Patent Agency Ranking