-
公开(公告)号:US20230360634A1
公开(公告)日:2023-11-09
申请号:US18356738
申请日:2023-07-21
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Nianzu ZHENG , Disong WANG , Liqun DENG , Yang ZHANG
Abstract: The present disclosure relates to text data processing methods and apparatuses. One example method includes obtaining target text, where a phoneme of the target text includes a first phoneme and a second phoneme that are adjacent to each other. Feature extraction is performed on the first phoneme and the second phoneme to obtain a first audio feature of the first phoneme and a second audio feature of the second phoneme. By using a target recurrent neural network (RNN) and based on the first audio feature, first speech data corresponding to the first phoneme is obtained.By using the target RNN and based on the second audio feature, second speech data corresponding to the second phoneme is obtained.By using a vocoder and based on the first speech data and the second speech data, audio corresponding to the first phoneme and audio corresponding to the second phoneme are obtained.