-
公开(公告)号:US20250124916A1
公开(公告)日:2025-04-17
申请号:US18905343
申请日:2024-10-03
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Yongsen Jiang , Shunping Ye
IPC: G10L15/187 , G10L15/06
Abstract: Embodiments of the present disclosure provide an audio processing method and apparatus, an electronic device, and a storage medium. The method includes: obtaining first audio and first text corresponding to the first audio; predicting a first pronunciation sequence for the first text by a first pronunciation prediction system based on the first audio and the first text, where tones of pronunciations of characters in the first text that are labeled in the first pronunciation sequence include neutral tones and/or third tones after tone sandhi; and the first third tone in two consecutive third tones in the first text is labeled as a third tone after tone sandhi in the first pronunciation sequence; and correcting a neutral tone in the first pronunciation sequence by a second pronunciation prediction system, and/or correcting a third tone after tone sandhi in the first pronunciation sequence by a third pronunciation prediction system.