-
Publication No.: US20250124916A1
Publication Date: 2025-04-17
Application No.: US18905343
Filing Date: 2024-10-03
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Yongsen Jiang, Shunping Ye
IPC: G10L15/187, G10L15/06
Abstract: Embodiments of the present disclosure provide an audio processing method and apparatus, an electronic device, and a storage medium. The method includes: obtaining first audio and first text corresponding to the first audio; predicting a first pronunciation sequence for the first text by a first pronunciation prediction system based on the first audio and the first text, where tones of pronunciations of characters in the first text that are labeled in the first pronunciation sequence include neutral tones and/or third tones after tone sandhi; and the first third tone in two consecutive third tones in the first text is labeled as a third tone after tone sandhi in the first pronunciation sequence; and correcting a neutral tone in the first pronunciation sequence by a second pronunciation prediction system, and/or correcting a third tone after tone sandhi in the first pronunciation sequence by a third pronunciation prediction system.
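The labeling rule stated in this abstract (the first of two consecutive third tones is labeled as a third tone after tone sandhi) can be illustrated with a minimal sketch. This is not the patented prediction system. It assumes pinyin syllables written with a trailing tone digit, and the function name `label_third_tone_sandhi` and the `*` marker are hypothetical conventions chosen for illustration.

```python
def label_third_tone_sandhi(syllables):
    """Mark the first of two consecutive third tones as a sandhi tone.

    `syllables` is a list of pinyin strings ending in a tone digit,
    e.g. "ni3". A trailing "*" (hypothetical marker) denotes a third
    tone after tone sandhi.
    """
    labeled = list(syllables)
    # Compare each syllable with its successor in the ORIGINAL sequence,
    # so every third tone followed by another third tone gets marked.
    for i in range(len(syllables) - 1):
        if syllables[i].endswith("3") and syllables[i + 1].endswith("3"):
            labeled[i] = syllables[i] + "*"
    return labeled


if __name__ == "__main__":
    print(label_third_tone_sandhi(["ni3", "hao3"]))
```

The sketch applies the rule purely pairwise. How runs of three or more third tones should be labeled depends on phrasing, which the abstract does not specify, so the pairwise treatment here is an assumption.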
-
Publication No.: US20240428784A1
Publication Date: 2024-12-26
Application No.: US18687753
Filing Date: 2022-08-09
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Yongsen Jiang
IPC: G10L15/197, G10L15/02, G10L15/08, G10L15/22
Abstract: Embodiments of the present disclosure provide a method, apparatus, electronic device, and storage medium for text content matching. The method of text content matching includes: in accordance with a collection of to-be-processed speech information, determining a to-be-processed acoustic feature corresponding to the to-be-processed speech information (S110); processing, based on an audio following method, the to-be-processed acoustic feature to obtain a to-be-matched utterance corresponding to the to-be-processed acoustic feature (S120); and determining a target utterance associated with the to-be-matched utterance in target text and differentiating a display of the target utterance in the target text (S130).
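Step S130 (finding the target utterance in the target text and displaying it differently) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how matching is done, so plain string similarity from Python's standard `difflib` stands in for it, and the function names and the `**` highlight marker are hypothetical.

```python
from difflib import SequenceMatcher


def find_target_utterance(recognized, sentences):
    """Return the index of the sentence most similar to `recognized`.

    Returns -1 if `sentences` is empty or nothing is similar at all.
    String similarity is an assumed stand-in for the matching step.
    """
    best_index, best_score = -1, 0.0
    for i, sentence in enumerate(sentences):
        score = SequenceMatcher(None, recognized.lower(), sentence.lower()).ratio()
        if score > best_score:
            best_index, best_score = i, score
    return best_index


def highlight(sentences, index):
    """Wrap the sentence at `index` in a marker so it displays differently."""
    return [f"**{s}**" if i == index else s for i, s in enumerate(sentences)]


if __name__ == "__main__":
    target_text = [
        "Good morning everyone.",
        "Today we discuss speech recognition.",
        "Thank you for listening.",
    ]
    idx = find_target_utterance("today we discuss speech recognition", target_text)
    print("\n".join(highlight(target_text, idx)))
```

The recognized utterance would come from the earlier steps (S110, S120), which are outside the scope of this sketch.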
-
Publication No.: US20240296830A1
Publication Date: 2024-09-05
Application No.: US18573126
Filing Date: 2022-08-24
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Fan Yang, Ziqi Sun, Yongsen Jiang, Bingchuan Li, Rongkun Gao
CPC classification number: G10L15/02, G06V40/161, G10L15/08, G10L2015/088
Abstract: The disclosure discloses a data processing method, apparatus, electronic device, and storage medium. The data processing method includes: collecting audio and video frame data associated with a target user, wherein the audio and video frame data includes voice information to be processed and a face image to be processed; processing the face image to be processed based on a target line-of-sight angle adjustment model to obtain a target face image corresponding to the face image to be processed; performing a following process on the voice information to be processed based on an audio content following method and determining a target sentence in a target text associated with the voice information to be processed; and displaying the target sentence and the target face image separately on clients associated with the target user, or displaying the target sentence and the target face image on a client associated with the target user together.
-