AUDIO PROCESSING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    Publication No.: US20250124916A1

    Publication Date: 2025-04-17

    Application No.: US18905343

    Filing Date: 2024-10-03

    Abstract: Embodiments of the present disclosure provide an audio processing method and apparatus, an electronic device, and a storage medium. The method includes: obtaining first audio and first text corresponding to the first audio; predicting a first pronunciation sequence for the first text by a first pronunciation prediction system based on the first audio and the first text, where tones of pronunciations of characters in the first text that are labeled in the first pronunciation sequence include neutral tones and/or third tones after tone sandhi; and the first third tone in two consecutive third tones in the first text is labeled as a third tone after tone sandhi in the first pronunciation sequence; and correcting a neutral tone in the first pronunciation sequence by a second pronunciation prediction system, and/or correcting a third tone after tone sandhi in the first pronunciation sequence by a third pronunciation prediction system.
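The labeling rule stated in this abstract (the first of two consecutive third tones is marked as a third tone after tone sandhi) can be illustrated with a minimal rule-based sketch. This is not the patent's prediction system, which works from both audio and text. The function name `label_third_tone_sandhi`, the `"3s"` marker, the use of `5` for the neutral tone, and the handling of runs longer than two third tones are all assumptions made for illustration.

```python
def label_third_tone_sandhi(tones):
    """Label tones, marking a third tone that directly precedes another third tone.

    `tones` is a list of integers: 1-4 for the four Mandarin tones and
    5 for the neutral tone (an assumed encoding, not taken from the patent).
    A third tone followed by another third tone is labeled "3s"
    (hypothetical marker for "third tone after tone sandhi").
    """
    labeled = []
    for i, tone in enumerate(tones):
        next_is_third = i + 1 < len(tones) and tones[i + 1] == 3
        if tone == 3 and next_is_third:
            labeled.append("3s")
        else:
            labeled.append(str(tone))
    return labeled


# "你好" (ni3 hao3): the first syllable carries the sandhi label.
print(label_third_tone_sandhi([3, 3]))
```

In the disclosed method these labels are then corrected by separate prediction systems; the sketch covers only the initial labeling convention.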

    METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM FOR TEXT CONTENT MATCHING

    Publication No.: US20240428784A1

    Publication Date: 2024-12-26

    Application No.: US18687753

    Filing Date: 2022-08-09

    Inventor: Yongsen Jiang

    Abstract: Embodiments of the present disclosure provide a method, apparatus, electronic device, and storage medium for text content matching. The method of text content matching includes: in accordance with a collection of to-be-processed speech information, determining a to-be-processed acoustic feature corresponding to the to-be-processed speech information (S110); processing, based on an audio following method, the to-be-processed acoustic feature to obtain a to-be-matched utterance corresponding to the to-be-processed acoustic feature (S120); and determining a target utterance associated with the to-be-matched utterance in target text and differentiating a display of the target utterance in the target text (S130).
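Step S130 (finding the target utterance in the target text and displaying it differently) can be sketched as follows, assuming the to-be-matched utterance is already available as a text string. The abstract does not say how matching is performed, so the use of `difflib.SequenceMatcher`, the function name `highlight_target_utterance`, and the `[[...]]` highlight markers are illustrative assumptions rather than the patented method.

```python
from difflib import SequenceMatcher


def highlight_target_utterance(sentences, utterance):
    """Return the sentences with the one most similar to `utterance` wrapped in [[ ]].

    `sentences` is the target text split into sentences; `utterance` is the
    recognized speech as text. Similarity is a character-level ratio from
    the standard library, chosen here only for illustration.
    """
    def similarity(sentence):
        return SequenceMatcher(None, sentence.lower(), utterance.lower()).ratio()

    best = max(range(len(sentences)), key=lambda i: similarity(sentences[i]))
    return [f"[[{s}]]" if i == best else s for i, s in enumerate(sentences)]


script = [
    "Good morning everyone.",
    "Today we discuss speech recognition.",
    "Thank you for listening.",
]
for line in highlight_target_utterance(script, "today we discuss speech recognition"):
    print(line)
```

A real teleprompter-style display would change the styling of the matched sentence rather than add bracket markers; the brackets stand in for that differentiated display.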

    DATA PROCESSING METHOD, APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: US20240296830A1

    Publication Date: 2024-09-05

    Application No.: US18573126

    Filing Date: 2022-08-24

    CPC classification number: G10L15/02 G06V40/161 G10L15/08 G10L2015/088

    Abstract: The disclosure discloses a data processing method, apparatus, electronic device, and storage medium. The data processing method includes: collecting audio and video frame data associated with a target user, wherein the audio and video frame data includes voice information to be processed and a face image to be processed; processing the face image to be processed based on a target line-of-sight angle adjustment model to obtain a target face image corresponding to the face image to be processed; performing a following process on the voice information to be processed based on an audio content following method and determining a target sentence in a target text associated with the voice information to be processed; and displaying the target sentence and the target face image separately on clients associated with the target user, or displaying the target sentence and the target face image on a client associated with the target user together.
