SUBTITLE GENERATION METHOD, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM

    公开(公告)号:US20240371370A1

    公开(公告)日:2024-11-07

    申请号:US18573404

    申请日:2023-05-31

    Abstract: The present disclosure relates to a subtitle generation method, a subtitle generation apparatus, an electronic device, a storage medium and a program. The method includes: extracting audio data from a video to be processed, and performing speech recognition on the audio data to acquire text data corresponding to the audio data; acquiring a plurality of segmentation positions of the text data determined based on syntactic analysis, and acquiring pronunciation object information and timestamp information of audio segments corresponding to characters comprised in the text data; segmenting the text data to acquire a plurality of text segments according to the plurality of segmentation positions, the pronunciation object information and the timestamp information of the audio segments corresponding to the characters, wherein audio segments corresponding to characters in a text segment belong to a same pronunciation object, and a duration of a blank segment in the audio segments corresponding to the text segment is less than a preset duration; merging the plurality of text segments according to semantics of the text segments and the timestamp information of the audio segments corresponding to the characters to acquire a plurality of merged segments which have smooth semantics and meet a preset requirement of single subtitle length; and generating subtitle data corresponding to the video to be processed according to the plurality of merged segments.

Patent Agency Ranking