-
公开(公告)号:US20240371370A1
公开(公告)日:2024-11-07
申请号:US18573404
申请日:2023-05-31
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Xin ZHENG , Lelai DENG , Keyu CHEN
IPC: G10L15/187 , G10L15/04 , G10L15/18 , G10L25/57 , G11B27/036 , G11B27/34
Abstract: The present disclosure relates to a subtitle generation method, a subtitle generation apparatus, an electronic device, a storage medium and a program. The method includes: extracting audio data from a video to be processed, and performing speech recognition on the audio data to acquire text data corresponding to the audio data; acquiring a plurality of segmentation positions of the text data determined based on syntactic analysis, and acquiring pronunciation object information and timestamp information of audio segments corresponding to characters comprised in the text data; segmenting the text data to acquire a plurality of text segments according to the plurality of segmentation positions, the pronunciation object information and the timestamp information of the audio segments corresponding to the characters, wherein audio segments corresponding to characters in a text segment belong to a same pronunciation object, and a duration of a blank segment in the audio segments corresponding to the text segment is less than a preset duration; merging the plurality of text segments according to semantics of the text segments and the timestamp information of the audio segments corresponding to the characters to acquire a plurality of merged segments which have smooth semantics and meet a preset requirement of single subtitle length; and generating subtitle data corresponding to the video to be processed according to the plurality of merged segments.
-
公开(公告)号:US20240127860A1
公开(公告)日:2024-04-18
申请号:US18395118
申请日:2023-12-22
Applicant: Beijing Zitiao Network Technology Co., Ltd.
Inventor: Weiming ZHENG , Cheng LI , Xuelun FU , Yixiu HUANG , Rui XIA , Xin ZHENG , Lin BAO , Weisi WANG , Chen DING
IPC: G11B27/031 , G06F3/16 , G10L21/02
CPC classification number: G11B27/031 , G06F3/165 , G10L21/02
Abstract: Provided are an audio/video processing method and apparatus, a device, and a storage medium. The method comprises: displaying text data corresponding to an audio/video to be edited, wherein the text data has a mapping relation with an audio/video timestamp of said audio/video; displaying said audio/video according to a time axis track; in response to a preset operation triggered for target text data in the text data, determining an audio/video timestamp corresponding to the target text data as a target audio/video timestamp; and processing, on the basis of the preset operation, an audio/video clip corresponding to the target audio/video timestamp in said audio/video.
-