-
公开(公告)号:US20230230400A1
公开(公告)日:2023-07-20
申请号:US18001031
申请日:2021-09-09
Inventor: Jinlai LIU , Bin WEN , Changhu WANG
CPC classification number: G06V20/70 , G06F40/30 , G06V10/806 , G06V10/7715 , G06V20/62
Abstract: Provided are a label identification method and apparatus, a device, and a medium. The method includes: obtaining a target feature of a first image, in which the target feature characterizes a visual feature of the first image and a word feature of at least one label; and identifying a label of the first image from the at least one label based on the target feature. By characterizing the visual feature of the first image and the target feature of the word feature of the at least one label, the label of the first image is identified from the at least one label. Thus, identification accuracy of the label can be improved.
-
公开(公告)号:US20240233334A1
公开(公告)日:2024-07-11
申请号:US18563222
申请日:2022-04-26
Inventor: Jin XIA , Keyu WEN , Yuanyuan HUANG , Jie SHAO , Changhu WANG
CPC classification number: G06V10/7715 , G06T5/60 , G06V10/82
Abstract: The present disclosure relates to a multi-modal data retrieval method and apparatus, a medium, and an electronic device. The method includes: inputting target retrieval data into a first feature extraction network corresponding to a modality of the target retrieval data to acquire a data feature of the target retrieval data; inputting the data feature into a second feature extraction network corresponding to the modality of the target retrieval data to acquire a target retrieval feature corresponding to the target retrieval data, wherein second feature extraction networks respectively corresponding to modalities share a weight; and performing retrieval based on the target retrieval feature.
-
公开(公告)号:US20240112299A1
公开(公告)日:2024-04-04
申请号:US18255473
申请日:2021-12-01
Applicant: BEIJING YOUZHUJU NETWORK TECHNOLOGY CO. LTD.
Inventor: Hao WU , Changhu WANG
CPC classification number: G06T3/0012 , G06T3/4007
Abstract: This disclosure relates to a video cropping method and apparatus, storage medium, and electronic device. The present disclosure method: acquiring an original video to be cropped; performing frame extraction processing on the original video to obtain a plurality of target video frames; determining, for each of the target video frames, a target candidate cropping box corresponding to the target video frame according to a main content in the target video frame; performing interpolation processing according to the target candidate cropping box corresponding to each of the target video frames to determine a target cropping box corresponding to each frame picture in the original video; and cropping the original video according to the target cropping box corresponding to the each frame picture.
-
公开(公告)号:US20230394625A1
公开(公告)日:2023-12-07
申请号:US18253357
申请日:2021-11-16
Applicant: BEIJING YOUZHUJU NETWORK TECHNOLOGY CO. LTD.
Inventor: Hao WU , Yuntao MA , Changhu WANG
CPC classification number: G06T3/40 , G06T7/70 , G06V20/49 , G06V20/41 , G06V20/46 , G11B27/34 , G06T2207/10016 , G06T2207/30201
Abstract: The present disclosure relates to a video processing method and apparatus, a readable medium, and an electronic device, which relate to the technical field of image processing; the method includes: preprocessing a target video to obtain a plurality of target image frames of the target video; identifying a position of a designated object in each of the target image frames; and determining a reserved image frame from the target image frames based on the position of the designated object in each of the target image frames, the reserved image frame being used to indicate a cropping on image frames before the reserved image frame in the target video.
-
-
-