-
公开(公告)号:US11363271B2
公开(公告)日:2022-06-14
申请号:US17125370
申请日:2020-12-17
Inventor: Chao Li , Yukang Ding , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: H04N19/132 , H04N19/172 , G06K9/62 , G06N3/04
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
公开(公告)号:US11983849B2
公开(公告)日:2024-05-14
申请号:US17203437
申请日:2021-03-16
Inventor: Chao Li , Dongliang He , Fu Li , Hao Sun
IPC: G06T9/00 , G06F18/214 , G06T3/4046 , G06T5/00 , G06T11/40 , G06T11/60 , G06V10/44 , G06V10/774 , G06V10/82
CPC classification number: G06T5/005 , G06F18/214 , G06T3/4046 , G06T9/00 , G06T11/40 , G06T11/60 , G06V10/454 , G06V10/774 , G06V10/82
Abstract: An image filling method and apparatus, a device and a storage medium are disclosed. The image filling method includes: performing multilevel encoding processing on features of an image to be filled to generate multilevel encoded feature layers, sizes of the multilevel encoded feature layers being reduced layer by layer; performing layer-by-layer decoding processing on the multilevel encoded feature layers to obtain multilevel decoded feature layers and a first image, there being no missing region in the first image, wherein the layer-by-layer decoding processing includes a concatenation operation on a decoded feature layer and an encoded feature layer with a same size; and performing up-sampling processing on the first image to obtain multilevel up-sampled feature layers and a second image optimized by the up-sampling processing, the up-sampling processing including a concatenation operation on an up-sampled feature layer and a decoded feature layer with a same size.
-
公开(公告)号:US20210319062A1
公开(公告)日:2021-10-14
申请号:US17182960
申请日:2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/783 , G06K9/00 , G06K9/62
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
-
公开(公告)号:US11748895B2
公开(公告)日:2023-09-05
申请号:US17184379
申请日:2021-02-24
Inventor: Tianwei Lin , Xin Li , Fu Li , Dongliang He , Hao Sun , Henan Zhang
CPC classification number: G06T7/246 , G06F18/253 , G06N3/04 , G06V20/41 , G06V20/46
Abstract: A method and apparatus for processing a video frame are provided. The method may include: converting, using an optical flow generated based on a previous frame and a next frame of adjacent frames in a video, a feature map of the previous frame to obtain a converted feature map; determining, based on an error of the optical flow, a weight of the converted feature map, and obtaining a fused feature map based on a weighted result of a feature of the converted feature map and a feature of a feature map of the next frame; and updating the feature map of the next frame as the fused feature map.
-
公开(公告)号:US11734809B2
公开(公告)日:2023-08-22
申请号:US17174002
申请日:2021-02-11
Inventor: Xiang Long , Ping Wang , Zhichao Zhou , Fu Li , Dongliang He , Hao Sun
CPC classification number: G06T7/0002 , G06N3/04 , G06N3/08 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30168
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.
-
6.
公开(公告)号:US11615140B2
公开(公告)日:2023-03-28
申请号:US17144523
申请日:2021-01-08
Inventor: Xiang Long , Dongliang He , Fu Li , Xiang Zhao , Tianwei Lin , Hao Sun , Shilei Wen , Errui Ding
IPC: G06F16/738 , G06V20/40 , G06F18/214 , G06F18/25
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.
-
公开(公告)号:US20200320303A1
公开(公告)日:2020-10-08
申请号:US16905482
申请日:2020-06-18
Inventor: Dongliang He , Xiang Zhao , Jizhou Huang , Fu Li , Xiao Liu , Shilei Wen
IPC: G06K9/00 , G06F16/783 , G06N3/08
Abstract: A method and an apparatus for grounding a target video clip in a video are provided. The method includes: determining a current video clip in the video based on a current position; acquiring descriptive information indicative of a pre-generated target video clip descriptive feature, and executing a target video clip determining step which includes: determining current state information of the current video clip, wherein the current state information includes information indicative of a feature of the current video clip; generating a current action policy based on the descriptive information and the current state information, the current action policy being indicative of a position change of the current video clip in the video; the method further comprises: in response to reaching a preset condition, using a video clip resulting from executing the current action policy on the current video clip as the target video clip.
-
8.
公开(公告)号:US11810310B2
公开(公告)日:2023-11-07
申请号:US17335647
申请日:2021-06-01
Inventor: Dongliang He , Henan Zhang , Hao Sun
CPC classification number: G06T7/55 , G06N3/045 , G06N3/08 , G06T2207/10032 , G06T2207/20084 , G06T2207/30181
Abstract: The satellite image processing method includes: acquiring a first target satellite image; defogging the first target satellite image through a first neural network to acquire a first satellite image; and adjusting an image quality parameter of the first satellite image through a second neural network to acquire a second satellite image.
-
公开(公告)号:US11514676B2
公开(公告)日:2022-11-29
申请号:US17116578
申请日:2020-12-09
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Hao Sun
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.
-
公开(公告)号:US11490168B2
公开(公告)日:2022-11-01
申请号:US17026488
申请日:2020-09-21
Inventor: Fu Li , Dongliang He , Hao Sun
IPC: H04N21/234 , H04N21/439 , H04N21/44 , H04N21/466 , H04N21/81 , H04N21/658 , H04N21/8549
Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, perform following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.
-
-
-
-
-
-
-
-
-