-
Publication No.: US11734809B2
Publication date: 2023-08-22
Application No.: US17174002
Application date: 2021-02-11
Inventor: Xiang Long , Ping Wang , Zhichao Zhou , Fu Li , Dongliang He , Hao Sun
CPC classification number: G06T7/0002 , G06N3/04 , G06N3/08 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30168
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.
-
Publication No.: US11514676B2
Publication date: 2022-11-29
Application No.: US17116578
Application date: 2020-12-09
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Hao Sun
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.
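The alternation between detection frames and tracking frames described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the abstract does not say how a frame is classified or how the to-be-tracked set is updated, so the fixed `detect_every` interval, the IoU-based `merge_rois` rule, and the `detect` and `track` callables are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    ix = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    iy = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = ix * iy
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def merge_rois(detected: Sequence[Box], tracked: Sequence[Box],
               iou_threshold: float = 0.5) -> List[Box]:
    """Assumed update rule: keep every detection, plus any previously
    tracked ROI that no detection overlaps."""
    merged = list(detected)
    for t in tracked:
        if all(iou(t, d) < iou_threshold for d in detected):
            merged.append(t)
    return merged


def process_video(frames: Sequence[object],
                  detect: Callable[[object], List[Box]],
                  track: Callable[[object, List[Box]], List[Box]],
                  detect_every: int = 5) -> List[List[Box]]:
    """Return the ROIs for each frame.

    Assumption: every `detect_every`-th frame is a detection frame and
    all other frames are tracking frames.
    """
    to_track: List[Box] = []       # the to-be-tracked ROIs
    last_tracked: List[Box] = []   # result of the preceding tracking frame
    results: List[List[Box]] = []
    for index, frame in enumerate(frames):
        if index % detect_every == 0:
            # Detection frame: detect ROIs, then update the to-be-tracked set
            # from the detections and the previous tracking result.
            detected = detect(frame)
            to_track = merge_rois(detected, last_tracked)
            rois = list(detected)
        else:
            # Tracking frame: the tracking result is the frame's ROI set.
            last_tracked = track(frame, to_track)
            to_track = last_tracked
            rois = list(last_tracked)
        results.append(rois)
    return results
```

Running the expensive detector only on selected frames and a cheaper tracker on the rest is the usual motivation for this kind of split; the interval is a tunable trade-off between cost and drift.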
-
Publication No.: US11256920B2
Publication date: 2022-02-22
Application No.: US16830895
Application date: 2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
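The two-stage flow in this abstract (one model per modality, then fusion of the per-modality category information) might look roughly like the sketch below. It is a hedged illustration only: the abstract does not specify the fusion rule, so the weighted average used here, the `classify_video` name, and the modality names in the usage example are assumptions.

```python
from typing import Callable, Dict, List, Optional, Sequence


def classify_video(
    features: Dict[str, Sequence[float]],
    models: Dict[str, Callable[[Sequence[float]], List[float]]],
    weights: Optional[Dict[str, float]] = None,
) -> List[float]:
    """Fuse per-modality class scores into one score vector.

    `features` maps a modality name to its feature vector; `models` maps
    the same names to a callable returning per-class scores.
    Assumption: fusion is a weighted average of those scores.
    """
    # Stage 1: each modality's features go through that modality's own model.
    per_modality = {name: models[name](feat) for name, feat in features.items()}

    if weights is None:
        weights = {name: 1.0 for name in per_modality}
    total = sum(weights[name] for name in per_modality)

    # Stage 2: fuse the per-modality category scores.
    num_classes = len(next(iter(per_modality.values())))
    fused = [0.0] * num_classes
    for name, scores in per_modality.items():
        for i, score in enumerate(scores):
            fused[i] += weights[name] * score / total
    return fused


# Usage with two stand-in models (hypothetical modalities "rgb" and "audio").
scores = classify_video(
    {"rgb": [0.1, 0.2], "audio": [0.3]},
    {"rgb": lambda f: [0.8, 0.2], "audio": lambda f: [0.4, 0.6]},
)
predicted_class = max(range(len(scores)), key=scores.__getitem__)
```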
-
Publication No.: US20200374526A1
Publication date: 2020-11-26
Application No.: US16797911
Application date: 2020-02-21
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun
IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08
Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
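The frame-difference and histogram steps in this abstract can be illustrated with a small sketch. It assumes grayscale frames given as nested lists of 8-bit pixel values; the bin count, the normalization, and the averaging of per-pair histograms into one feature vector are illustrative choices the abstract does not specify, and the prediction model itself is left out.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # grayscale image, pixel values 0..255


def frame_difference(prev: Frame, curr: Frame) -> List[List[int]]:
    """Absolute per-pixel difference between two equally sized frames."""
    return [[abs(c - p) for p, c in zip(prev_row, curr_row)]
            for prev_row, curr_row in zip(prev, curr)]


def histogram(diff: Frame, bins: int = 8, max_value: int = 255) -> List[float]:
    """Normalized histogram of a frame-difference image."""
    counts = [0] * bins
    total = 0
    for row in diff:
        for value in row:
            index = min(value * bins // (max_value + 1), bins - 1)
            counts[index] += 1
            total += 1
    return [c / total for c in counts] if total else [0.0] * bins


def histogram_feature(frames: Sequence[Frame], bins: int = 8) -> List[float]:
    """Average the histograms of consecutive frame differences.

    Assumption: averaging is how per-image histograms become one feature.
    """
    hists = [histogram(frame_difference(a, b), bins)
             for a, b in zip(frames, frames[1:])]
    if not hists:
        return [0.0] * bins
    return [sum(h[i] for h in hists) / len(hists) for i in range(bins)]
```

In the flow the abstract describes, a vector like this would be combined with the video's attribute features and passed to the coding complexity prediction model.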
-
Publication No.: US11259029B2
Publication date: 2022-02-22
Application No.: US16797911
Application date: 2020-02-21
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun
IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08
Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
-
Publication No.: US20210019531A1
Publication date: 2021-01-21
Application No.: US16830895
Application date: 2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.