Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Zhichao Zhou"

1.

发明授权
Method and apparatus for processing image 有权

公开(公告)号：US11734809B2

公开(公告)日：2023-08-22

申请号：US17174002

申请日：2021-02-11

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiang Long , Ping Wang , Zhichao Zhou , Fu Li , Dongliang He , Hao Sun

IPC: G06T7/00 , G06N3/04 , G06N3/08

CPC classification number: G06T7/0002 , G06N3/04 , G06N3/08 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30168

Abstract: Embodiments of the present disclosure provide a method and apparatus for processing an image, and relates to the field of computer vision technology. The method may include: acquiring a value to be processed, where the value to be processed is associated with an image to be processed; and processing the value to be processed by using a quality scoring model to generate a score of the image to be processed in a target scoring domain, where the score of the image to be processed in the target scoring domain is related to an image quality of the image to be processed.

2.

发明授权
Method and apparatus for detecting region of interest in video, device and medium 有权

公开(公告)号：US11514676B2

公开(公告)日：2022-11-29

申请号：US17116578

申请日：2020-12-09

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhichao Zhou , Dongliang He , Fu Li , Hao Sun

IPC: G06V20/40 , G06T7/246 , G06V10/25 , G06V10/75

Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.

3.

发明授权
Method and apparatus for classifying video 有权

公开(公告)号：US11256920B2

公开(公告)日：2022-02-22

申请号：US16830895

申请日：2020-03-26

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding

IPC: G06K9/00 , G06K9/62 , G06N3/08

Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.

4.

发明申请
METHOD, DEVICE, APPARATUS FOR PREDICTING VIDEO CODING COMPLEXITY AND STORAGE MEDIUM 审中-公开

公开(公告)号：US20200374526A1

公开(公告)日：2020-11-26

申请号：US16797911

申请日：2020-02-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd

Inventor： Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun

IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08

Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.

5.

发明授权
Method, device, apparatus for predicting video coding complexity and storage medium 有权

公开(公告)号：US11259029B2

公开(公告)日：2022-02-22

申请号：US16797911

申请日：2020-02-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun

IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08

Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.

6.

发明申请
METHOD AND APPARATUS FOR CLASSIFYING VIDEO 有权

公开(公告)号：US20210019531A1

公开(公告)日：2021-01-21

申请号：US16830895

申请日：2020-03-26

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding

IPC: G06K9/00 , G06N3/08 , G06K9/62

Abstract: a method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.

Patent Agency Ranking