Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Dongliang He"

11.

Granted invention patent
Method and apparatus for classifying video (Status: active)

Publication (announcement) number: US11256920B2

Publication (announcement) date: 2022-02-22

Application number: US16830895

Application date: 2020-03-26

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Xiang Long, Dongliang He, Fu Li, Zhizhen Chi, Zhichao Zhou, Xiang Zhao, Ping Wang, Hao Sun, Shilei Wen, Errui Ding

IPC: G06K9/00, G06K9/62, G06N3/08

Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
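The late-fusion flow in this abstract can be illustrated with a minimal Python sketch. The abstract does not say how the per-modality category information is fused, so the simple averaging below is an assumption, and the names `fuse_category_scores` and `classify_video` and the callable-per-modality interface are hypothetical.

```python
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def fuse_category_scores(per_modal_scores: Dict[str, Vector]) -> List[float]:
    """Fuse per-modality category scores by averaging (an assumed rule)."""
    if not per_modal_scores:
        raise ValueError("need scores from at least one modality")
    rows = list(per_modal_scores.values())
    n_categories = len(rows[0])
    if any(len(row) != n_categories for row in rows):
        raise ValueError("all modalities must score the same categories")
    return [sum(row[i] for row in rows) / len(rows) for i in range(n_categories)]


def classify_video(
    features: Dict[str, Vector],
    models: Dict[str, Callable[[Vector], Vector]],
) -> int:
    """Run each modality's features through its own model, then fuse.

    `features` maps a modality name (for example "rgb" or "audio") to its
    feature vector; `models` maps the same names to hypothetical
    per-modality classifiers that return one score per category.
    """
    per_modal = {name: models[name](feat) for name, feat in features.items()}
    fused = fuse_category_scores(per_modal)
    # Index of the highest fused score is taken as the video's category.
    return max(range(len(fused)), key=fused.__getitem__)
```

A weighted average or a learned fusion layer would fit the same interface; averaging is used here only because it is the simplest choice.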

12.

Invention patent application
SATELLITE IMAGE PROCESSING METHOD, NETWORK TRAINING METHOD, RELATED DEVICES AND ELECTRONIC DEVICE (Status: active)

Publication (announcement) number: US20210295546A1

Publication (announcement) date: 2021-09-23

Application number: US17335647

Application date: 2021-06-01

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Dongliang He, Henan Zhang, Hao Sun

IPC: G06T7/55, G06N3/08, G06N3/04

Abstract: The satellite image processing method includes: acquiring a first target satellite image; defogging the first target satellite image through a first neural network to acquire a first satellite image; and adjusting an image quality parameter of the first satellite image through a second neural network to acquire a second satellite image.

13.

Invention patent application
METHOD, DEVICE, APPARATUS FOR PREDICTING VIDEO CODING COMPLEXITY AND STORAGE MEDIUM (Status: under examination, published)

Publication (announcement) number: US20200374526A1

Publication (announcement) date: 2020-11-26

Application number: US16797911

Application date: 2020-02-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd

Inventor: Zhichao Zhou, Dongliang He, Fu Li, Xiang Zhao, Xin Li, Zhizhen Chi, Xiang Long, Hao Sun

IPC: H04N19/14, H04N19/12, H04N19/625, G06N3/08

Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
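The frame-difference histogram step in this abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: frames are assumed to be grayscale grids of 0 to 255 values, the bin count of 8 is arbitrary, and averaging the per-image histograms into one feature is an assumed aggregation. All function names are hypothetical.

```python
from typing import List, Sequence

Frame = Sequence[Sequence[int]]  # assumed: grayscale pixels in 0..255


def frame_difference(a: Frame, b: Frame) -> List[List[int]]:
    """Absolute per-pixel difference between two equally sized frames."""
    return [[abs(pa - pb) for pa, pb in zip(ra, rb)] for ra, rb in zip(a, b)]


def histogram(image: Frame, bins: int = 8) -> List[float]:
    """Normalized histogram of pixel values over `bins` equal-width bins."""
    counts = [0] * bins
    total = 0
    for row in image:
        for value in row:
            counts[min(value * bins // 256, bins - 1)] += 1
            total += 1
    return [c / total for c in counts] if total else [0.0] * bins


def frame_difference_histogram_feature(
    frames: Sequence[Frame], bins: int = 8
) -> List[float]:
    """Average the histograms of all consecutive frame-difference images."""
    diffs = [frame_difference(a, b) for a, b in zip(frames, frames[1:])]
    if not diffs:
        raise ValueError("need at least two frames")
    hists = [histogram(d, bins) for d in diffs]
    return [sum(h[i] for h in hists) / len(hists) for i in range(bins)]
```

The resulting vector would be one of the several features that the abstract says are passed to the coding complexity prediction model; the model itself is not sketched here.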

14.

Granted invention patent
Method and apparatus for identifying video (Status: active)

Publication (announcement) number: US11657612B2

Publication (announcement) date: 2023-05-23

Application number: US17194131

Application date: 2021-03-05

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Dongliang He, Xiao Tan, Shilei Wen, Hao Sun

IPC: G06V20/40, G06F18/2411

CPC classification number: G06V20/40, G06F18/2411, G06V20/41, G06V20/46, G06V20/48

Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying a video. A specific embodiment of the method includes: acquiring a predetermined number of video frames from a video to be identified to obtain a video frame sequence; performing the following processing step: importing the video frame sequence into a pre-trained video identification model to obtain a classification tag probability corresponding to the video frame sequence, wherein the classification tag probability is used to characterize a probability of identifying a corresponding tag category of the video to be identified; and setting, in response to the classification tag probability being greater than or equal to a preset identification accuracy threshold, a video tag for the video to be identified according to the classification tag probability, or else increasing the number of video frames in the video frame sequence and continuing to perform the above processing step.
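The retry loop in this abstract (classify, compare against a threshold, add frames and repeat) can be sketched in Python. The abstract does not specify the sampling scheme, how fast the frame count grows, or what happens once every frame has been used, so even spacing, doubling, and returning `None` are all assumptions here. The `model` callable is a hypothetical stand-in for the pre-trained video identification model.

```python
from typing import Callable, List, Optional, Sequence, Tuple


def sample_frames(video: Sequence, count: int) -> List:
    """Pick `count` evenly spaced frames (assumed sampling scheme)."""
    count = max(1, min(count, len(video)))
    step = len(video) / count
    return [video[int(i * step)] for i in range(count)]


def identify_video(
    video: Sequence,
    model: Callable[[List], Tuple[str, float]],
    threshold: float,
    initial_frames: int = 2,
    growth: int = 2,
) -> Optional[Tuple[str, float]]:
    """Return (tag, probability) once the model is confident enough.

    The frame count is multiplied by `growth` after each low-confidence
    pass. Returning None when all frames are exhausted is an assumption;
    the abstract does not cover that case.
    """
    if not video:
        return None
    count = initial_frames
    while True:
        frames = sample_frames(video, count)
        tag, probability = model(frames)
        if probability >= threshold:
            return tag, probability
        if len(frames) >= len(video):
            return None  # no more frames to add
        count = min(count * growth, len(video))
```

Starting with few frames keeps the cost low for videos that are easy to identify, and only harder videos pay for the longer sequences.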

15.

Granted invention patent
Method and apparatus for searching video segment, device, and medium (Status: active)

Publication (announcement) number: US11625433B2

Publication (announcement) date: 2023-04-11

Application number: US17182960

Application date: 2021-02-23

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Xiang Long, Ping Wang, Fu Li, Dongliang He, Hao Sun, Shilei Wen

IPC: G06F16/30, G06F16/783, G06V20/40, G06F18/22

Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.

16.

Granted invention patent
Method and apparatus for detecting temporal action of video, electronic device and storage medium (Status: active)

Publication (announcement) number: US11600069B2

Publication (announcement) date: 2023-03-07

Application number: US17144205

Application date: 2021-01-08

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor: Tianwei Lin, Xin Li, Dongliang He, Fu Li, Hao Sun, Shilei Wen, Errui Ding

IPC: G06K9/00, G06K9/62, G06V20/40

Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.

17.

Granted invention patent
Method and apparatus for grounding a target video clip in a video (Status: active)

Publication (announcement) number: US11410422B2

Publication (announcement) date: 2022-08-09

Application number: US16905482

Application date: 2020-06-18

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Dongliang He, Xiang Zhao, Jizhou Huang, Fu Li, Xiao Liu, Shilei Wen

IPC: G06V20/40, G06N3/00, G06V10/40, G06F16/783, G06N3/08, G06N3/04, G06F16/00, G06K9/62

Abstract: A method and an apparatus for grounding a target video clip in a video are provided. The method includes: determining a current video clip in the video based on a current position; acquiring descriptive information indicative of a pre-generated target video clip descriptive feature, and executing a target video clip determining step which includes: determining current state information of the current video clip, wherein the current state information includes information indicative of a feature of the current video clip; generating a current action policy based on the descriptive information and the current state information, the current action policy being indicative of a position change of the current video clip in the video; the method further comprises: in response to reaching a preset condition, using a video clip resulting from executing the current action policy on the current video clip as the target video clip.

18.

Granted invention patent
Method, device, apparatus for predicting video coding complexity and storage medium (Status: active)

Publication (announcement) number: US11259029B2

Publication (announcement) date: 2022-02-22

Application number: US16797911

Application date: 2020-02-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Zhichao Zhou, Dongliang He, Fu Li, Xiang Zhao, Xin Li, Zhizhen Chi, Xiang Long, Hao Sun

IPC: H04N19/14, H04N19/12, H04N19/625, G06N3/08

Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.

19.

Invention patent application
METHOD AND APPARATUS FOR PROCESSING VIDEO FRAME (Status: active)

Publication (announcement) number: US20210334579A1

Publication (announcement) date: 2021-10-28

Application number: US17184379

Application date: 2021-02-24

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Tianwei LIN, Xin Li, Fu Li, Dongliang He, Hao Sun, Henan Zhang

IPC: G06K9/62, G06N3/04, G06K9/00

Abstract: A method and apparatus for processing a video frame are provided. The method may include: converting, using an optical flow generated based on a previous frame and a next frame of adjacent frames in a video, a feature map of the previous frame to obtain a converted feature map; determining, based on an error of the optical flow, a weight of the converted feature map, and obtaining a fused feature map based on a weighted result of a feature of the converted feature map and a feature of a feature map of the next frame; and updating the feature map of the next frame as the fused feature map.

20.

Invention patent application
METHOD AND APPARATUS FOR CLASSIFYING VIDEO (Status: active)

Publication (announcement) number: US20210019531A1

Publication (announcement) date: 2021-01-21

Application number: US16830895

Application date: 2020-03-26

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor: Xiang Long, Dongliang He, Fu Li, Zhizhen Chi, Zhichao Zhou, Xiang Zhao, Ping Wang, Hao Sun, Shilei Wen, Errui Ding

IPC: G06K9/00, G06N3/08, G06K9/62

Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
