Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page LTD.") AND inv:"Hao Sun"

21.

发明授权
Method and apparatus for searching video segment, device, and medium 有权

公开(公告)号：US11625433B2

公开(公告)日：2023-04-11

申请号：US17182960

申请日：2021-02-23

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen

IPC: G06F16/30 , G06F16/783 , G06V20/40 , G06F18/22

Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.

22.

发明授权
Vehicle information detection method, electronic device and storage medium 有权

公开(公告)号：US11615605B2

公开(公告)日：2023-03-28

申请号：US17349055

申请日：2021-06-16

Applicant: Beijing Baidu Netcom Science and Technology Co., LTD

Inventor： Xiaoqing Ye , Xiao Tan , Hao Sun

IPC: G06K9/00 , G06V10/40 , G06T7/50 , G06K9/62 , G06T17/00

Abstract: A vehicle information detection method, an electronic device and a storage medium are provided, and relates to the technical field of artificial intelligence, in particular to the technical field of computer vision and deep learning. The method includes: determining a bird's-eye view of a target vehicle based on an image of the target vehicle; performing feature extraction on the image of the target vehicle and the bird's-eye view respectively, to obtain first feature information corresponding to the image of the target vehicle and second feature information corresponding to the bird's-eye view of the target vehicle; and determining three-dimensional information of the target vehicle based on the first feature information and the second feature information. According to embodiments of the disclosure, accurate detection of vehicle information can be realized based on a monocular image.

23.

发明授权
Method and apparatus for detecting temporal action of video, electronic device and storage medium 有权

公开(公告)号：US11600069B2

公开(公告)日：2023-03-07

申请号：US17144205

申请日：2021-01-08

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Tianwei Lin , Xin Li , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Errui Ding

IPC: G06K9/00 , G06K9/62 , G06V20/40

Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relates to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plural temporal anchor boxes according to the explicit features and the implicit features of the plural temporal anchor boxes.

24.

发明授权
Three-dimensional object detection method, electronic device and readable storage medium 有权

公开(公告)号：US11587338B2

公开(公告)日：2023-02-21

申请号：US17211491

申请日：2021-03-24

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiaoqing Ye , Xiao Tan , Hao Sun , Hongwu Zhang

IPC: G06K9/00 , G06V20/64 , G06T7/50 , G06T7/73 , G06T3/00 , G06T7/60

Abstract: The present disclosure provides a three-dimensional (3D) object detection method, a 3D object detection apparatus, an electronic device, and a readable storage medium, belonging to a field of computer vision technologies. Two-dimensional (2D) image parameters and initial 3D image parameters are determined for a target object. Candidate 3D image parameters are determined for the target object based on a disturbance range of 3D parameters and the initial 3D image parameters determined for the target object. Target 3D image parameters are selected for the target object from the candidate 3D image parameters determined for the target object based on the 2D image parameters. A 3D detection result of the target object is determined based on the target 3D image parameters.

25.

发明授权
Method and apparatus for generating virtual avatar 有权

公开(公告)号：US11455765B2

公开(公告)日：2022-09-27

申请号：US17182604

申请日：2021-02-23

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiang Long , Xin Li , Henan Zhang , Hao Sun

IPC: G06T13/40 , G06T19/20 , G06N3/04

Abstract: A method and apparatus for generating a virtual avatar are provided. The method may include: acquiring a first avatar, and determining an expression parameter of the first avatar, where the expression parameter of the first avatar including an expression parameter of at least one of five sense organs; and determining, based on the expression parameter of at least one of the five sense organs, a target virtual avatar that is associated with an attribute of the first avatar and has an expression of the first avatar.

26.

发明授权
Method, device, apparatus for predicting video coding complexity and storage medium 有权

公开(公告)号：US11259029B2

公开(公告)日：2022-02-22

申请号：US16797911

申请日：2020-02-21

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Zhichao Zhou , Dongliang He , Fu Li , Xiang Zhao , Xin Li , Zhizhen Chi , Xiang Long , Hao Sun

IPC: H04N19/14 , H04N19/12 , H04N19/625 , G06N3/08

Abstract: A method, device, apparatus for predicting a video coding complexity and a computer-readable storage medium are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of the first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.

27.

发明申请
METHOD AND APPARATUS FOR TRACKING TARGET 有权

公开(公告)号：US20210334985A1

公开(公告)日：2021-10-28

申请号：US17181800

申请日：2021-02-22

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiangbo SU , Yuchen Yuan , Hao Sun

IPC: G06T7/246 , G06K9/32 , G06T7/73

Abstract: A method and apparatus for tracking a target are provided. The method may include: generating a position of a candidate box of a to-be-tracked target in a to-be-processed image; determining, for a pixel in the to-be-processed image, a probability that each anchor box of at least one anchor box arranged for the pixel includes the to-be-tracked target, and determining a deviation of the candidate box corresponding to the anchor box relative to the anchor box; determining candidate positions of the to-be-tracked target corresponding to the at least two anchor boxes respectively; and combining at least two candidate positions among the determined candidate positions to obtain a position of the to-be-tracked target in the to-be-processed image.

28.

发明申请
METHOD AND APPARATUS FOR PROCESSING VIDEO FRAME 有权

公开(公告)号：US20210334579A1

公开(公告)日：2021-10-28

申请号：US17184379

申请日：2021-02-24

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Tianwei LIN , Xin Li , Fu Li , Dongliang He , Hao Sun , Henan Zhang

IPC: G06K9/62 , G06N3/04 , G06K9/00

Abstract: A method and apparatus for processing a video frame are provided. The method may include: converting, using an optical flow generated based on a previous frame and a next frame of adjacent frames in a video, a feature map of the previous frame to obtain a converted feature map; determining, based on an error of the optical flow, a weight of the converted feature map, and obtaining a fused feature map based on a weighted result of a feature of the converted feature map and a feature of a feature map of the next frame; and updating the feature map of the next frame as the fused feature map.

29.

发明申请
METHOD AND APPARATUS FOR GENERATING POSITION INFORMATION, DEVICE, AND MEDIUM 有权

公开(公告)号：US20210319579A1

公开(公告)日：2021-10-14

申请号：US17179456

申请日：2021-02-19

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiaoqing Ye , Xiao Tan , Hao Sun , Hongwu Zhang

IPC: G06T7/593 , G06T7/73 , G06K9/62 , G06K9/00

Abstract: Embodiments of the present disclosure disclose a method and apparatus for generating position information, a device and a medium. A specific embodiment of the method includes: acquiring an image and vehicle position information, wherein the image includes a target element; inputting the image into a pre-established depth map generation model to obtain a first depth map, wherein the focal length of sample images of sample data used during the training of the model is a sample focal length; generating a second depth map based on the sample focal length, the first depth map, and an estimated focal length of the image; determining depth information of the target element according to element position information of the target element in the image and the second depth map; and generating position information of the target element based on the vehicle position information and the depth information of the target element.

30.

发明申请
HUMAN BODY IDENTIFICATION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210312172A1

公开(公告)日：2021-10-07

申请号：US17353324

申请日：2021-06-21

Applicant: Beijing Baidu Netcom Science and Technology Co., LTD

Inventor： Zipeng Lu , Jian Wang , Yuchen Yuan , Hao Sun , Errui Ding

IPC: G06K9/00 , G06K9/32 , G06K9/62

Abstract: A human body identification method, an electronic device and a storage medium, related to the technical field of artificial intelligence such as computer vision and deep learning, are provided. The method includes: inputting an image to be identified into a human body detection model, to obtain a plurality of preselected detection boxes; identifying a plurality of key points from each of the preselected detection boxes respectively according to a human body key point detection model, and obtaining a key point score of each of the key points; determining a target detection box from each of the preselected detection boxes, according to a number of the key points whose key point scores meet a key point threshold; and inputting the target detection box into a human body key point classification model, to obtain a human body identification result for the image to be identified.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification