-
Publication No.: US11625433B2
Publication Date: 2023-04-11
Application No.: US17182960
Filing Date: 2021-02-23
Inventor: Xiang Long, Ping Wang, Fu Li, Dongliang He, Hao Sun, Shilei Wen
IPC: G06F16/30, G06F16/783, G06V20/40, G06F18/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
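The matching step in this abstract can be illustrated with a minimal sketch. It assumes each sampled frame is already reduced to a feature vector, uses cosine similarity as the per-frame measure, and takes the mean similarity over a sliding window as the "degree of matching". None of these choices is stated in the abstract, and the names `cosine` and `best_matching_segment` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def best_matching_segment(target_feats, library_feats):
    """Slide a window as long as the target over the candidate video's
    sampled-frame features; return (start_index, mean_similarity).

    Assumption: the degree of matching is the mean per-frame cosine
    similarity. Returns (-1, -1.0) if the candidate is shorter than the target.
    """
    n = len(target_feats)
    best_start, best_score = -1, -1.0
    for start in range(len(library_feats) - n + 1):
        window = library_feats[start:start + n]
        score = sum(cosine(t, w) for t, w in zip(target_feats, window)) / n
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score


# Usage: a two-frame target located inside a four-frame candidate video.
target = [[1.0, 0.0], [0.0, 1.0]]
candidate = [[1.0, 1.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
start, score = best_matching_segment(target, candidate)
```

A fixed-length window is the simplest possible candidate-segment generator; segments of varying length would need a different enumeration.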
-
Publication No.: US11615605B2
Publication Date: 2023-03-28
Application No.: US17349055
Filing Date: 2021-06-16
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun
Abstract: A vehicle information detection method, an electronic device and a storage medium are provided, which relate to the technical field of artificial intelligence, in particular to the technical fields of computer vision and deep learning. The method includes: determining a bird's-eye view of a target vehicle based on an image of the target vehicle; performing feature extraction on the image of the target vehicle and the bird's-eye view respectively, to obtain first feature information corresponding to the image of the target vehicle and second feature information corresponding to the bird's-eye view of the target vehicle; and determining three-dimensional information of the target vehicle based on the first feature information and the second feature information. According to embodiments of the disclosure, accurate detection of vehicle information can be realized based on a monocular image.
-
Publication No.: US11600069B2
Publication Date: 2023-03-07
Application No.: US17144205
Filing Date: 2021-01-08
Inventor: Tianwei Lin, Xin Li, Dongliang He, Fu Li, Hao Sun, Shilei Wen, Errui Ding
Abstract: A method and apparatus for detecting a temporal action of a video, an electronic device and a storage medium are disclosed, which relate to the field of video processing technologies. An implementation includes: acquiring an initial temporal feature sequence of a video to be detected; acquiring, by a pre-trained video-temporal-action detecting module, implicit features and explicit features of a plurality of configured temporal anchor boxes based on the initial temporal feature sequence; and acquiring, by the video-temporal-action detecting module, the starting position and the ending position of a video clip containing a specified action, the category of the specified action and the probability that the specified action belongs to the category from the plurality of temporal anchor boxes according to the explicit features and the implicit features of the plurality of temporal anchor boxes.
-
Publication No.: US11587338B2
Publication Date: 2023-02-21
Application No.: US17211491
Filing Date: 2021-03-24
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: The present disclosure provides a three-dimensional (3D) object detection method, a 3D object detection apparatus, an electronic device, and a readable storage medium, belonging to a field of computer vision technologies. Two-dimensional (2D) image parameters and initial 3D image parameters are determined for a target object. Candidate 3D image parameters are determined for the target object based on a disturbance range of 3D parameters and the initial 3D image parameters determined for the target object. Target 3D image parameters are selected for the target object from the candidate 3D image parameters determined for the target object based on the 2D image parameters. A 3D detection result of the target object is determined based on the target 3D image parameters.
-
Publication No.: US11455765B2
Publication Date: 2022-09-27
Application No.: US17182604
Filing Date: 2021-02-23
Inventor: Xiang Long, Xin Li, Henan Zhang, Hao Sun
Abstract: A method and apparatus for generating a virtual avatar are provided. The method may include: acquiring a first avatar, and determining an expression parameter of the first avatar, where the expression parameter of the first avatar includes an expression parameter of at least one of five sense organs; and determining, based on the expression parameter of the at least one of the five sense organs, a target virtual avatar that is associated with an attribute of the first avatar and has an expression of the first avatar.
-
Publication No.: US11259029B2
Publication Date: 2022-02-22
Application No.: US16797911
Filing Date: 2020-02-21
Inventor: Zhichao Zhou, Dongliang He, Fu Li, Xiang Zhao, Xin Li, Zhizhen Chi, Xiang Long, Hao Sun
IPC: H04N19/14, H04N19/12, H04N19/625, G06N3/08
Abstract: A method, device, and apparatus for predicting video coding complexity, and a computer-readable storage medium, are provided. The method includes: acquiring an attribute feature of a target video; extracting a plurality of first target image frames from the target video; performing a frame difference calculation on the plurality of first target image frames, to acquire a plurality of first frame difference images; determining a histogram feature for frame difference images of the target video according to a statistical histogram of each first frame difference image; and inputting a plurality of features of the target video into a coding complexity prediction model to acquire a coding complexity prediction value of the target video. Through the above method, the BPP prediction value can be acquired intelligently.
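The frame-difference and histogram steps can be sketched as follows. This is only an illustration under stated assumptions: frames are small grayscale images given as nested lists of integers in 0..255, the bin count is arbitrary, and the per-difference histograms are simply averaged into one feature vector. The abstract does not specify any of these details, and the function names are hypothetical.

```python
def frame_difference(frame_a, frame_b):
    """Absolute per-pixel difference of two equally sized grayscale frames."""
    return [[abs(a - b) for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]


def histogram(image, bins=4, max_value=256):
    """Normalised histogram of integer pixel values in [0, max_value)."""
    counts = [0] * bins
    total = 0
    for row in image:
        for value in row:
            counts[min(value * bins // max_value, bins - 1)] += 1
            total += 1
    return [c / total for c in counts] if total else [0.0] * bins


def histogram_feature(frames, bins=4):
    """Average the histograms of consecutive frame-difference images.

    Assumption: averaging is one way to aggregate; needs at least two frames.
    """
    if len(frames) < 2:
        raise ValueError("at least two frames are required")
    diffs = [frame_difference(a, b) for a, b in zip(frames, frames[1:])]
    hists = [histogram(d, bins) for d in diffs]
    return [sum(h[i] for h in hists) / len(hists) for i in range(bins)]


# Usage: three 2x2 frames yield two difference images and one feature vector.
frames = [
    [[0, 0], [0, 0]],
    [[0, 255], [0, 0]],
    [[0, 255], [128, 0]],
]
feature = histogram_feature(frames)
```

In the described method this vector would be one of several inputs, alongside the attribute feature, to the prediction model; the model itself is outside the scope of this sketch.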
-
Publication No.: US20210334985A1
Publication Date: 2021-10-28
Application No.: US17181800
Filing Date: 2021-02-22
Inventor: Xiangbo Su, Yuchen Yuan, Hao Sun
Abstract: A method and apparatus for tracking a target are provided. The method may include: generating a position of a candidate box of a to-be-tracked target in a to-be-processed image; determining, for a pixel in the to-be-processed image, a probability that each anchor box of at least one anchor box arranged for the pixel includes the to-be-tracked target, and determining a deviation of the candidate box corresponding to the anchor box relative to the anchor box; determining candidate positions of the to-be-tracked target corresponding to the at least two anchor boxes respectively; and combining at least two candidate positions among the determined candidate positions to obtain a position of the to-be-tracked target in the to-be-processed image.
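The final combining step can be illustrated with a short sketch. It assumes each candidate position is an `(x, y, w, h)` box paired with the probability of its anchor box, keeps the `top_k` most probable candidates, and merges them by a probability-weighted average. The abstract only says that at least two candidate positions are combined, so the weighting scheme and the name `combine_candidates` are assumptions.

```python
def combine_candidates(candidates, top_k=2):
    """Merge the top_k most probable candidate boxes into one position.

    candidates: list of (probability, (x, y, w, h)) pairs.
    Assumption: a probability-weighted average of box coordinates.
    """
    top = sorted(candidates, key=lambda c: c[0], reverse=True)[:top_k]
    total = sum(p for p, _ in top)
    if total == 0:
        raise ValueError("candidate probabilities sum to zero")
    return tuple(sum(p * box[i] for p, box in top) / total for i in range(4))


# Usage: the low-probability outlier is dropped before averaging.
candidates = [
    (0.75, (0.0, 0.0, 10.0, 10.0)),
    (0.25, (4.0, 4.0, 10.0, 10.0)),
    (0.10, (100.0, 100.0, 1.0, 1.0)),
]
position = combine_candidates(candidates)
```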
-
Publication No.: US20210334579A1
Publication Date: 2021-10-28
Application No.: US17184379
Filing Date: 2021-02-24
Inventor: Tianwei Lin, Xin Li, Fu Li, Dongliang He, Hao Sun, Henan Zhang
Abstract: A method and apparatus for processing a video frame are provided. The method may include: converting, using an optical flow generated based on a previous frame and a next frame of adjacent frames in a video, a feature map of the previous frame to obtain a converted feature map; determining, based on an error of the optical flow, a weight of the converted feature map, and obtaining a fused feature map based on a weighted result of a feature of the converted feature map and a feature of a feature map of the next frame; and updating the feature map of the next frame as the fused feature map.
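The weighted fusion described here might look like the sketch below. The abstract does not say how the weight depends on the optical-flow error, so this sketch assumes an exponential decay, `w = exp(-scale * error)`, chosen only because it falls from 1 toward 0 as the error grows. It also assumes a single scalar error per frame and feature maps given as nested lists; `fuse_feature_maps` and `scale` are hypothetical names.

```python
import math


def fuse_feature_maps(converted, next_map, flow_error, scale=1.0):
    """Blend the flow-converted previous-frame features with the next frame's.

    Assumption: weight w = exp(-scale * flow_error), so a large flow error
    reduces the contribution of the converted feature map.
    """
    w = math.exp(-scale * flow_error)
    return [[w * c + (1.0 - w) * n for c, n in zip(row_c, row_n)]
            for row_c, row_n in zip(converted, next_map)]


# Usage: with an error of ln(2) the two maps are weighted equally.
converted = [[2.0, 0.0]]
next_map = [[4.0, 8.0]]
fused = fuse_feature_maps(converted, next_map, flow_error=math.log(2))
```

A per-pixel error map would give a per-pixel weight instead of one scalar; the same blending formula applies elementwise.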
-
Publication No.: US20210319579A1
Publication Date: 2021-10-14
Application No.: US17179456
Filing Date: 2021-02-19
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: Embodiments of the present disclosure disclose a method and apparatus for generating position information, a device and a medium. A specific embodiment of the method includes: acquiring an image and vehicle position information, wherein the image includes a target element; inputting the image into a pre-established depth map generation model to obtain a first depth map, wherein the focal length of sample images of sample data used during the training of the model is a sample focal length; generating a second depth map based on the sample focal length, the first depth map, and an estimated focal length of the image; determining depth information of the target element according to element position information of the target element in the image and the second depth map; and generating position information of the target element based on the vehicle position information and the depth information of the target element.
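One plausible reading of the second-depth-map step is a focal-length correction. Under a pinhole camera model, an object of a given apparent size lies proportionally farther away when the focal length is longer, so a depth predicted by a model trained at the sample focal length can be rescaled by the ratio of the estimated focal length to the sample focal length. The abstract does not give the formula, so the proportional scaling below is an assumption, as are the function names.

```python
def rescale_depth_map(first_depth_map, sample_focal, estimated_focal):
    """Scale each depth by estimated_focal / sample_focal.

    Assumption: pinhole-camera proportionality between depth and focal
    length for a fixed apparent object size.
    """
    if sample_focal <= 0 or estimated_focal <= 0:
        raise ValueError("focal lengths must be positive")
    ratio = estimated_focal / sample_focal
    return [[depth * ratio for depth in row] for row in first_depth_map]


def depth_at(depth_map, x, y):
    """Look up the depth of a target element at pixel column x, row y."""
    return depth_map[y][x]


# Usage: the model was trained at focal length 1000, the image was taken at 1500.
second = rescale_depth_map([[10.0, 20.0]], sample_focal=1000.0, estimated_focal=1500.0)
target_depth = depth_at(second, x=1, y=0)
```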
-
Publication No.: US20210312172A1
Publication Date: 2021-10-07
Application No.: US17353324
Filing Date: 2021-06-21
Inventor: Zipeng Lu, Jian Wang, Yuchen Yuan, Hao Sun, Errui Ding
Abstract: A human body identification method, an electronic device and a storage medium, related to the technical field of artificial intelligence such as computer vision and deep learning, are provided. The method includes: inputting an image to be identified into a human body detection model, to obtain a plurality of preselected detection boxes; identifying a plurality of key points from each of the preselected detection boxes respectively according to a human body key point detection model, and obtaining a key point score of each of the key points; determining a target detection box from each of the preselected detection boxes, according to a number of the key points whose key point scores meet a key point threshold; and inputting the target detection box into a human body key point classification model, to obtain a human body identification result for the image to be identified.
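The box-selection step can be sketched as a simple filter. The sketch assumes each preselected detection box carries a list of key point scores and is kept as a target box when enough of those scores reach the key point threshold. The dictionary layout, the default values, and the name `select_target_boxes` are hypothetical; the abstract specifies neither the threshold nor the required count.

```python
def select_target_boxes(boxes, keypoint_threshold=0.5, min_keypoints=3):
    """Keep boxes with at least min_keypoints scores >= keypoint_threshold.

    boxes: list of dicts with a "keypoint_scores" list (hypothetical layout).
    """
    targets = []
    for box in boxes:
        confident = sum(1 for s in box["keypoint_scores"] if s >= keypoint_threshold)
        if confident >= min_keypoints:
            targets.append(box)
    return targets


# Usage: only the first box has three key points at or above the threshold.
boxes = [
    {"id": "a", "keypoint_scores": [0.9, 0.8, 0.7, 0.1]},
    {"id": "b", "keypoint_scores": [0.9, 0.2, 0.1, 0.1]},
]
selected_ids = [box["id"] for box in select_target_boxes(boxes)]
```

The selected boxes would then be passed to the classification model, which this sketch does not cover.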