-
公开(公告)号:US11610389B2
公开(公告)日:2023-03-21
申请号:US17201665
申请日:2021-03-15
Inventor: Jian Wang , Zipeng Lu , Hao Sun , Hongwu Zhang , Shilei Wen , Errui Ding
IPC: G06T7/73 , G06V10/46 , G06N3/04 , G06F18/213 , G06V10/764 , G06V10/82 , G06V40/20
Abstract: A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
-
公开(公告)号:US11557062B2
公开(公告)日:2023-01-17
申请号:US17172883
申请日:2021-02-10
Inventor: Xiaoqing Ye , Xiao Tan , Hao Sun , Hongwu Zhang
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a video frame, and relates to the field of computer vision technology. The method may include: acquiring a plurality of candidate first-order radial distortion parameters preset for a to-be-processed video frame, and acquiring a specified value of a specified radial distortion parameter; performing radial distortion correction on the to-be-processed video frame to obtain a first initial corrected video frame; selecting a first initial corrected video frame in which a local region except for a center region after distortion correction includes a largest number of straight line segments; and determining a candidate first-order radial distortion parameter corresponding to the selected first initial corrected video frame for use as a target first-order radial distortion parameter of the to-be-processed video frame.
-
公开(公告)号:US11521331B2
公开(公告)日:2022-12-06
申请号:US17179456
申请日:2021-02-19
Inventor: Xiaoqing Ye , Xiao Tan , Hao Sun , Hongwu Zhang
Abstract: Embodiments of the present disclosure disclose a method and apparatus for generating position information, a device and a medium. A specific embodiment of the method includes: acquiring an image and vehicle position information, wherein the image includes a target element; inputting the image into a pre-established depth map generation model to obtain a first depth map, wherein the focal length of sample images of sample data used during the training of the model is a sample focal length; generating a second depth map based on the sample focal length, the first depth map, and an estimated focal length of the image; determining depth information of the target element according to element position information of the target element in the image and the second depth map; and generating position information of the target element based on the vehicle position information and the depth information of the target element.
-
公开(公告)号:US11514676B2
公开(公告)日:2022-11-29
申请号:US17116578
申请日:2020-12-09
Inventor: Zhichao Zhou , Dongliang He , Fu Li , Hao Sun
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest in a video, a device and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; detecting a region of interest (ROI) in the current to-be-processed frame, in response to determining that the current to-be-processed frame is a detection picture frame, to determine at least one ROI in the current to-be-processed frame; and updating a to-be-tracked ROI, based on the ROI in the current to-be-processed frame and a tracking result determined by a pre-order tracking picture frame; and tracking the current to-be-processed frame based on the existing to-be-tracked ROI, in response to determining that the current to-be-processed frame is a tracking picture frame, to determine at least one tracking result as the ROI of the current to-be-processed frame.
-
公开(公告)号:US11514263B2
公开(公告)日:2022-11-29
申请号:US16869024
申请日:2020-05-07
Inventor: Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding
IPC: G06K9/62 , G06N3/04 , G06N3/08 , G06V10/44 , G06V10/80 , G06V10/82 , G06V40/16 , G06V10/26 , G06V10/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for processing an image. A specific embodiment of the method includes: acquiring a feature map of a target image, where the target image contains a target object; determining a local feature map of a target size in the feature map; combining features of different channels in the local feature map to obtain a local texture feature map; and obtaining location information of the target object based on the local texture feature map.
-
公开(公告)号:US11490168B2
公开(公告)日:2022-11-01
申请号:US17026488
申请日:2020-09-21
Inventor: Fu Li , Dongliang He , Hao Sun
IPC: H04N21/234 , H04N21/439 , H04N21/44 , H04N21/466 , H04N21/81 , H04N21/658 , H04N21/8549
Abstract: Embodiments of the present disclosure relate to a method and apparatus for selecting a video clip, a server and a medium. The method may include: determining at least two video clips from a video; for each video clip, perform following excitement determination steps: inputting a feature sequence of a video frame in the video clip and title information of the video into a pre-established prediction model to obtain a relevance between the inputted video frame and a title of the video; and determining an excitement of the video clip, based on the relevance between the video frame in the video clip and the title; and determining a target video clip from the video clips, based on the excitement of each of the video clips.
-
公开(公告)号:US20220270373A1
公开(公告)日:2022-08-25
申请号:US17743410
申请日:2022-05-12
Inventor: Xipeng Yang , Minyue Jiang , Xiao Tan , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: G06V20/52 , G06V10/40 , G06V10/764 , G06V10/774 , G06V10/22
Abstract: A method, an electronic device and a storage medium are provided. The method may include: acquiring a to-be-inspected image; inputting the to-be-inspected image into a pre-established vehicle detection model to obtain a vehicle detection result, where the vehicle detection result includes category information, coordinate information, coordinate reliabilities, and coordinate error information of detection boxes, and the vehicle detection model is configured for characterizing a corresponding relationship between images and vehicle detection results; selecting, based on the coordinate reliabilities of the detection boxes, a detection box from the vehicle detection result for use as a to-be-processed detection box; and generating, based on coordinate information and coordinate error information of the to-be-processed detection box, coordinate information of a processed detection box.
-
公开(公告)号:US11256920B2
公开(公告)日:2022-02-22
申请号:US16830895
申请日:2020-03-26
Inventor: Xiang Long , Dongliang He , Fu Li , Zhizhen Chi , Zhichao Zhou , Xiang Zhao , Ping Wang , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and an apparatus for classifying a video are provided. The method may include: acquiring a to-be-classified video; extracting a set of multimodal features of the to-be-classified video; inputting the set of multimodal features into a post-fusion model corresponding to each modal respectively, to obtain multimodal category information of the to-be-classified video; and fusing the multimodal category information of the to-be-classified video, to obtain category information of the to-be-classified video. This embodiment improves the accuracy of video classification.
-
公开(公告)号:US20210312209A1
公开(公告)日:2021-10-07
申请号:US17349055
申请日:2021-06-16
Inventor: Xiaoqing Ye , Xiao Tan , Hao Sun
Abstract: A vehicle information detection method, an electronic device and a storage medium are provided, and relates to the technical field of artificial intelligence, in particular to the technical field of computer vision and deep learning. The method includes: determining a bird's-eye view of a target vehicle based on an image of the target vehicle; performing feature extraction on the image of the target vehicle and the bird's-eye view respectively, to obtain first feature information corresponding to the image of the target vehicle and second feature information corresponding to the bird's-eye view of the target vehicle; and determining three-dimensional information of the target vehicle based on the first feature information and the second feature information. According to embodiments of the disclosure, accurate detection of vehicle information can be realized based on a monocular image.
-
50.
公开(公告)号:US20210295546A1
公开(公告)日:2021-09-23
申请号:US17335647
申请日:2021-06-01
Inventor: Dongliang He , Henan Zhang , Hao Sun
Abstract: The satellite image processing method includes: acquiring a first target satellite image; defogging the first target satellite image through a first neural network to acquire a first satellite image; and adjusting an image quality parameter of the first satellite image through a second neural network to acquire a second satellite image.
-
-
-
-
-
-
-
-
-