-
11.
Publication No.: US11587338B2
Publication Date: 2023-02-21
Application No.: US17211491
Filing Date: 2021-03-24
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: The present disclosure provides a three-dimensional (3D) object detection method, a 3D object detection apparatus, an electronic device, and a readable storage medium, belonging to a field of computer vision technologies. Two-dimensional (2D) image parameters and initial 3D image parameters are determined for a target object. Candidate 3D image parameters are determined for the target object based on a disturbance range of 3D parameters and the initial 3D image parameters determined for the target object. Target 3D image parameters are selected for the target object from the candidate 3D image parameters determined for the target object based on the 2D image parameters. A 3D detection result of the target object is determined based on the target 3D image parameters.
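The candidate-and-select idea in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the 3D parameters are reduced to a box center in camera coordinates, the disturbance range is a per-axis radius sampled on a regular grid, the 2D parameter is the center of the 2D box, and selection uses a pinhole projection with hypothetical intrinsics (fx, fy, cx, cy). None of these choices, nor the function names, come from the patent text.

```python
from itertools import product
from typing import List, Tuple

Vec3 = Tuple[float, float, float]
Intrinsics = Tuple[float, float, float, float]  # (fx, fy, cx, cy), assumed pinhole model


def candidate_centers(initial: Vec3, disturbance: Vec3, steps: int = 5) -> List[Vec3]:
    """Sample a regular grid within +/- disturbance around the initial 3D center."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    axes = []
    for value, radius in zip(initial, disturbance):
        axes.append([value - radius + 2.0 * radius * i / (steps - 1) for i in range(steps)])
    return list(product(*axes))


def project(center: Vec3, intrinsics: Intrinsics) -> Tuple[float, float]:
    """Project a 3D point (camera frame, z > 0) to pixel coordinates."""
    x, y, z = center
    fx, fy, cx, cy = intrinsics
    return (fx * x / z + cx, fy * y / z + cy)


def select_center(candidates: List[Vec3], box_center_2d: Tuple[float, float],
                  intrinsics: Intrinsics) -> Vec3:
    """Pick the candidate whose projection lies closest to the 2D box center."""
    def cost(center: Vec3) -> float:
        u, v = project(center, intrinsics)
        return (u - box_center_2d[0]) ** 2 + (v - box_center_2d[1]) ** 2
    return min(candidates, key=cost)
```

A real detector would score candidates against richer 2D evidence (for example the full 2D box or key points) and perturb more parameters than the center; the grid search here only shows the overall shape of the procedure.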
-
Publication No.: US20210319579A1
Publication Date: 2021-10-14
Application No.: US17179456
Filing Date: 2021-02-19
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: Embodiments of the present disclosure disclose a method and apparatus for generating position information, a device and a medium. A specific embodiment of the method includes: acquiring an image and vehicle position information, wherein the image includes a target element; inputting the image into a pre-established depth map generation model to obtain a first depth map, wherein the focal length of sample images of sample data used during the training of the model is a sample focal length; generating a second depth map based on the sample focal length, the first depth map, and an estimated focal length of the image; determining depth information of the target element according to element position information of the target element in the image and the second depth map; and generating position information of the target element based on the vehicle position information and the depth information of the target element.
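As a rough illustration of the focal-length step in the abstract above, the sketch below rescales a depth map by the ratio of the estimated focal length to the sample focal length, reads the target's depth from the rescaled map, and offsets the vehicle position by that depth. The linear rescaling rule, the median lookup inside a bounding box, the planar heading-based offset, and all function names are assumptions made for illustration; the abstract does not specify them.

```python
import math
from statistics import median
from typing import List, Tuple

DepthMap = List[List[float]]


def rescale_depth_map(first_depth: DepthMap, sample_focal: float,
                      estimated_focal: float) -> DepthMap:
    """Scale every depth by estimated_focal / sample_focal (assumed rule)."""
    if sample_focal <= 0 or estimated_focal <= 0:
        raise ValueError("focal lengths must be positive")
    scale = estimated_focal / sample_focal
    return [[depth * scale for depth in row] for row in first_depth]


def element_depth(depth_map: DepthMap, box: Tuple[int, int, int, int]) -> float:
    """Median depth inside box (x0, y0, x1, y1); x1 and y1 are exclusive."""
    x0, y0, x1, y1 = box
    values = [depth_map[y][x] for y in range(y0, y1) for x in range(x0, x1)]
    if not values:
        raise ValueError("box covers no pixels")
    return median(values)


def element_position(vehicle_xy: Tuple[float, float], heading_rad: float,
                     depth: float) -> Tuple[float, float]:
    """Offset the vehicle position by depth along its heading (planar assumption)."""
    vx, vy = vehicle_xy
    return (vx + depth * math.cos(heading_rad), vy + depth * math.sin(heading_rad))
```

The rescaling follows from the pinhole relation: for an object of fixed physical size and fixed apparent size in pixels, the implied depth grows in proportion to the focal length, so a model trained at one focal length under- or over-estimates depth on images taken at another.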
-
Publication No.: US20210304438A1
Publication Date: 2021-09-30
Application No.: US17346835
Filing Date: 2021-06-14
Inventor: Xiaoqing Ye, Zhikang Zou, Xiao Tan, Hao Sun
Abstract: The present disclosure provides an object pose obtaining method and an electronic device, and relates to the technical fields of image processing, computer vision, and deep learning. A detailed implementation is: extracting an image block of an object from an image, and generating a local coordinate system corresponding to the image block; obtaining 2D projection key points in an image coordinate system corresponding to a plurality of 3D key points on a 3D model of the object; converting the 2D projection key points into the local coordinate system to generate corresponding 2D prediction key points; obtaining direction vectors between each pixel point in the image block and each 2D prediction key point, and obtaining a 2D target key point corresponding to each 2D prediction key point based on the direction vectors; and determining a pose of the object according to the 3D key points and the 2D target key points.
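One way to turn per-pixel direction vectors into a single 2D key point is a least-squares intersection of the rays they define, sketched below. The abstract does not say how the target key point is derived from the direction vectors, so the least-squares formulation and the name `vote_keypoint` are assumptions for illustration, not the patented procedure.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]


def vote_keypoint(pixels: Sequence[Point], directions: Sequence[Point]) -> Point:
    """Least-squares point closest to all lines p_i + t * d_i.

    Each pixel p_i carries a direction d_i that is assumed to point toward the
    key point. Minimizing the summed squared perpendicular distance to every
    line gives the 2x2 system  sum(I - d d^T) x = sum((I - d d^T) p).
    """
    a11 = a12 = a22 = b1 = b2 = 0.0
    for (px, py), (dx, dy) in zip(pixels, directions):
        norm = math.hypot(dx, dy)
        if norm == 0.0:
            continue  # a zero vector carries no direction information
        dx, dy = dx / norm, dy / norm
        m11, m12, m22 = 1.0 - dx * dx, -dx * dy, 1.0 - dy * dy
        a11 += m11
        a12 += m12
        a22 += m22
        b1 += m11 * px + m12 * py
        b2 += m12 * px + m22 * py
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        raise ValueError("directions are parallel or missing; no unique intersection")
    return ((a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det)
```

With noisy predictions, a voting scheme that samples pixel pairs and scores hypotheses by agreement is a common alternative; the closed-form version above is simply the shortest correct instance of the idea.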
-
Publication No.: US11915466B2
Publication Date: 2024-02-27
Application No.: US17338328
Filing Date: 2021-06-03
Inventor: Xipeng Yang, Xiao Tan, Hao Sun, Hongwu Zhang
IPC: G06T7/246, G06V10/44, G06T7/215, G06T7/73, G06F18/213, G06V10/25, G06V30/19, G06V30/24, G06V10/82
CPC classification number: G06V10/454, G06F18/213, G06T7/215, G06T7/246, G06T7/74, G06V10/25, G06V10/82, G06V30/19173, G06V30/2504, G06T2207/20016, G06T2207/20081, G06V2201/07
Abstract: Embodiments of the present disclosure disclose a method and apparatus for determining a target anchor, a device and a storage medium. The method may include: extracting a plurality of feature maps of an original image using a feature extraction network; inputting the plurality of feature maps into a feature pyramid network to perform feature fusion, to obtain a plurality of fused feature maps; and using a region proposal network to implement operations as follows: determining an initial anchor of a network head using the fused feature map, based on a size of each fused feature map, and determining an offset parameter of the initial anchor, based on a ratio of the size of the fused feature map to the original image, and generating a plurality of candidate anchors in different directions, based on the offset parameter of the initial anchor.
-
15.
Publication No.: US11727676B2
Publication Date: 2023-08-15
Application No.: US17213746
Filing Date: 2021-03-26
Inventor: Yingying Li, Xiao Tan, Minyue Jiang, Hao Sun
IPC: G06V10/82, G06T3/00, G06V10/77, G06V10/80, G06F18/213, G06F18/211, G06F18/25
CPC classification number: G06V10/82, G06F18/211, G06F18/213, G06F18/253, G06T3/00, G06V10/7715, G06V10/806
Abstract: The present disclosure provides an image processing method. An image to be classified is input into a feature extraction model to generate N dimensional features. Dimension fusion is performed on M features of the N dimensional features to obtain M dimension fusion features. The image to be classified is processed based on M dimension fusion features and remaining features of the N dimensional features other than the M features.
-
Publication No.: US11557062B2
Publication Date: 2023-01-17
Application No.: US17172883
Filing Date: 2021-02-10
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a video frame, and relate to the field of computer vision technology. The method may include: acquiring a plurality of candidate first-order radial distortion parameters preset for a to-be-processed video frame, and acquiring a specified value of a specified radial distortion parameter; performing radial distortion correction on the to-be-processed video frame to obtain a first initial corrected video frame; selecting a first initial corrected video frame in which a local region except for a center region after distortion correction includes a largest number of straight line segments; and determining a candidate first-order radial distortion parameter corresponding to the selected first initial corrected video frame for use as a target first-order radial distortion parameter of the to-be-processed video frame.
-
Publication No.: US11521331B2
Publication Date: 2022-12-06
Application No.: US17179456
Filing Date: 2021-02-19
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun, Hongwu Zhang
Abstract: Embodiments of the present disclosure disclose a method and apparatus for generating position information, a device and a medium. A specific embodiment of the method includes: acquiring an image and vehicle position information, wherein the image includes a target element; inputting the image into a pre-established depth map generation model to obtain a first depth map, wherein the focal length of sample images of sample data used during the training of the model is a sample focal length; generating a second depth map based on the sample focal length, the first depth map, and an estimated focal length of the image; determining depth information of the target element according to element position information of the target element in the image and the second depth map; and generating position information of the target element based on the vehicle position information and the depth information of the target element.
-
Publication No.: US11514263B2
Publication Date: 2022-11-29
Application No.: US16869024
Filing Date: 2020-05-07
Inventor: Wei Zhang, Xiao Tan, Hao Sun, Shilei Wen, Errui Ding
IPC: G06K9/62, G06N3/04, G06N3/08, G06V10/44, G06V10/80, G06V10/82, G06V40/16, G06V10/26, G06V10/22
Abstract: Embodiments of the present disclosure disclose a method and apparatus for processing an image. A specific embodiment of the method includes: acquiring a feature map of a target image, where the target image contains a target object; determining a local feature map of a target size in the feature map; combining features of different channels in the local feature map to obtain a local texture feature map; and obtaining location information of the target object based on the local texture feature map.
-
Publication No.: US20220270373A1
Publication Date: 2022-08-25
Application No.: US17743410
Filing Date: 2022-05-12
Inventor: Xipeng Yang, Minyue Jiang, Xiao Tan, Hao Sun, Shilei Wen, Hongwu Zhang, Errui Ding
IPC: G06V20/52, G06V10/40, G06V10/764, G06V10/774, G06V10/22
Abstract: A method, an electronic device and a storage medium are provided. The method may include: acquiring a to-be-inspected image; inputting the to-be-inspected image into a pre-established vehicle detection model to obtain a vehicle detection result, where the vehicle detection result includes category information, coordinate information, coordinate reliabilities, and coordinate error information of detection boxes, and the vehicle detection model is configured for characterizing a corresponding relationship between images and vehicle detection results; selecting, based on the coordinate reliabilities of the detection boxes, a detection box from the vehicle detection result for use as a to-be-processed detection box; and generating, based on coordinate information and coordinate error information of the to-be-processed detection box, coordinate information of a processed detection box.
-
Publication No.: US20210312209A1
Publication Date: 2021-10-07
Application No.: US17349055
Filing Date: 2021-06-16
Inventor: Xiaoqing Ye, Xiao Tan, Hao Sun
Abstract: A vehicle information detection method, an electronic device and a storage medium are provided, which relate to the technical field of artificial intelligence, in particular to the technical fields of computer vision and deep learning. The method includes: determining a bird's-eye view of a target vehicle based on an image of the target vehicle; performing feature extraction on the image of the target vehicle and the bird's-eye view respectively, to obtain first feature information corresponding to the image of the target vehicle and second feature information corresponding to the bird's-eye view of the target vehicle; and determining three-dimensional information of the target vehicle based on the first feature information and the second feature information. According to embodiments of the disclosure, accurate detection of vehicle information can be realized based on a monocular image.