-
公开(公告)号:US11854237B2
公开(公告)日:2023-12-26
申请号:US17353324
申请日:2021-06-21
Inventor: Zipeng Lu , Jian Wang , Yuchen Yuan , Hao Sun , Errui Ding
IPC: G06V10/25 , G06V40/10 , G06F18/214 , G06V10/75 , G06V10/764 , G06V10/774 , G06V10/82
CPC classification number: G06V10/25 , G06F18/214 , G06V10/757 , G06V10/764 , G06V10/774 , G06V10/82 , G06V40/10 , G06V40/103
Abstract: A human body identification method, an electronic device and a storage medium, related to the technical field of artificial intelligence such as computer vision and deep learning, are provided. The method includes: inputting an image to be identified into a human body detection model, to obtain a plurality of preselected detection boxes; identifying a plurality of key points from each of the preselected detection boxes respectively according to a human body key point detection model, and obtaining a key point score of each of the key points; determining a target detection box from each of the preselected detection boxes, according to a number of the key points whose key point scores meet a key point threshold; and inputting the target detection box into a human body key point classification model, to obtain a human body identification result for the image to be identified.
-
公开(公告)号:US11463631B2
公开(公告)日:2022-10-04
申请号:US17025255
申请日:2020-09-18
Inventor: Henan Zhang , Xin Li , Fu Li , Tianwei Lin , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.
-
公开(公告)号:US20220270289A1
公开(公告)日:2022-08-25
申请号:US17743402
申请日:2022-05-12
Inventor: Wei Zhang , Xiaoqing Ye , Xiao Tan , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: G06T7/73 , G06T7/194 , G06T7/593 , H04N13/128
Abstract: A method and device for detecting a vehicle pose, relating to the fields of computer vision and automatic driving. The specific implementation solution comprises: inputting a vehicle left view point image and a vehicle right view point image into a part prediction and mask segmentation network model, and determining foreground pixel points and part coordinates thereof in a reference image; converting coordinates of the foreground pixels in the reference image into coordinates of the foreground pixels in a camera coordinate system so as to obtain a pseudo-point cloud, and fusing part coordinate of the foreground pixels and the pseudo-point cloud to obtain fused pseudo-point cloud; and inputting the fused pseudo-point cloud into a pre-trained pose prediction model to obtain a pose information of the vehicle to be detected.
-
公开(公告)号:US11416967B2
公开(公告)日:2022-08-16
申请号:US17024253
申请日:2020-09-17
Inventor: Chao Li , Shilei Wen , Errui Ding
Abstract: Embodiments of the present disclosure provide a video processing method, a video processing device and a related non-transitory computer readable storage medium. The method includes the following. Frame sequence data of a low-resolution video to be converted is obtained. Pixel tensors of each frame in the frame sequence data are inputted into a pre-trained neural network model to obtain high-resolution video frame sequence data corresponding to the video to be converted output by the neural network model. The neural network model obtains the high-resolution video frame sequence data based on high-order pixel information of each frame in the frame sequence data.
-
公开(公告)号:US11363271B2
公开(公告)日:2022-06-14
申请号:US17125370
申请日:2020-12-17
Inventor: Chao Li , Yukang Ding , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding
IPC: H04N19/132 , H04N19/172 , G06K9/62 , G06N3/04
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
公开(公告)号:US11210546B2
公开(公告)日:2021-12-28
申请号:US16822085
申请日:2020-03-18
Inventor: Yipeng Sun , Chengquan Zhang , Zuming Huang , Jiaming Liu , Junyu Han , Errui Ding
Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.
-
公开(公告)号:US11776155B2
公开(公告)日:2023-10-03
申请号:US16894123
申请日:2020-06-05
Inventor: Xiaoqing Ye , Xiao Tan , Wei Zhang , Hao Sun , Errui Ding
IPC: G06T7/73 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/64 , G06V10/25 , G06F18/24 , G06F18/214
CPC classification number: G06T7/73 , G06F18/214 , G06F18/24 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/647 , G06T2207/20081 , G06T2207/20084 , G06T2210/12
Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting a target object in an image. The method includes: performing following prediction operations using a pre-trained neural network: detecting a target object in a two-dimensional image to determine a two-dimensional bounding box of the target object; and determining a relative position constraint relationship between the two-dimensional bounding box of the target object and a three-dimensional projection bounding box obtained by projecting a three-dimensional bounding box of the target object into the two-dimensional image; and the method further including: determining the three-dimensional projection bounding box of the target object, based on the two-dimensional bounding box of the target object and the relative position constraint relationship between the two-dimensional bounding box of the target object and the three-dimensional projection bounding box.
-
8.
公开(公告)号:US11538286B2
公开(公告)日:2022-12-27
申请号:US16710464
申请日:2019-12-11
Inventor: Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.
-
公开(公告)号:US11482023B2
公开(公告)日:2022-10-25
申请号:US16710528
申请日:2019-12-11
Inventor: Chengquan Zhang , Zuming Huang , Mengyi En , Junyu Han , Errui Ding
IPC: G06V30/262 , G06N20/00 , G06V10/22 , G06V30/148 , G06V30/10
Abstract: A method and apparatus for detecting text regions in an image, a device, and a medium are provided. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.
-
公开(公告)号:US11379696B2
公开(公告)日:2022-07-05
申请号:US16817419
申请日:2020-03-12
Inventor: Zhigang Wang , Jian Wang , Shilei Wen , Errui Ding , Hao Sun
Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, computer device and readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model; wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image. According to the pedestrian re-identification method of the present disclosure, the accuracy of the pedestrian re-identification can be effectively improved when the feature extraction model is used to perform the pedestrian re-identification.
-
-
-
-
-
-
-
-
-