Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Errui Ding"

1.

发明授权
Human body identification method, electronic device and storage medium 有权

公开(公告)号：US11854237B2

公开(公告)日：2023-12-26

申请号：US17353324

申请日：2021-06-21

Applicant: Beijing Baidu Netcom Science and Technology Co., LTD

Inventor： Zipeng Lu , Jian Wang , Yuchen Yuan , Hao Sun , Errui Ding

IPC: G06V10/25 , G06V40/10 , G06F18/214 , G06V10/75 , G06V10/764 , G06V10/774 , G06V10/82

CPC classification number: G06V10/25 , G06F18/214 , G06V10/757 , G06V10/764 , G06V10/774 , G06V10/82 , G06V40/10 , G06V40/103

Abstract: A human body identification method, an electronic device and a storage medium, related to the technical field of artificial intelligence such as computer vision and deep learning, are provided. The method includes: inputting an image to be identified into a human body detection model, to obtain a plurality of preselected detection boxes; identifying a plurality of key points from each of the preselected detection boxes respectively according to a human body key point detection model, and obtaining a key point score of each of the key points; determining a target detection box from each of the preselected detection boxes, according to a number of the key points whose key point scores meet a key point threshold; and inputting the target detection box into a human body key point classification model, to obtain a human body identification result for the image to be identified.

2.

发明授权
Method and apparatus for generating face image 有权

公开(公告)号：US11463631B2

公开(公告)日：2022-10-04

申请号：US17025255

申请日：2020-09-18

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Henan Zhang , Xin Li , Fu Li , Tianwei Lin , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding

IPC: H04N5/262 , H04N5/232 , G06T5/00 , G06V40/16

Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.

3.

发明申请
METHOD AND APPARATUS FOR DETECTING VEHICLE POSE 有权

公开(公告)号：US20220270289A1

公开(公告)日：2022-08-25

申请号：US17743402

申请日：2022-05-12

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Wei Zhang , Xiaoqing Ye , Xiao Tan , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding

IPC: G06T7/73 , G06T7/194 , G06T7/593 , H04N13/128

Abstract: A method and device for detecting a vehicle pose, relating to the fields of computer vision and automatic driving. The specific implementation solution comprises: inputting a vehicle left view point image and a vehicle right view point image into a part prediction and mask segmentation network model, and determining foreground pixel points and part coordinates thereof in a reference image; converting coordinates of the foreground pixels in the reference image into coordinates of the foreground pixels in a camera coordinate system so as to obtain a pseudo-point cloud, and fusing part coordinate of the foreground pixels and the pseudo-point cloud to obtain fused pseudo-point cloud; and inputting the fused pseudo-point cloud into a pre-trained pose prediction model to obtain a pose information of the vehicle to be detected.

4.

发明授权
Video processing method, apparatus, device and storage medium 有权

公开(公告)号：US11416967B2

公开(公告)日：2022-08-16

申请号：US17024253

申请日：2020-09-17

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chao Li , Shilei Wen , Errui Ding

IPC: G06K9/00 , G06T3/40 , G06T7/13 , G06N3/08 , G06T7/40

Abstract: Embodiments of the present disclosure provide a video processing method, a video processing device and a related non-transitory computer readable storage medium. The method includes the following. Frame sequence data of a low-resolution video to be converted is obtained. Pixel tensors of each frame in the frame sequence data are inputted into a pre-trained neural network model to obtain high-resolution video frame sequence data corresponding to the video to be converted output by the neural network model. The neural network model obtains the high-resolution video frame sequence data based on high-order pixel information of each frame in the frame sequence data.

5.

发明授权
Method for video frame interpolation, related electronic device and storage medium 有权

公开(公告)号：US11363271B2

公开(公告)日：2022-06-14

申请号：US17125370

申请日：2020-12-17

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chao Li , Yukang Ding , Dongliang He , Fu Li , Hao Sun , Shilei Wen , Hongwu Zhang , Errui Ding

IPC: H04N19/132 , H04N19/172 , G06K9/62 , G06N3/04

Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.

6.

发明授权
End-to-end text recognition method and apparatus, computer device and readable medium 有权

公开(公告)号：US11210546B2

公开(公告)日：2021-12-28

申请号：US16822085

申请日：2020-03-18

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Yipeng Sun , Chengquan Zhang , Zuming Huang , Jiaming Liu , Junyu Han , Errui Ding

IPC: G06K9/46 , G06K9/32 , G06K9/62

Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.

7.

发明授权
Method and apparatus for detecting target object in image 有权

公开(公告)号：US11776155B2

公开(公告)日：2023-10-03

申请号：US16894123

申请日：2020-06-05

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Xiaoqing Ye , Xiao Tan , Wei Zhang , Hao Sun , Errui Ding

IPC: G06T7/73 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/64 , G06V10/25 , G06F18/24 , G06F18/214

CPC classification number: G06T7/73 , G06F18/214 , G06F18/24 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/647 , G06T2207/20081 , G06T2207/20084 , G06T2210/12

Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting a target object in an image. The method includes: performing following prediction operations using a pre-trained neural network: detecting a target object in a two-dimensional image to determine a two-dimensional bounding box of the target object; and determining a relative position constraint relationship between the two-dimensional bounding box of the target object and a three-dimensional projection bounding box obtained by projecting a three-dimensional bounding box of the target object into the two-dimensional image; and the method further including: determining the three-dimensional projection bounding box of the target object, based on the two-dimensional bounding box of the target object and the relative position constraint relationship between the two-dimensional bounding box of the target object and the three-dimensional projection bounding box.

8.

发明授权
Method and apparatus for vehicle damage assessment, electronic device, and computer storage medium 有权

公开(公告)号：US11538286B2

公开(公告)日：2022-12-27

申请号：US16710464

申请日：2019-12-11

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding

IPC: G07C5/00 , G06N3/04 , G06Q10/00 , G06V20/10

Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.

9.

发明授权
Method and apparatus for detecting text regions in image, device, and medium 有权

公开(公告)号：US11482023B2

公开(公告)日：2022-10-25

申请号：US16710528

申请日：2019-12-11

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chengquan Zhang , Zuming Huang , Mengyi En , Junyu Han , Errui Ding

IPC: G06V30/262 , G06N20/00 , G06V10/22 , G06V30/148 , G06V30/10

Abstract: A method and apparatus for detecting text regions in an image, a device, and a medium are provided. The method may include: detecting, based on feature representation of an image, a first text region in the image, where the first text region covers a text in the image, a region occupied by the text being of a certain shape; determining, based on a feature block of the first text region, text geometry information associated with the text, where the text geometry information includes a text centerline of the text and distance information of the centerline from the upper and lower borders of the text; and adjusting, based on the text geometry information associated with the text, the first text region to a second text region, where the second text region also covers the text and is smaller than the first text region.

10.

发明授权
Pedestrian re-identification method, computer device and readable medium 有权

公开(公告)号：US11379696B2

公开(公告)日：2022-07-05

申请号：US16817419

申请日：2020-03-12

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Zhigang Wang , Jian Wang , Shilei Wen , Errui Ding , Hao Sun

IPC: G06K9/62 , G06V10/40 , G06V40/10

Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, computer device and readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model; wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image. According to the pedestrian re-identification method of the present disclosure, the accuracy of the pedestrian re-identification can be effectively improved when the feature extraction model is used to perform the pedestrian re-identification.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification