Patent search ap:("Beijing Baidu Netcom Science AND Technology Co. Page Ltd.") AND inv:"Errui DING"

1.

发明申请
METHOD FOR DETECTING SURFACE DEFECT, METHOD FOR TRAINING MODEL, APPARATUS, DEVICE, AND MEDIA 有权

公开(公告)号：US20210390682A1

公开(公告)日：2021-12-16

申请号：US17116597

申请日：2020-12-09

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Shufei LIN , Jianfeng ZHU , Pengcheng YUAN , Bin ZHANG , Shumin HAN , Yingbo XU , Yuan FENG , Ying XIN , Xiaodi WANG , Jingwei LIU , Shilei WEN , Hongwu ZHANG , Errui DING

IPC: G06T7/00 , G06K9/62 , G06K9/46 , G06N3/08 , G06N3/04

Abstract: A method for detecting a surface defect, a method for training model, an apparatus, a device, and a medium, are provided. The method includes: inputting a surface image of the article for detection into a defect detection model to perform a defect detection, and acquiring a defect detection result output by the defect detection model; inputting a surface image of a defective article determined to be defective into an image discrimination model based on the defect detection result to determine whether the surface image of the defective article is defective, wherein the image discrimination model is a trained generative adversarial networks model, and the generative adversarial networks model is obtained by training using a surface image of a defect-free good article; and adjusting the defect detection result of the surface image of the defective article according to a determination result of the image discrimination model.

2.

发明申请
METHOD AND APPARATUS FOR GENERATING IMAGE 有权

公开(公告)号：US20210227152A1

公开(公告)日：2021-07-22

申请号：US17025255

申请日：2020-09-18

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Henan ZHANG , Xin LI , Fu LI , Tianwei LIN , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING

IPC: H04N5/262 , H04N5/232 , G06K9/00 , G06T5/00

Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.

3.

发明申请
METHOD AND APPARATUS FOR DETECTING TEMPORAL ACTION OF VIDEO, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210216783A1

公开(公告)日：2021-07-15

申请号：US17144523

申请日：2021-01-08

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Xiang LONG , Dongliang HE , Fu LI , Xiang ZHAO , Tianwei LIN , Hao SUN , Shilei WEN , Errui DING

IPC: G06K9/00 , G06K9/62

Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.

4.

发明申请
SUPER-RESOLUTION VIDEO RECONSTRUCTION METHOD, DEVICE, APPARATUS AND COMPUTER-READABLE STORAGE MEDIUM 审中-公开

公开(公告)号：US20200372609A1

公开(公告)日：2020-11-26

申请号：US16810986

申请日：2020-03-06

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Chao LI , Dongliang HE , Xiao LIU , Yukang DING , Shilei WEN , Errui DING , Henan ZHANG , Hao SUN

IPC: G06T3/40

Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.

5.

发明申请
METHOD AND APPARATUS FOR POSITIONING KEY POINT, DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20210390731A1

公开(公告)日：2021-12-16

申请号：US17201665

申请日：2021-03-15

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Jian WANG , Zipeng LU , Hao SUN , Hongwu ZHANG , Shilei WEN , Errui DING

IPC: G06T7/73 , G06K9/62 , G06K9/46 , G06N3/04

Abstract: A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.

6.

发明申请
PHOTO-TAKING PROMPTING METHOD AND APPARATUS, AN APPARATUS AND NON-VOLATILE COMPUTER STORAGE MEDIUM 审中-公开

公开(公告)号：US20180007259A1

公开(公告)日：2018-01-04

申请号：US15543969

申请日：2015-11-13

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Fujian WANG , Fuguo ZHU , Errui DING , Long GONG , Yafeng DENG

IPC: H04N5/232 , G06K9/62 , G06K9/00

CPC classification number: H04N5/23222 , G06K9/00248 , G06K9/00255 , G06K9/6202 , H04N5/23219 , H04N5/23293

Abstract: The present disclosure provides a photo-taking prompting method and apparatus, an apparatus and a non-volatile computer storage medium. On the one hand, a user's image information is collected while the user finds view, then the user's face posture information is obtained from the image information, then face posture information of a preset photo-taking template is compared with the user's face posture information, and the user is prompted to adjust the face posture according to a comparison result. The technical solutions provided by embodiments of the present disclosure may implement prompting the user's face posture adjustment while the user finds view and thereby implement providing guidance for the user's face posture, and solve the problem in the prior art about failure to perform photo-taking guidance while the user finds view.

7.

发明申请
Vehicle Tracking Method, Apparatus, and Electronic Device 有权

公开(公告)号：US20210350146A1

公开(公告)日：2021-11-11

申请号：US17379448

申请日：2021-07-19

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Wei ZHANG , Xiao TAN , Hao SUN , Errui DING

IPC: G06K9/00 , G06K9/62 , G06T7/215 , G06T7/246

Abstract: A vehicle tracking method, apparatus, and electronic device relate to the technical field of computer vision and deep learning. A method includes identifying first position information of a first vehicle in a first image of a video stream collected during driving of vehicles; and identifying second position information of a second vehicle in a second image of the video stream. The first image is the previous N frame images adjacent to the second image in the video stream, and N is a positive integer. The method also includes predicting first position offset information of the second vehicle relative to the first vehicle on the basis of the first image and the second image; and determining a tracking result of the second vehicle on the basis of the first position information, the second position information and the first position offset information.

8.

发明申请
METHOD AND APPARATUS FOR GENERATING TARGET RE-RECOGNITION MODEL AND RE-RECOGNIZING TARGET 有权

公开(公告)号：US20210312208A1

公开(公告)日：2021-10-07

申请号：US17304296

申请日：2021-06-17

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Zhigang WANG , Jian WANG , Errui DING , Hao SUN

IPC: G06K9/46 , G06K9/62

Abstract: A method, an apparatus, device and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the sample according to a cluster corresponding to the sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing supervised training on the initialization model to obtain a target re-recognition model.

9.

发明申请
END-TO-END TEXT RECOGNITION METHOD AND APPARATUS, COMPUTER DEVICE AND READABLE MEDIUM 有权

公开(公告)号：US20210004629A1

公开(公告)日：2021-01-07

申请号：US16822085

申请日：2020-03-18

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Yipeng SUN , Chengquan ZHANG , Zuming HUANG , Jiaming LIU , Junyu HAN , Errui DING

IPC: G06K9/46 , G06K9/62 , G06K9/32

Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.

10.

发明申请
METHOD FOR VIDEO FRAME INTERPOLATION, RELATED ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20210360252A1

公开(公告)日：2021-11-18

申请号：US17125370

申请日：2020-12-17

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventor： Chao LI , Yukang DING , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING

IPC: H04N19/132 , H04N19/172 , G06N3/04 , G06K9/62

Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification