-
1.
公开(公告)号:US20210390682A1
公开(公告)日:2021-12-16
申请号:US17116597
申请日:2020-12-09
Inventor: Shufei LIN , Jianfeng ZHU , Pengcheng YUAN , Bin ZHANG , Shumin HAN , Yingbo XU , Yuan FENG , Ying XIN , Xiaodi WANG , Jingwei LIU , Shilei WEN , Hongwu ZHANG , Errui DING
Abstract: A method for detecting a surface defect, a method for training model, an apparatus, a device, and a medium, are provided. The method includes: inputting a surface image of the article for detection into a defect detection model to perform a defect detection, and acquiring a defect detection result output by the defect detection model; inputting a surface image of a defective article determined to be defective into an image discrimination model based on the defect detection result to determine whether the surface image of the defective article is defective, wherein the image discrimination model is a trained generative adversarial networks model, and the generative adversarial networks model is obtained by training using a surface image of a defect-free good article; and adjusting the defect detection result of the surface image of the defective article according to a determination result of the image discrimination model.
-
公开(公告)号:US20210227152A1
公开(公告)日:2021-07-22
申请号:US17025255
申请日:2020-09-18
Inventor: Henan ZHANG , Xin LI , Fu LI , Tianwei LIN , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING
Abstract: Embodiments of the present disclosure provide a method and apparatus for generating an image. The method may include: receiving a first image including a face input by a user in an interactive scene; presenting the first image to the user; inputting the first image into a pre-trained generative adversarial network in a backend to obtain a second image output by the generative adversarial network; where the generative adversarial network uses face attribute information generated based on the input image as a constraint; and presenting the second image to the user in response to obtaining the second image output by the generative adversarial network in the backend.
-
3.
公开(公告)号:US20210216783A1
公开(公告)日:2021-07-15
申请号:US17144523
申请日:2021-01-08
Inventor: Xiang LONG , Dongliang HE , Fu LI , Xiang ZHAO , Tianwei LIN , Hao SUN , Shilei WEN , Errui DING
Abstract: A method includes screening, by a video-clip screening module in a video description model, a plurality of video proposal clips acquired from a video to be analyzed, to acquire a plurality of video clips suitable for description. The plural video proposal clips acquired from the video to be analyzed may be screened by the video-clip screening module to acquire the plural video clips suitable for description; and then, each video clip is described by a video-clip describing module, thus avoiding description of all the video proposal clips, only describing the screened video clips which have strong correlation with the video and are suitable for description, removing the interference of the description of the video clips which are not suitable for description in the description of the video, guaranteeing the accuracy of the final descriptions of the video clips, and improving the quality of the descriptions of the video clips.
-
4.
公开(公告)号:US20200372609A1
公开(公告)日:2020-11-26
申请号:US16810986
申请日:2020-03-06
Inventor: Chao LI , Dongliang HE , Xiao LIU , Yukang DING , Shilei WEN , Errui DING , Henan ZHANG , Hao SUN
IPC: G06T3/40
Abstract: A super-resolution video reconstruction method, device, apparatus and a computer-readable storage medium are provided. The method includes: extracting a hypergraph from consecutive frames of an original video; inputting a hypergraph vector of the hypergraph into a residual convolutional neural network to obtain an output result of the residual convolutional neural network; and inputting the output result of the residual convolutional neural network into a spatial upsampling network to obtain a super-resolution frame, wherein a super-resolution video of the original video is formed by multiple super-resolution frames.
-
公开(公告)号:US20210390731A1
公开(公告)日:2021-12-16
申请号:US17201665
申请日:2021-03-15
Inventor: Jian WANG , Zipeng LU , Hao SUN , Hongwu ZHANG , Shilei WEN , Errui DING
Abstract: A method and apparatus for positioning a key point, a device, and a storage medium are provided. The method may include: extracting a first feature map and a second feature map of a to-be-positioned image, the first feature map and the second feature map being different feature maps; determining, based on the first feature map, an initial position of a key point in the to-be-positioned image; determining, based on the second feature map, an offset of the key point; and adding the initial position of the key point with the offset of the key point to obtain a final position of the key point.
-
6.
公开(公告)号:US20180007259A1
公开(公告)日:2018-01-04
申请号:US15543969
申请日:2015-11-13
Inventor: Fujian WANG , Fuguo ZHU , Errui DING , Long GONG , Yafeng DENG
CPC classification number: H04N5/23222 , G06K9/00248 , G06K9/00255 , G06K9/6202 , H04N5/23219 , H04N5/23293
Abstract: The present disclosure provides a photo-taking prompting method and apparatus, an apparatus and a non-volatile computer storage medium. On the one hand, a user's image information is collected while the user finds view, then the user's face posture information is obtained from the image information, then face posture information of a preset photo-taking template is compared with the user's face posture information, and the user is prompted to adjust the face posture according to a comparison result. The technical solutions provided by embodiments of the present disclosure may implement prompting the user's face posture adjustment while the user finds view and thereby implement providing guidance for the user's face posture, and solve the problem in the prior art about failure to perform photo-taking guidance while the user finds view.
-
公开(公告)号:US20210350146A1
公开(公告)日:2021-11-11
申请号:US17379448
申请日:2021-07-19
Inventor: Wei ZHANG , Xiao TAN , Hao SUN , Errui DING
Abstract: A vehicle tracking method, apparatus, and electronic device relate to the technical field of computer vision and deep learning. A method includes identifying first position information of a first vehicle in a first image of a video stream collected during driving of vehicles; and identifying second position information of a second vehicle in a second image of the video stream. The first image is the previous N frame images adjacent to the second image in the video stream, and N is a positive integer. The method also includes predicting first position offset information of the second vehicle relative to the first vehicle on the basis of the first image and the second image; and determining a tracking result of the second vehicle on the basis of the first position information, the second position information and the first position offset information.
-
8.
公开(公告)号:US20210312208A1
公开(公告)日:2021-10-07
申请号:US17304296
申请日:2021-06-17
Inventor: Zhigang WANG , Jian WANG , Errui DING , Hao SUN
Abstract: A method, an apparatus, device and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the sample according to a cluster corresponding to the sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing supervised training on the initialization model to obtain a target re-recognition model.
-
公开(公告)号:US20210004629A1
公开(公告)日:2021-01-07
申请号:US16822085
申请日:2020-03-18
Inventor: Yipeng SUN , Chengquan ZHANG , Zuming HUANG , Jiaming LIU , Junyu HAN , Errui DING
Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.
-
公开(公告)号:US20210360252A1
公开(公告)日:2021-11-18
申请号:US17125370
申请日:2020-12-17
Inventor: Chao LI , Yukang DING , Dongliang HE , Fu LI , Hao SUN , Shilei WEN , Hongwu ZHANG , Errui DING
IPC: H04N19/132 , H04N19/172 , G06N3/04 , G06K9/62
Abstract: A method for video frame interpolation, a related electronic device and a storage medium is disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame and is inserted between the (i−1)th frame and the ith frame.
-
-
-
-
-
-
-
-
-