-
Publication No.: US20210383120A1
Publication Date: 2021-12-09
Application No.: US17116578
Filing Date: 2020-12-09
Inventor: Zhichao ZHOU, Dongliang HE, Fu LI, Hao SUN
Abstract: The present disclosure provides a method and apparatus for detecting a region of interest (ROI) in a video, a device, and a storage medium. The method may include: acquiring a current to-be-processed frame from a picture frame sequence of a video; in response to determining that the current to-be-processed frame is a detection picture frame, detecting the ROI in the current to-be-processed frame to determine at least one ROI, and updating a to-be-tracked ROI based on the ROI in the current to-be-processed frame and a tracking result determined from a preceding tracking picture frame; and in response to determining that the current to-be-processed frame is a tracking picture frame, tracking the current to-be-processed frame based on the existing to-be-tracked ROI to determine at least one tracking result as the ROI of the current to-be-processed frame.
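The detection/tracking alternation described in this abstract can be sketched roughly as follows. This is only an illustrative sketch, not the claimed implementation: the function name `process_video`, the `detect` and `track` callables, the fixed `detect_interval` schedule, and the merge rule for the to-be-tracked ROIs are all assumptions, since the abstract does not specify them.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height); an assumed ROI format


def process_video(
    frames: Sequence[object],
    detect: Callable[[object], List[Box]],
    track: Callable[[object, List[Box]], List[Box]],
    detect_interval: int = 5,
) -> List[List[Box]]:
    """Return the ROIs determined for every frame.

    Assumption: every `detect_interval`-th frame is a detection frame and
    all other frames are tracking frames.
    """
    to_track: List[Box] = []      # the current to-be-tracked ROIs
    last_tracked: List[Box] = []  # result of the preceding tracking frame
    results: List[List[Box]] = []
    for index, frame in enumerate(frames):
        if index % detect_interval == 0:
            detected = detect(frame)
            # Assumed update rule: keep the new detections, plus any earlier
            # tracking result that no detection duplicates exactly.
            to_track = detected + [b for b in last_tracked if b not in detected]
            results.append(detected)
        else:
            # Tracking frame: follow the existing to-be-tracked ROIs.
            last_tracked = track(frame, to_track)
            results.append(last_tracked)
    return results
```

A real system would likely merge detections and tracking results by overlap (for example, intersection over union) rather than by exact equality; exact equality is used here only to keep the sketch short.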
-
Publication No.: US20210360252A1
Publication Date: 2021-11-18
Application No.: US17125370
Filing Date: 2020-12-17
Inventor: Chao LI, Yukang DING, Dongliang HE, Fu LI, Hao SUN, Shilei WEN, Hongwu ZHANG, Errui DING
IPC: H04N19/132, H04N19/172, G06N3/04, G06K9/62
Abstract: A method for video frame interpolation, a related electronic device, and a storage medium are disclosed. A video is obtained. An (i−1)th frame and an ith frame of the video are obtained. Visual semantic feature maps and depth maps of the (i−1)th frame and the ith frame are obtained. Frame interpolation information is obtained based on the visual semantic feature maps and the depth maps. An interpolated frame between the (i−1)th frame and the ith frame is generated based on the frame interpolation information and the (i−1)th frame, and is inserted between the (i−1)th frame and the ith frame.
-
Publication No.: US20210335008A1
Publication Date: 2021-10-28
Application No.: US17172883
Filing Date: 2021-02-10
Inventor: Xiaoqing YE, Xiao TAN, Hao SUN, Hongwu ZHANG
Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a video frame, and relate to the field of computer vision technology. The method may include: acquiring a plurality of candidate first-order radial distortion parameters preset for a to-be-processed video frame, and acquiring a specified value of a specified radial distortion parameter; performing radial distortion correction on the to-be-processed video frame to obtain a first initial corrected video frame; selecting a first initial corrected video frame in which a local region, excluding a center region, includes the largest number of straight line segments after distortion correction; and determining the candidate first-order radial distortion parameter corresponding to the selected first initial corrected video frame for use as a target first-order radial distortion parameter of the to-be-processed video frame.
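As a rough illustration of the selection step in this abstract, the sketch below applies a common first-order radial model, x_u = c + (x_d − c)(1 + k1·r²), and picks the candidate k1 whose corrected frame scores highest. The names `undistort_point` and `select_k1`, and the `correct` and `count_outer_lines` callables, are hypothetical. The abstract does not state which distortion model or line detector is used, and the separately specified radial distortion parameter is omitted here.

```python
from typing import Callable, Sequence, Tuple, TypeVar

Frame = TypeVar("Frame")


def undistort_point(
    x: float, y: float, cx: float, cy: float, k1: float
) -> Tuple[float, float]:
    """Map a distorted point to its corrected position.

    Assumed model: scale the offset from the center (cx, cy) by 1 + k1 * r^2,
    where r is the distance of the distorted point from the center.
    """
    dx, dy = x - cx, y - cy
    scale = 1.0 + k1 * (dx * dx + dy * dy)
    return cx + dx * scale, cy + dy * scale


def select_k1(
    frame: Frame,
    candidates: Sequence[float],
    correct: Callable[[Frame, float], Frame],
    count_outer_lines: Callable[[Frame], int],
) -> float:
    """Return the candidate whose corrected frame has the most straight
    line segments outside the center region (as counted by the caller)."""
    return max(candidates, key=lambda k1: count_outer_lines(correct(frame, k1)))
```

Here `correct` would apply the correction to a whole frame and `count_outer_lines` would run a line-segment detector on the region outside the center; both are left to the caller because the abstract does not describe them.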
-
Publication No.: US20210334979A1
Publication Date: 2021-10-28
Application No.: US17366691
Filing Date: 2021-07-02
Inventor: Mian PENG, Jian WANG, Hao SUN, Xiao TAN, Errui DING
Abstract: A method of segmenting an image includes acquiring a first segmentation probability map of an input portrait image and detecting a region where a target part of the input portrait image is located. The method also includes acquiring a partial image including the target part and corresponding to the region and acquiring a partial segmentation probability map of the region in the first segmentation probability map. The method further includes segmenting the partial image in accordance with the partial segmentation probability map to acquire a second segmentation probability map. The first segmentation probability map and the second segmentation probability map are combined to acquire a segmentation result of the input portrait image.
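The crop, refine, and combine flow in this abstract might look roughly like the following. This is a sketch under stated assumptions: the name `refine_segmentation`, the `(top, left, height, width)` region format, the `refine` callable, and the rule of overwriting the region with the refined probabilities are illustrative choices that the abstract does not specify.

```python
from typing import Callable, List, Tuple

Grid = List[List[float]]  # 2-D image or probability map as nested lists


def refine_segmentation(
    image: Grid,
    full_prob: Grid,
    region: Tuple[int, int, int, int],
    refine: Callable[[Grid, Grid], Grid],
) -> Grid:
    """Combine a full-image probability map with a refined partial map.

    `region` is (top, left, height, width) of the detected target part.
    `refine` stands in for the second segmentation step and must return a
    map with the same shape as the crop.
    """
    top, left, height, width = region
    # Crop the partial image and the matching part of the first map.
    partial_image = [row[left:left + width] for row in image[top:top + height]]
    partial_prob = [row[left:left + width] for row in full_prob[top:top + height]]
    second_prob = refine(partial_image, partial_prob)
    # Assumed combination rule: the refined values replace the region.
    combined = [row[:] for row in full_prob]
    for r in range(height):
        for c in range(width):
            combined[top + r][left + c] = second_prob[r][c]
    return combined
```

Other combination rules, such as averaging the two maps inside the region, would fit the same interface.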
-
Publication No.: US20210304413A1
Publication Date: 2021-09-30
Application No.: US17344917
Filing Date: 2021-06-10
Inventor: Hao SUN, Fu LI, Tianwei LIN, Dongliang HE
Abstract: An image processing method, an image processing device, and an electronic device are provided, relating to computer vision and deep learning. The image processing method includes: acquiring a first image and a second image; performing semantic region segmentation on the first image and the second image to acquire a first segmentation image and a second segmentation image respectively; determining an association matrix between the first segmentation image and the second segmentation image; and processing the first image in accordance with the association matrix to acquire a target image.
-
Publication No.: US20200342271A1
Publication Date: 2020-10-29
Application No.: US16817419
Filing Date: 2020-03-12
Inventor: Zhigang WANG, Jian WANG, Shilei WEN, Errui DING, Hao SUN
Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, a computer device, and a readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model, wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; and identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image. Using the feature extraction model in this way can effectively improve the accuracy of pedestrian re-identification.
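The final identification step compares two feature expressions. One common way to do this, shown here purely as an assumed example, is to threshold the cosine similarity of the two feature vectors. The abstract does not say which comparison is used; the names `cosine_similarity` and `is_same_pedestrian` and the threshold value of 0.8 are hypothetical.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def is_same_pedestrian(
    target_feature: Sequence[float],
    query_feature: Sequence[float],
    threshold: float = 0.8,  # assumed value; would be tuned on validation data
) -> bool:
    """Decide identity by thresholding the similarity of the two features."""
    return cosine_similarity(target_feature, query_feature) >= threshold
```

The feature vectors themselves would come from the pre-trained feature extraction model, which is outside the scope of this sketch.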
-
Publication No.: US20220207299A1
Publication Date: 2022-06-30
Application No.: US17460646
Filing Date: 2021-08-30
Inventor: Chao LI, Dongliang HE, Wenling GAO, Fu LI, Hao SUN
Abstract: A method for building an image enhancement model includes obtaining training data; building a neural network model consisting of a feature extraction module, at least one channel dilated convolution module and a spatial upsampling module, where each channel dilated convolution module includes a spatial downsampling submodule, a channel dilation submodule and a spatial upsampling submodule; training the neural network model by using the video frames and the standard images corresponding to the video frames until the neural network model converges, to obtain an image enhancement model. In addition, a method for image enhancement includes obtaining a video frame to be processed; taking the video frame to be processed as an input of an image enhancement model, and taking an output result of the image enhancement model as an image enhancement result of the video frame to be processed.
-
Publication No.: US20210350146A1
Publication Date: 2021-11-11
Application No.: US17379448
Filing Date: 2021-07-19
Inventor: Wei ZHANG, Xiao TAN, Hao SUN, Errui DING
Abstract: A vehicle tracking method, apparatus, and electronic device relate to the technical field of computer vision and deep learning. A method includes identifying first position information of a first vehicle in a first image of a video stream collected during driving of vehicles; and identifying second position information of a second vehicle in a second image of the video stream. The first image precedes the second image by N frames in the video stream, where N is a positive integer. The method also includes predicting first position offset information of the second vehicle relative to the first vehicle on the basis of the first image and the second image; and determining a tracking result of the second vehicle on the basis of the first position information, the second position information and the first position offset information.
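One plausible reading of the last step is that the predicted offset is used to map each vehicle in the second image back to where it should have been in the first image, and the nearest first-image vehicle then supplies its identity. The sketch below follows that reading; the name `match_vehicles`, the point-based position format, the greedy nearest-neighbor matching, and the `max_distance` gate are assumptions rather than details from the abstract.

```python
import math
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]  # assumed position format: (x, y) center point


def match_vehicles(
    first_positions: Dict[int, Point],
    second_positions: List[Point],
    offsets: List[Point],
    max_distance: float = 20.0,
) -> List[Optional[int]]:
    """Assign a first-image vehicle ID (or None) to each second-image vehicle.

    `offsets[i]` is the predicted displacement of second-image vehicle i
    relative to its position in the first image.
    """
    assigned: List[Optional[int]] = []
    used = set()
    for (x, y), (dx, dy) in zip(second_positions, offsets):
        # Undo the predicted motion to estimate the earlier position.
        ex, ey = x - dx, y - dy
        best_id, best_dist = None, max_distance
        for vehicle_id, (fx, fy) in first_positions.items():
            if vehicle_id in used:
                continue
            dist = math.hypot(fx - ex, fy - ey)
            if dist <= best_dist:
                best_id, best_dist = vehicle_id, dist
        if best_id is not None:
            used.add(best_id)
        assigned.append(best_id)  # None means a new, unmatched vehicle
    return assigned
```

Greedy matching is order-dependent; an optimal assignment (for example, the Hungarian algorithm) would be a natural replacement in a fuller implementation.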
-
Publication No.: US20210343065A1
Publication Date: 2021-11-04
Application No.: US17373420
Filing Date: 2021-07-12
Inventor: Tianwei LIN, Fu LI, Xin LI, Henan ZHANG, Hao SUN
Abstract: The disclosure provides a cartoonization processing method for an image, and relates to the fields of computer vision, image processing, face recognition, and deep learning. The method includes: performing skin color recognition on a facial image to be processed to determine a target skin color of a face in the facial image; in a case that a cartoonizing model set does not contain a cartoonizing model corresponding to the target skin color, processing the facial image by utilizing any cartoonizing model in the cartoonizing model set to obtain a reference cartoonized image corresponding to the facial image; determining a pixel adjustment parameter based on the target skin color and a reference skin color corresponding to that cartoonizing model; and adjusting a pixel value of each pixel point in the reference cartoonized image based on the pixel adjustment parameter, to obtain a target cartoonized image corresponding to the facial image.
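To make the pixel adjustment step concrete, the sketch below assumes the adjustment parameter is a per-channel additive offset between the target and reference skin colors, applied to every pixel and clamped to the 0 to 255 range. The abstract does not say how the parameter is computed or applied, so the functions `pixel_adjustment` and `adjust_image` and the additive rule are hypothetical.

```python
from typing import List, Tuple

Color = Tuple[int, int, int]  # assumed 8-bit RGB


def pixel_adjustment(target_skin: Color, reference_skin: Color) -> Color:
    """Assumed parameter: per-channel difference between the two skin colors."""
    return tuple(t - r for t, r in zip(target_skin, reference_skin))


def adjust_image(image: List[List[Color]], adjustment: Color) -> List[List[Color]]:
    """Shift every pixel by the adjustment, clamping each channel to 0..255."""
    return [
        [
            tuple(min(255, max(0, value + delta))
                  for value, delta in zip(pixel, adjustment))
            for pixel in row
        ]
        for row in image
    ]
```

With this rule, a pixel that had exactly the reference skin color ends up with the target skin color, while other pixels are shifted by the same amount.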
-
Publication No.: US20210312208A1
Publication Date: 2021-10-07
Application No.: US17304296
Filing Date: 2021-06-17
Inventor: Zhigang WANG, Jian WANG, Errui DING, Hao SUN
Abstract: A method, an apparatus, a device, and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples, and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of unlabeled samples by using the initialization model; clustering the features extracted from the set of unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of unlabeled samples, a pseudo label to the sample according to the cluster corresponding to the sample in a feature space; and mixing the set of samples with pseudo labels and the set of labeled samples as a set of training samples, and performing supervised training on the initialization model to obtain a target re-recognition model.
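The pseudo-labeling pipeline in this abstract can be sketched as below. The function `build_training_set` and its `extract` and `cluster` callables are hypothetical stand-ins for the initialization model and the clustering algorithm. Offsetting cluster IDs so they do not collide with real labels, and skipping samples with a negative cluster ID (as a density-based clusterer might mark noise), are assumptions not stated in the abstract.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Sample = TypeVar("Sample")
Feature = Sequence[float]


def build_training_set(
    labeled: Sequence[Tuple[Sample, int]],
    unlabeled: Sequence[Sample],
    extract: Callable[[Sample], Feature],
    cluster: Callable[[List[Feature]], List[int]],
) -> List[Tuple[Sample, int]]:
    """Mix labeled samples with pseudo-labeled samples for supervised training."""
    # 1. Extract a feature for each unlabeled sample with the initial model.
    features = [extract(sample) for sample in unlabeled]
    # 2. Cluster the features; one cluster ID per sample.
    cluster_ids = cluster(features)
    # 3. Turn cluster IDs into pseudo labels that do not clash with real ones.
    offset = max((label for _, label in labeled), default=-1) + 1
    pseudo = [
        (sample, offset + cid)
        for sample, cid in zip(unlabeled, cluster_ids)
        if cid >= 0  # assumed convention: negative IDs mean "no cluster"
    ]
    # 4. The mixed set is what the initialization model would be retrained on.
    return list(labeled) + pseudo
```

The returned list would then feed an ordinary supervised training loop, which is omitted here.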
-