-
Publication No.: US11776155B2
Publication date: 2023-10-03
Application No.: US16894123
Application date: 2020-06-05
Inventor: Xiaoqing Ye , Xiao Tan , Wei Zhang , Hao Sun , Errui Ding
IPC: G06T7/73 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/64 , G06V10/25 , G06F18/24 , G06F18/214
CPC classification number: G06T7/73 , G06F18/214 , G06F18/24 , G06N3/04 , G06N3/08 , G06T7/70 , G06T11/20 , G06V10/764 , G06V10/82 , G06V20/58 , G06V20/647 , G06T2207/20081 , G06T2207/20084 , G06T2210/12
Abstract: Embodiments of the present disclosure provide a method and apparatus for detecting a target object in an image. The method includes performing the following prediction operations using a pre-trained neural network: detecting a target object in a two-dimensional image to determine a two-dimensional bounding box of the target object; and determining a relative position constraint relationship between the two-dimensional bounding box of the target object and a three-dimensional projection bounding box obtained by projecting a three-dimensional bounding box of the target object into the two-dimensional image. The method further includes determining the three-dimensional projection bounding box of the target object based on the two-dimensional bounding box of the target object and the relative position constraint relationship between the two-dimensional bounding box and the three-dimensional projection bounding box.
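The final step of this abstract, recovering the three-dimensional projection bounding box from the two-dimensional box and the predicted constraint, can be sketched as follows. The abstract does not say how the relative position constraint is encoded; this sketch assumes, purely for illustration, that it is a list of eight vertex offsets normalized by the width and height of the two-dimensional box. The names `decode_projection_box` and `rel_offsets` are hypothetical.

```python
from typing import List, Tuple

Box2D = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)
Point = Tuple[float, float]


def decode_projection_box(box: Box2D, rel_offsets: List[Point]) -> List[Point]:
    """Map normalized vertex offsets back to image coordinates.

    Assumption: each offset (dx, dy) is measured from the top-left corner
    of the 2D box, in units of the box width and height.
    """
    x_min, y_min, x_max, y_max = box
    width, height = x_max - x_min, y_max - y_min
    return [(x_min + dx * width, y_min + dy * height) for dx, dy in rel_offsets]


# Illustrative values only: a 2D box and eight assumed vertex offsets.
box = (100.0, 50.0, 300.0, 150.0)
offsets = [
    (0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0),          # front face
    (0.25, 0.25), (0.75, 0.25), (0.75, 0.75), (0.25, 0.75),  # rear face
]
vertices = decode_projection_box(box, offsets)
```

With this encoding, any vertex whose offsets lie in [0, 1] falls inside the two-dimensional box, which is one simple way to express a position constraint between the two boxes.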
-
Publication No.: US11568590B2
Publication date: 2023-01-31
Application No.: US17373420
Application date: 2021-07-12
Inventor: Tianwei Lin , Fu Li , Xin Li , Henan Zhang , Hao Sun
Abstract: The disclosure provides a cartoonization processing method for an image, and relates to the fields of computational vision, image processing, face recognition, and deep learning technologies. The method includes: performing skin color recognition on a facial image to be processed to determine a target skin color of a face in the facial image; in a case that a cartoonizing model set does not contain a cartoonizing model corresponding to the target skin color, processing the facial image by utilizing any cartoonizing model in the cartoonizing model set to obtain a reference cartoonized image corresponding to the facial image; determining a pixel adjustment parameter based on the target skin color and a reference skin color corresponding to that cartoonizing model; and adjusting a pixel value of each pixel point in the reference cartoonized image based on the pixel adjustment parameter, to obtain a target cartoonized image corresponding to the facial image.
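The last two steps of this abstract (deriving a pixel adjustment parameter and applying it to every pixel) might look like the sketch below. The abstract does not define the parameter; here it is assumed to be a per-channel RGB difference between the target and reference skin colors, added to each pixel and clipped to the 0 to 255 range. The function names are hypothetical.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]


def pixel_adjustment(target_skin: RGB, reference_skin: RGB) -> RGB:
    """Assumed parameter: per-channel difference (target minus reference)."""
    return tuple(t - r for t, r in zip(target_skin, reference_skin))


def adjust_image(image: List[List[RGB]], delta: RGB) -> List[List[RGB]]:
    """Add the adjustment to every pixel and clip each channel to [0, 255]."""
    def clip(value: int) -> int:
        return max(0, min(255, value))

    return [
        [tuple(clip(c + d) for c, d in zip(pixel, delta)) for pixel in row]
        for row in image
    ]


# Illustrative values only: a 1x2 "reference cartoonized image".
delta = pixel_adjustment(target_skin=(200, 150, 120), reference_skin=(230, 190, 170))
adjusted = adjust_image([[(230, 190, 170), (10, 20, 30)]], delta)
```

A pixel that already has the reference skin color is mapped exactly onto the target skin color, while very dark pixels are clipped at zero rather than wrapping around.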
-
Publication No.: US11538286B2
Publication date: 2022-12-27
Application No.: US16710464
Application date: 2019-12-11
Inventor: Wei Zhang , Xiao Tan , Hao Sun , Shilei Wen , Errui Ding
Abstract: A method and apparatus for vehicle damage assessment, an electronic device, and a computer-readable storage medium are provided. The method may include: extracting, from an input image, a first feature characterizing a part of a vehicle and a second feature characterizing a damage type of the vehicle; integrating the first feature and the second feature to generate a third feature characterizing a corresponding relation between the part and the damage type; converting the third feature into a characteristic vector; and determining a damage recognition result based on the characteristic vector. According to the technical solution of the disclosure, users can rapidly and accurately learn about the damage condition of the vehicle by providing pictures or videos of the damaged vehicle, thus providing an objective basis for subsequent damage assessment, claim settlement, and repair.
-
Publication No.: US11379696B2
Publication date: 2022-07-05
Application No.: US16817419
Application date: 2020-03-12
Inventor: Zhigang Wang , Jian Wang , Shilei Wen , Errui Ding , Hao Sun
Abstract: The present disclosure provides a pedestrian re-identification method and apparatus, computer device and readable medium. The method comprises: collecting a target image and a to-be-identified image including a pedestrian image; obtaining a feature expression of the target image and a feature expression of the to-be-identified image respectively, based on a pre-trained feature extraction model; wherein the feature extraction model is obtained by training based on a self-attention feature of a base image as well as a co-attention feature of the base image relative to a reference image; identifying whether a pedestrian in the to-be-identified image is the same pedestrian as that in the target image according to the feature expression of the target image and the feature expression of the to-be-identified image. According to the pedestrian re-identification method of the present disclosure, the accuracy of the pedestrian re-identification can be effectively improved when the feature extraction model is used to perform the pedestrian re-identification.
-
Publication No.: US20220198828A1
Publication date: 2022-06-23
Application No.: US17671125
Application date: 2022-02-14
Inventor: Hao Sun , Fu Li , Xin Li , Tianwei Lin
Abstract: A method and apparatus for generating an image are provided. The method comprises: acquiring key point information of at least one of the five sense organs in a real facial image; determining, according to the key point information, a target area where the five sense organs are located in a first cartoon facial image, wherein the first cartoon facial image is generated from the real facial image; and adding pre-established material for the five sense organs to the target area to generate a second cartoon facial image.
-
Publication No.: US20210319261A1
Publication date: 2021-10-14
Application No.: US17354557
Application date: 2021-06-22
Inventor: Xiaoqing Ye , Xiao Tan , Hao Sun
Abstract: A vehicle information detection method, a method for training a detection model, an electronic device and a storage medium are provided, relating to the technical field of artificial intelligence, in particular to computer vision and deep learning. The method includes: performing a first target detection operation based on an image of a target vehicle, to obtain a first detection result for target information of the target vehicle; performing an error detection operation based on the first detection result, to obtain error information; and performing a second target detection operation based on the first detection result and the error information, to obtain a second detection result for the target information.
-
Publication No.: US20210319062A1
Publication date: 2021-10-14
Application No.: US17182960
Application date: 2021-02-23
Inventor: Xiang Long , Ping Wang , Fu Li , Dongliang He , Hao Sun , Shilei Wen
IPC: G06F16/783 , G06K9/00 , G06K9/62
Abstract: Embodiments of the present disclosure disclose a method and apparatus for searching a video segment, a device and a medium, and relate to the field of video data search. The method includes: sampling video frames from a target video and videos to be searched in a video library, and extracting features from the sampled frames; matching the target video and the videos to be searched according to the extracted features to determine a candidate video to be searched that matches the target video; determining at least one candidate video segment from the determined candidate video, and calculating a degree of matching between the target video and each candidate video segment based on the extracted features of each sampled frame; and determining a video segment matching the target video in the videos to be searched according to the calculated degree of matching between the target video and each candidate video segment.
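The segment-level matching step can be illustrated with a small sliding-window sketch. It assumes, beyond what the abstract states, that each sampled frame is represented by a feature vector, that candidate segments are all windows with the same number of frames as the target video, and that the degree of matching is the mean cosine similarity of aligned frames. `cosine` and `best_segment` are hypothetical helper names.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def best_segment(query: List[Vector], candidate: List[Vector]) -> Tuple[int, float]:
    """Return (start index, matching degree) of the best window.

    Returns (-1, -1.0) if the candidate is shorter than the query.
    """
    n = len(query)
    best_start, best_score = -1, -1.0
    for start in range(len(candidate) - n + 1):
        window = candidate[start:start + n]
        score = sum(cosine(q, c) for q, c in zip(query, window)) / n
        if score > best_score:
            best_start, best_score = start, score
    return best_start, best_score


# Illustrative features only: the query matches frames 1..2 of the candidate.
query_feats = [[1.0, 0.0], [0.0, 1.0]]
candidate_feats = [[1.0, 1.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
start, score = best_segment(query_feats, candidate_feats)
```

In a full pipeline this function would run only on the candidate videos that survive the coarser video-level matching described earlier in the abstract.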
-
Publication No.: US20210232856A1
Publication date: 2021-07-29
Application No.: US17213746
Application date: 2021-03-26
Inventor: Yingying Li , Xiao Tan , Minyue Jiang , Hao Sun
Abstract: The present disclosure provides an image processing method. An image to be classified is input into a feature extraction model to generate N dimensional features. Dimension fusion is performed on M features of the N dimensional features to obtain M dimension fusion features. The image to be classified is processed based on M dimension fusion features and remaining features of the N dimensional features other than the M features.
-
Publication No.: US11915484B2
Publication date: 2024-02-27
Application No.: US17304296
Application date: 2021-06-17
Inventor: Zhigang Wang , Jian Wang , Errui Ding , Hao Sun
IPC: G06K9/46 , G06K9/62 , G06V20/52 , G06F18/23 , G06F18/214 , G06F18/21 , G06V10/762 , G06V10/764 , G06V10/774 , G06V10/82 , G06V20/64 , G06V40/10
CPC classification number: G06V20/52 , G06F18/214 , G06F18/2178 , G06F18/23 , G06V10/762 , G06V10/764 , G06V10/7753 , G06V10/82 , G06V20/64 , G06V40/10 , G06V2201/07
Abstract: A method, an apparatus, a device and a storage medium for generating a target re-recognition model are provided. The method may include: acquiring a set of labeled samples, a set of unlabeled samples and an initialization model obtained through supervised training; performing feature extraction on each sample in the set of the unlabeled samples by using the initialization model; clustering features extracted from the set of the unlabeled samples by using a clustering algorithm; assigning, for each sample in the set of the unlabeled samples, a pseudo label to the sample according to a cluster corresponding to the sample in a feature space; and mixing a set of samples with a pseudo label and the set of the labeled samples as a set of training samples, and performing supervised training on the initialization model to obtain a target re-recognition model.
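The pseudo-labeling steps in this abstract can be sketched as below. The abstract names no particular clustering algorithm, so a tiny k-means with deterministic initialization is swapped in here as an assumption, and the feature extractor is a stand-in callable rather than a trained model. The final supervised training step is not shown. All function names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]


def kmeans(points: List[Vector], k: int, iters: int = 10) -> List[int]:
    """Minimal k-means; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    assignment = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest centroid (squared distance).
        assignment = [
            min(range(k), key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignment


def build_training_set(
    labeled: List[Tuple[object, str]],
    unlabeled: List[object],
    extract: Callable[[object], Vector],
    k: int,
) -> List[Tuple[object, str]]:
    """Cluster unlabeled features, assign pseudo labels, and mix both sets."""
    features = [extract(sample) for sample in unlabeled]
    clusters = kmeans(features, k)
    pseudo = [(sample, f"pseudo_{c}") for sample, c in zip(unlabeled, clusters)]
    return list(labeled) + pseudo


# Illustrative data only: samples are already 2D feature vectors.
labeled_set = [((1.0, 1.0), "id_7")]
unlabeled_set = [(0.0, 0.0), (10.0, 10.0), (0.5, 0.0), (10.0, 9.5)]
training_set = build_training_set(labeled_set, unlabeled_set, extract=lambda s: s, k=2)
```

The mixed list would then be passed to whatever supervised training routine fine-tunes the initialization model.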
-
Publication No.: US11657612B2
Publication date: 2023-05-23
Application No.: US17194131
Application date: 2021-03-05
Inventor: Dongliang He , Xiao Tan , Shilei Wen , Hao Sun
IPC: G06V20/40 , G06F18/2411
CPC classification number: G06V20/40 , G06F18/2411 , G06V20/41 , G06V20/46 , G06V20/48
Abstract: Embodiments of the present disclosure disclose a method and apparatus for identifying a video. A specific embodiment of the method includes: acquiring a predetermined number of video frames from a video to be identified to obtain a video frame sequence; performing the following processing step: importing the video frame sequence into a pre-trained video identification model to obtain a classification tag probability corresponding to the video frame sequence, wherein the classification tag probability is used to characterize a probability of identifying a corresponding tag category of the video to be identified; and setting, in response to the classification tag probability being greater than or equal to a preset identification accuracy threshold, a video tag for the video to be identified according to the classification tag probability, or else increasing the number of video frames in the video frame sequence and continuing to perform the above processing step.
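The loop described in this abstract (classify, compare against the threshold, otherwise add frames and retry) can be sketched as follows. The uniform sampling scheme, the `step` increment, the stop condition once every frame has been used, and the stand-in model are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import Callable, Dict, List, Optional, Tuple

Model = Callable[[List[object]], Dict[str, float]]


def sample_frames(video: List[object], n: int) -> List[object]:
    """Pick n frames spread uniformly over the video (assumed scheme)."""
    return [video[i * len(video) // n] for i in range(n)]


def identify_video(
    video: List[object],
    model: Model,
    initial_frames: int,
    threshold: float,
    step: int = 1,
) -> Optional[Tuple[str, float, int]]:
    """Return (tag, probability, frames used), or None if never confident."""
    n = min(initial_frames, len(video))
    while True:
        probs = model(sample_frames(video, n))
        tag, prob = max(probs.items(), key=lambda kv: kv[1])
        if prob >= threshold:
            return tag, prob, n
        if n >= len(video):
            return None  # every frame already used; give up
        n = min(n + step, len(video))


# Stand-in model: confidence in "sports" grows with the number of frames.
def toy_model(frames: List[object]) -> Dict[str, float]:
    return {"sports": min(1.0, 0.2 * len(frames)), "news": 0.1}


result = identify_video(list(range(10)), toy_model, initial_frames=2, threshold=0.75)
```

Starting with few frames keeps the common case cheap, and the frame count grows only for videos the model finds hard to classify.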
-