-
公开(公告)号:US11881044B2
公开(公告)日:2024-01-23
申请号:US17353540
申请日:2021-06-21
Inventor: Chengquan Zhang , Mengyi En , Ju Huang , Qunyi Xie , Xiameng Qin , Kun Yao , Junyu Han , Jingtuo Liu , Errui Ding
IPC: G06V30/414 , G06T7/136 , G06T7/11 , G06F18/213 , G06V30/146 , G06V30/18 , G06V10/764 , G06V10/82 , G06V30/10
CPC classification number: G06V30/414 , G06F18/213 , G06T7/11 , G06T7/136 , G06V10/764 , G06V10/82 , G06V30/147 , G06V30/18057 , G06T2207/30176 , G06V30/10
Abstract: A method and apparatus for processing an image, a device and a storage medium are provided. An implementation of the method includes: acquiring a template image, the template image including at least one region of interest; determining a first feature map corresponding to each region of interest in the template image; acquiring a target image; determining a second feature map of the target image; and determining at least one region of interest in the target image according to the first feature map and the second feature map.
-
公开(公告)号:US11756332B2
公开(公告)日:2023-09-12
申请号:US17208568
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
IPC: G06V40/10 , G06V40/16 , G06N3/08 , G06F18/10 , G06F18/20 , G06N3/045 , G06V10/764 , G06V10/80 , G06V10/82
CPC classification number: G06V40/171 , G06F18/10 , G06F18/29 , G06N3/045 , G06N3/08 , G06V10/764 , G06V10/806 , G06V10/82
Abstract: The present application discloses an image recognition method, apparatus, device, and a computer storage medium, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. The method includes: performing organ recognition on a human face image and marking positions of the human facial five sense organs in the human face image, obtaining a marked human face image; inputting the marked human face image into a backbone network model and performing feature extraction, obtaining defect features of the marked human face image outputted by different convolutional neural network levels of the backbone network model; and fusing the defect features of different levels that are located in a same area of the human face image, obtaining a defect recognition result of the human face image.
-
公开(公告)号:US11861919B2
公开(公告)日:2024-01-02
申请号:US17352668
申请日:2021-06-21
Inventor: Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu
IPC: G06V20/00 , G06V20/62 , G06N3/08 , G06V30/262 , G06V20/58 , G06V30/148 , G06N3/045 , G06V30/28 , G06V30/10
CPC classification number: G06V20/62 , G06N3/045 , G06N3/08 , G06V20/582 , G06V20/63 , G06V30/153 , G06V30/262 , G06V30/274 , G06V30/10 , G06V30/287 , G06V30/293
Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.
-
公开(公告)号:US11074437B2
公开(公告)日:2021-07-27
申请号:US15930714
申请日:2020-05-13
Inventor: Shihu Li , Xiangda Yan , Yuanzhang Chang , Zhibin Hong , Tianshu Hu , Kun Yao , Junyu Han , Jingtuo Liu , Shengxian Zhu
Abstract: A method, an electronic device and a storage medium for expression driving are disclosed. The method may include: performing facial key point detection on a driven character in a first image to obtain a first facial key point sequence; performing the following processing for each second image of a plurality of second images obtained successively: performing facial key point detection on a driving character in the second image to obtain a second facial key point sequence; obtaining a difference between the second facial key point sequence and an expressionless key point sequence which has been determined previously according to an analysis on the second facial key point sequence for a previous second image, and performing expression drive rendering on the driven character based on the difference and the first facial key point sequence. The technical solution may enhance flexibility, interactivity, accuracy etc.
-
公开(公告)号:US20210209343A1
公开(公告)日:2021-07-08
申请号:US17208568
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
Abstract: The present application discloses an image recognition method, apparatus, device, and a computer storage medium, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. The method includes: performing organ recognition on a human face image and marking positions of the human facial five sense organs in the human face image, obtaining a marked human face image; inputting the marked human face image into a backbone network model and performing feature extraction, obtaining defect features of the marked human face image outputted by different convolutional neural network levels of the backbone network model; and fusing the defect features of different levels that are located in a same area of the human face image, obtaining a defect recognition result of the human face image.
-
公开(公告)号:US11687779B2
公开(公告)日:2023-06-27
申请号:US17208611
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
CPC classification number: G06N3/08 , G06F18/10 , G06F18/253 , G06N3/04 , G06V10/52 , G06V10/806 , G06V40/171 , G06V40/172
Abstract: An image recognition method is provided, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. An implementation includes: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain the marked human face image; determining human face images at multiple scales of the marked human face image, inputting the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fusing the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.
-
公开(公告)号:US20210357710A1
公开(公告)日:2021-11-18
申请号:US17352668
申请日:2021-06-21
Inventor: Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu
Abstract: A text recognition method includes: acquiring an image including text information, the text information including M characters, M being a positive integer greater than 1; performing text recognition on the image to acquire character information about the M characters; recognizing reading direction information about each character in accordance with the character information about the M characters, the reading direction information being used to indicate a next character corresponding to a current character in a semantic reading order; and ranking the M characters in accordance with the reading direction information about the M characters to acquire a text recognition result of the text information.
-
公开(公告)号:US11908219B2
公开(公告)日:2024-02-20
申请号:US17244291
申请日:2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
IPC: G06V30/413 , G06F40/30 , G06V30/414 , G06V10/70
CPC classification number: G06V30/413 , G06F40/30 , G06V10/70 , G06V30/414
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
-
公开(公告)号:US20210334540A1
公开(公告)日:2021-10-28
申请号:US17370665
申请日:2021-07-08
Inventor: Yanlong Zhang , Mian Peng , Zuncheng Yang , Junyu Han , Jingtuo Liu
Abstract: A vehicle loss assessment method executed by a mobile terminal, a device, a mobile terminal, a medium and a computer program product are provided. The implementation solution includes: acquiring at least one input image; detecting vehicle identification information in the at least one input image; detecting vehicle damage information in the at least one input image; and determining a vehicle loss assessment result on the basis of the vehicle identification information and the vehicle damage information.
-
公开(公告)号:US20210271870A1
公开(公告)日:2021-09-02
申请号:US17244291
申请日:2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
-
-
-
-
-
-
-
-
-