-
公开(公告)号:US20210209395A1
公开(公告)日:2021-07-08
申请号:US17212712
申请日:2021-03-25
Inventor: Zihan Ni , Yipeng Sun , Junyu Han
Abstract: The disclosure provides a method for recognizing a license plate. The implementation includes: obtaining a feature map including a plurality of feature vectors of a license plate region; sequentially inputting the plurality of feature vectors based on a first order into a first recurrent neural network for encoding to obtain a first code of each of the plurality of feature vectors; sequentially inputting the plurality of feature vectors based on a second order into a second recurrent neural network for encoding to obtain a second code of each of the plurality of feature vectors; generating a plurality of target codes of the plurality of feature vectors based on the first code of each of the plurality of feature vectors and the second code of each of the plurality of feature vectors; and decoding the plurality of target codes to obtain a plurality of characters in the license plate.
-
公开(公告)号:US20210209343A1
公开(公告)日:2021-07-08
申请号:US17208568
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
Abstract: The present application discloses an image recognition method, apparatus, device, and a computer storage medium, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. The method includes: performing organ recognition on a human face image and marking positions of the human facial five sense organs in the human face image, obtaining a marked human face image; inputting the marked human face image into a backbone network model and performing feature extraction, obtaining defect features of the marked human face image outputted by different convolutional neural network levels of the backbone network model; and fusing the defect features of different levels that are located in a same area of the human face image, obtaining a defect recognition result of the human face image.
-
公开(公告)号:US11908219B2
公开(公告)日:2024-02-20
申请号:US17244291
申请日:2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
IPC: G06V30/413 , G06F40/30 , G06V30/414 , G06V10/70
CPC classification number: G06V30/413 , G06F40/30 , G06V10/70 , G06V30/414
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
-
公开(公告)号:US20210271870A1
公开(公告)日:2021-09-02
申请号:US17244291
申请日:2021-04-29
Inventor: Zihan Ni , Yipeng Sun , Kun Yao , Junyu Han , Errui Ding , Jingtuo Liu , Haifeng Wang
Abstract: The disclosure provides a method and a device for processing information, an electronic device, and a storage medium, belonging to a field of artificial intelligence including computer vision, deep learning, and natural language processing. In the method, the computing device recognizes multiple text items in the image. The computing device classifies multiple text items into a first set of name text items and a second set of content text items based on semantics of the text items. The computing device performs a matching operation between the first set and the second set based on a layout of the text items in the image, and determines matched name-content text items. The matched name-content text items include a name text item in the first set and a content text item matching the name text item and in the second set. The computing device outputs the matched name-content text items.
-
公开(公告)号:US20210209344A1
公开(公告)日:2021-07-08
申请号:US17208611
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
Abstract: An image recognition method is provided, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. An implementation includes: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain the marked human face image; determining human face images at multiple scales of the marked human face image, inputting the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fusing the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image
-
公开(公告)号:US11756332B2
公开(公告)日:2023-09-12
申请号:US17208568
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
IPC: G06V40/10 , G06V40/16 , G06N3/08 , G06F18/10 , G06F18/20 , G06N3/045 , G06V10/764 , G06V10/80 , G06V10/82
CPC classification number: G06V40/171 , G06F18/10 , G06F18/29 , G06N3/045 , G06N3/08 , G06V10/764 , G06V10/806 , G06V10/82
Abstract: The present application discloses an image recognition method, apparatus, device, and a computer storage medium, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. The method includes: performing organ recognition on a human face image and marking positions of the human facial five sense organs in the human face image, obtaining a marked human face image; inputting the marked human face image into a backbone network model and performing feature extraction, obtaining defect features of the marked human face image outputted by different convolutional neural network levels of the backbone network model; and fusing the defect features of different levels that are located in a same area of the human face image, obtaining a defect recognition result of the human face image.
-
公开(公告)号:US11687779B2
公开(公告)日:2023-06-27
申请号:US17208611
申请日:2021-03-22
Inventor: Zhizhi Guo , Yipeng Sun , Jingtuo Liu , Junyu Han
CPC classification number: G06N3/08 , G06F18/10 , G06F18/253 , G06N3/04 , G06V10/52 , G06V10/806 , G06V40/171 , G06V40/172
Abstract: An image recognition method is provided, which is related to a technical field of artificial intelligence, and in particular, to a technical field of image processing. An implementation includes: performing five-sense-organ recognition on a preprocessed human face image and marking positions of the human facial five sense organs in the human face image, to obtain the marked human face image; determining human face images at multiple scales of the marked human face image, inputting the human face images of multiple scales into a backbone network model, and performing feature extraction, to obtain a wrinkle feature of the human face image at each of the multiple scales; and fusing the wrinkle feature at each scale that is located in a same area of the human face image, to obtain a wrinkle recognition result of the human face image.
-
公开(公告)号:US11210546B2
公开(公告)日:2021-12-28
申请号:US16822085
申请日:2020-03-18
Inventor: Yipeng Sun , Chengquan Zhang , Zuming Huang , Jiaming Liu , Junyu Han , Errui Ding
Abstract: The present disclosure proposes an end-to-end text recognition method and apparatus, computer device and readable medium. The method comprises: obtaining a to-be-recognized picture containing a text region; recognizing a position of the text region in the to-be-recognized picture and text content included in the text region with a pre-trained end-to-end text recognition model; the end-to-end text recognition model comprising a region of interest perspective transformation processing module for performing perspective transformation processing for the text region. The technical solution of the present disclosure does not need to serially arrange a plurality of steps, and may avoid introducing the accumulated errors and may effectively improve the accuracy of the text recognition.
-
-
-
-
-
-
-