-
Publication No.: US20210224568A1
Publication Date: 2021-07-22
Application No.: US17209987
Filing Date: 2021-03-23
Inventor: Xiaoqiang Zhang, Pengyuan LV, Shanshan Liu, Chengquan Zhang
IPC: G06K9/32, G06K9/34, G06K9/62, G06F40/284, G06F40/205
Abstract: The present disclosure discloses a method and apparatus for recognizing a text. The method comprises: acquiring images of a text area of an input image, the acquired images including a text centerline graph, a text direction offset graph, a text boundary offset graph, and a text character classification graph; extracting coordinates of feature points of a character center from the text centerline graph; sorting the extracted coordinates of the feature points based on the text direction offset graph to obtain a coordinate sequence of the feature points; determining a polygonal bounding box of the text area based on the coordinate sequence of the feature points of the character center and the text boundary offset graph; and determining a classification result of the feature points of the character center, based on the coordinate sequence of the feature points of the character center and the text character classification graph.
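The post-processing steps named in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the 0.5 threshold, the per-pixel layout of each graph (direction as `(dy, dx)`, boundary offsets as `(up_dy, up_dx, down_dy, down_dx)`), sorting by projection onto the mean direction, and per-point argmax classification are all hypothetical choices made for the example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[int, int]  # (row, col) pixel coordinate


def extract_center_points(centerline: Sequence[Sequence[float]],
                          threshold: float = 0.5) -> List[Point]:
    """Collect pixels whose centerline score exceeds an assumed threshold."""
    return [(r, c)
            for r, row in enumerate(centerline)
            for c, score in enumerate(row)
            if score > threshold]


def sort_points(points: List[Point], direction) -> List[Point]:
    """Order points by their projection onto the mean (dy, dx) direction.

    Assumes direction[r][c] is a (dy, dx) pair read from the text
    direction offset graph.
    """
    if not points:
        return []
    dy = sum(direction[r][c][0] for r, c in points) / len(points)
    dx = sum(direction[r][c][1] for r, c in points) / len(points)
    return sorted(points, key=lambda p: p[0] * dy + p[1] * dx)


def bounding_polygon(ordered: List[Point], boundary) -> List[Point]:
    """Build a polygon from the upper and lower boundary points.

    Assumes boundary[r][c] is (up_dy, up_dx, down_dy, down_dx): offsets
    from a center point to the upper and lower text boundaries.
    """
    upper = [(r + boundary[r][c][0], c + boundary[r][c][1]) for r, c in ordered]
    lower = [(r + boundary[r][c][2], c + boundary[r][c][3]) for r, c in ordered]
    # Walk along the upper edge, then back along the lower edge.
    return upper + lower[::-1]


def classify_points(ordered: List[Point], char_scores, alphabet: str) -> List[str]:
    """Pick the highest-scoring class at each ordered center point.

    Assumes char_scores[r][c] is a list with one score per alphabet entry.
    """
    labels = []
    for r, c in ordered:
        scores = char_scores[r][c]
        best = max(range(len(scores)), key=scores.__getitem__)
        labels.append(alphabet[best])
    return labels
```

Keeping the four steps as separate functions mirrors the order in the abstract: the ordered coordinate sequence produced by `sort_points` feeds both the polygon construction and the classification.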
-
Publication No.: US20210342621A1
Publication Date: 2021-11-04
Application No.: US17373378
Filing Date: 2021-07-12
Inventor: Pengyuan LV, Chengquan Zhang
Abstract: The disclosure provides a method and an apparatus for character recognition and processing. A character region is labelled for each character contained in each sample image of a sample image set. A character category and a character position code corresponding to each character region are labelled. A preset neural network model for character recognition is trained based on the sample image set having labelled character regions, character categories and character position codes corresponding to the character regions.
-
Publication No.: US20230290126A1
Publication Date: 2023-09-14
Application No.: US18115059
Filing Date: 2023-02-28
Inventor: Pengyuan LV, Sen FAN, Chengquan ZHANG, Kun YAO, Junyu HAN, Jingtuo LIU, Errui DING, Jingdong WANG
IPC: G06V10/774, G06V10/77, G06V10/42, G06V10/44, G06V10/25
CPC classification number: G06V10/7747, G06V10/25, G06V10/42, G06V10/44, G06V10/7715, G06V30/148
Abstract: Provided are a method for training a region of interest (ROI) detection model, a method for detecting an ROI, a device, and a medium. The specific implementation includes: performing feature extraction on a sample image to obtain sample feature data; performing non-linear mapping on the sample feature data to obtain first feature data and second feature data; determining inter-region difference data according to the second feature data and third feature data, the third feature data being the first feature data in a region associated with a label ROI; and adjusting at least one of a to-be-trained feature extraction parameter and a to-be-trained feature enhancement parameter of the ROI detection model according to the inter-region difference data and the region associated with the label ROI.
-
-