-
公开(公告)号:US12154361B2
公开(公告)日:2024-11-26
申请号:US17313755
申请日:2021-05-06
Inventor: Xingyao Chen , Canlu Huang , Wencan Hu , Yidong Chen , Hanquan Lin , Fei Huang , Geyang Ke , Zhiquan Yang
IPC: G06F17/00 , G06F18/214 , G06F40/106 , G06V10/44 , G06V30/146 , G06V30/148 , G06V30/414 , G06V30/10
Abstract: A method of image-to-document conversion based on optical character recognition (OCR) includes obtaining an image to be converted into a target document, and performing layout segmentation on the image according to image content of the image, to obtain n image layouts, each of the n image layouts corresponding to a content type, and n being a positive integer. The method also includes, for each of the n image layouts, processing image content in the respective image layout according to the content type corresponding to the respective image layout, to obtain converted content corresponding to the respective image layout. The method further includes adding the converted content corresponding to the n image layouts to an electronic document, to obtain the target document.
-
公开(公告)号:US12182976B2
公开(公告)日:2024-12-31
申请号:US17711852
申请日:2022-04-01
IPC: G06T5/77 , G06T3/4053 , G06T5/50 , G06T7/90 , G06V10/56 , G06V10/75 , G06V10/77 , G06V10/774 , G06V10/776
Abstract: Methods, apparatuses and computer-readable storage mediums for processing images to remove moiré patterns. The method entails running an image processing model to perform moiré pattern removal processing on an original image to obtain a target image. The image processing model may include a network model trained according to a moiré pattern training data set. The image processing model can also include a multi-band module, which may be configured to process the original image to obtain an N-layer Laplacian pyramid of the original image, and obtain a first processing result feature map based on feature maps corresponding to N spatial frequency bands of the N-layer Laplacian pyramid, with N being a positive integer greater than or equal to 2. The image processing model can include obtaining the target image according to the first processing result feature map.
-