MACHINE LEARNING ENABLED DOCUMENT DESKEWING
    3.
    Published patent application

    Publication No.: US20230222632A1

    Publication date: 2023-07-13

    Application No.: US17570678

    Filing date: 2022-01-07

    Applicant: SAP SE

    Abstract: A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.
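The step from document bounding box to deskewing transformation can be illustrated with a minimal sketch. It assumes the document bounding box is a quadrilateral given as four corner points (top-left, top-right, bottom-right, bottom-left) and that the transformation is a projective one (a homography) onto an upright rectangle. The abstract does not fix either choice. The machine learning model that predicts the box is not shown, and the corner values and the function names `homography_from_quad`, `apply_homography`, and `deskew_transform` are hypothetical.

```python
import numpy as np


def homography_from_quad(src_quad, dst_quad):
    """Solve for the 3x3 projective transform that maps 4 source points to 4 target points."""
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src_quad, dst_quad):
        # Two linear equations per point pair, with the bottom-right entry of H fixed to 1.
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.extend([u, v])
    h = np.linalg.solve(np.array(rows, dtype=float), np.array(rhs, dtype=float))
    return np.append(h, 1.0).reshape(3, 3)


def apply_homography(H, points):
    """Map 2D points through H using homogeneous coordinates."""
    pts = np.asarray(points, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ H.T
    return mapped[:, :2] / mapped[:, 2:3]


def deskew_transform(doc_quad):
    """Return (H, target) where H maps the skewed quad onto an upright rectangle.

    Assumed corner order: top-left, top-right, bottom-right, bottom-left.
    """
    quad = np.asarray(doc_quad, dtype=float)
    width = max(np.linalg.norm(quad[1] - quad[0]), np.linalg.norm(quad[2] - quad[3]))
    height = max(np.linalg.norm(quad[3] - quad[0]), np.linalg.norm(quad[2] - quad[1]))
    target = np.array([[0, 0], [width, 0], [width, height], [0, height]], dtype=float)
    return homography_from_quad(quad, target), target


# Hypothetical document bounding box, standing in for the model's prediction.
doc_quad = [(10, 20), (110, 30), (105, 180), (5, 170)]
H, target = deskew_transform(doc_quad)
print(apply_homography(H, doc_quad))  # corners after deskewing
```

In a full pipeline the same matrix would be used to resample the image pixels, for example with an image library's perspective-warp routine. That step is omitted here to keep the sketch dependent on NumPy only.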

    METHOD OF RECTIFYING TEXT IMAGE, TRAINING METHOD, ELECTRONIC DEVICE, AND MEDIUM

    Publication No.: US20230102804A1

    Publication date: 2023-03-30

    Application No.: US18077026

    Filing date: 2022-12-07

    Abstract: A method of rectifying a text image, a training method, an electronic device, and a medium are provided, which relate to the field of artificial intelligence technology, in particular to the fields of computer vision, deep learning technology, intelligent transportation and high-precision maps. An exemplary implementation includes: performing, based on a gating strategy, a plurality of first layer-wise processing on a text image to be rectified, so as to obtain respective feature maps of a plurality of layer levels, wherein each of the feature maps includes a text structural feature related to the text image to be rectified, and the gating strategy is configured to increase attention to the text structural feature; and performing a plurality of second layer-wise processing on the respective feature maps of the plurality of layer levels, so as to obtain a rectified text image corresponding to the text image to be rectified.

    Inspection apparatus, control method, and inspection method

    Publication No.: US12106532B2

    Publication date: 2024-10-01

    Application No.: US17929553

    Filing date: 2022-09-02

    Inventor: Reiji Misawa

    Abstract: An inspection apparatus selects at least one character area in a first preview image obtained by reading and previewing a print product, sets a direction for a character in the selected character area, registers the set direction and the character in the selected character area in association with each other, selects at least one character inspection area in a second preview image obtained by reading and previewing a print product as an inspection target, sets a direction for a character in the selected character inspection area, rotates the character inspection area to match the set direction with the direction set for the character in the selected character area, performs character recognition for the character in the rotated character inspection area, and inspects the character inspection area based on a result of the character recognition and a result of recognizing the character in the selected character area.
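The rotation step in this abstract can be sketched as follows. The sketch assumes that each direction is one of 0, 90, 180, or 270 degrees, measured as the clockwise rotation of the character's top relative to the top of the image, and that the character inspection area is a NumPy array. Neither assumption comes from the abstract, and the function name `rotate_to_match` is hypothetical. Character recognition and the final comparison are not shown.

```python
import numpy as np

ALLOWED_DIRECTIONS = (0, 90, 180, 270)


def rotate_to_match(inspection_area, inspection_dir, registered_dir):
    """Rotate the inspection area so its character direction equals the registered one.

    Directions are clockwise angles in degrees (an assumed convention).
    """
    if inspection_dir not in ALLOWED_DIRECTIONS or registered_dir not in ALLOWED_DIRECTIONS:
        raise ValueError("directions must be one of 0, 90, 180, 270")
    # np.rot90 rotates counterclockwise, so undo the clockwise difference.
    quarter_turns = ((inspection_dir - registered_dir) // 90) % 4
    return np.rot90(inspection_area, quarter_turns)


# A tiny stand-in for an upright character area (direction 0).
upright = np.array([[1, 2],
                    [3, 4]])
# The same area as scanned with the character turned 90 degrees clockwise.
scanned = np.rot90(upright, -1)

restored = rotate_to_match(scanned, inspection_dir=90, registered_dir=0)
print(np.array_equal(restored, upright))
```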