Computing machine and template management method

    Publication No.: US10949610B2

    Publication date: 2021-03-16

    Application No.: US16270621

    Filing date: 2019-02-08

    Applicant: Hitachi, Ltd.

    Abstract: A computing machine managing a template manages a document format, a template, and a cluster generated on the basis of a classification result based on a position of the document in a feature space in such a manner that they correspond to one another; determines whether a cluster to which a target document can belong is present on the basis of a position, in the feature space, of the target document in a case of detecting an opportunity of generating a template of the target document; registers the template of the target document in a case of determining that the cluster to which the target document can belong is not present; generates a cluster corresponding to the registered template; and manages a document format of the target document, the registered template, and the generated cluster in such a manner that they correspond to one another.
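The register-or-reuse decision described in this abstract can be illustrated with a minimal sketch. It assumes a Euclidean feature space and a fixed membership radius per cluster; the names `Cluster`, `TemplateRegistry`, `find_cluster`, and `register_if_new` are hypothetical and do not come from the patent, which does not specify the distance measure or membership test here.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Cluster:
    template_id: str        # template associated with this cluster
    document_format: str    # document format associated with this cluster
    centroid: tuple         # position of the cluster in the feature space
    radius: float           # assumed membership threshold (not from the patent)


@dataclass
class TemplateRegistry:
    clusters: list = field(default_factory=list)

    def find_cluster(self, position):
        """Return the nearest cluster whose radius covers `position`, else None."""
        best, best_dist = None, math.inf
        for cluster in self.clusters:
            dist = math.dist(position, cluster.centroid)
            if dist <= cluster.radius and dist < best_dist:
                best, best_dist = cluster, dist
        return best

    def register_if_new(self, position, document_format, template_id, radius=1.0):
        """Register a template and a new cluster only if no cluster fits."""
        existing = self.find_cluster(position)
        if existing is not None:
            return existing, False
        cluster = Cluster(template_id, document_format, tuple(position), radius)
        self.clusters.append(cluster)
        return cluster, True
```

In this sketch, a document whose feature vector falls inside an existing cluster reuses that cluster's template, while a document far from every cluster triggers registration of a new template and cluster.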

    Computer, document identification method, and system

    Publication No.: US10552674B2

    Publication date: 2020-02-04

    Application No.: US15918830

    Filing date: 2018-03-12

    Applicant: HITACHI, LTD.

    Abstract: A computer, which is configured to extract an attribute being a character string indicating a feature of a paper-based document, stores template information and dictionary information. The computer is configured to: execute character recognition processing on image data on the paper-based document; extract an attribute corresponding to each of at least one type of attribute, which is defined in each of a plurality of templates, through use of a result of the character recognition processing and the plurality of templates; calculate a score regarding the extracted attribute for each of the plurality of templates; select one of the plurality of templates that has the highest extraction accuracy of the attribute based on the score; and generate output information through use of the selected template.
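The score-based template selection in this abstract could look roughly like the following sketch. It assumes each template maps attribute types to regular expressions and that the score is simply the fraction of attribute types successfully extracted; both the template layout and the scoring rule are illustrative assumptions, not the method claimed in the patent, and the function names are hypothetical.

```python
import re


def extract_attributes(ocr_text, template):
    """Extract one value per attribute type defined in the template."""
    found = {}
    for attr_type, pattern in template["patterns"].items():
        match = re.search(pattern, ocr_text)
        if match:
            found[attr_type] = match.group(1)
    return found


def score(found, template):
    """Assumed score: share of the template's attribute types that were found."""
    patterns = template["patterns"]
    return len(found) / len(patterns) if patterns else 0.0


def select_template(ocr_text, templates):
    """Return (template name, score, extracted attributes) for the best template."""
    best = None
    for template in templates:
        found = extract_attributes(ocr_text, template)
        s = score(found, template)
        if best is None or s > best[1]:
            best = (template["name"], s, found)
    return best


# Hypothetical templates and OCR output for illustration only.
templates = [
    {"name": "invoice",
     "patterns": {"total": r"Total:\s*([\d.]+)",
                  "date": r"Date:\s*(\d{4}-\d{2}-\d{2})"}},
    {"name": "receipt",
     "patterns": {"total": r"Amount paid:\s*([\d.]+)",
                  "store": r"Store:\s*(\w+)"}},
]
ocr_text = "Date: 2018-03-12\nTotal: 120.50"
name, best_score, attributes = select_template(ocr_text, templates)
```

The output information would then be generated from `attributes`, the values extracted with the selected template.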

    Computer and document identification method

    Publication No.: US10783366B2

    Publication date: 2020-09-22

    Application No.: US16117198

    Filing date: 2018-08-30

    Applicant: HITACHI, LTD.

    Abstract: A computer that extracts an attribute which is a text string contained in a predetermined examination target document stores template information for managing a plurality of templates in which a type of attribute is defined, executes a text recognition process on image data of the document, extracts an attribute corresponding to the type of attribute using a result of the text recognition process and the plurality of templates, selects a template based on the extracted attribute, generates output information that includes the attribute extracted using the selected template and is used for the examination, determines a type of confirmation operation performed on the output information, before the examination, based on a comparison result between an evaluation value indicating credibility of the output information and a threshold, and corrects the determined type of confirmation operation based on the text string contained in the document.
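The final two steps of this abstract (choosing a confirmation operation from a threshold comparison, then correcting it from the document text) can be sketched as follows. The operation names `single_check` and `double_check`, the keyword list, and the function `decide_confirmation` are hypothetical; the patent abstract does not specify which operation types exist or how the text string drives the correction.

```python
def decide_confirmation(evaluation_value, threshold, document_text,
                        escalation_keywords=("amended", "void")):
    """Pick a confirmation operation, then correct it using the document text."""
    # Step 1: compare the credibility evaluation value with the threshold.
    if evaluation_value >= threshold:
        operation = "single_check"
    else:
        operation = "double_check"

    # Step 2 (assumed correction rule): escalate when the document text
    # contains a keyword that suggests the output needs closer review.
    lowered = document_text.lower()
    if operation == "single_check" and any(k in lowered for k in escalation_keywords):
        operation = "double_check"
    return operation
```

In this sketch, a highly credible extraction still receives the stricter check when the document itself contains a flagged keyword.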

    COMPUTER AND DOCUMENT IDENTIFICATION METHOD
    Patent application

    Publication No.: US20190138804A1

    Publication date: 2019-05-09

    Application No.: US16117198

    Filing date: 2018-08-30

    Applicant: HITACHI, LTD.

    CPC classification number: G06K9/00442 G06K9/00993 G06K9/03 G06K2209/01

    Abstract: A computer that extracts an attribute which is a text string contained in a predetermined examination target document stores template information for managing a plurality of templates in which a type of attribute is defined, executes a text recognition process on image data of the document, extracts an attribute corresponding to the type of attribute using a result of the text recognition process and the plurality of templates, selects a template based on the extracted attribute, generates output information that includes the attribute extracted using the selected template and is used for the examination, determines a type of confirmation operation performed on the output information, before the examination, based on a comparison result between an evaluation value indicating credibility of the output information and a threshold, and corrects the determined type of confirmation operation based on the text string contained in the document.
