-
公开(公告)号:US10949610B2
公开(公告)日:2021-03-16
申请号:US16270621
申请日:2019-02-08
Applicant: Hitachi, Ltd.
Inventor: Ryosuke Odate , Hiroshi Shinjo , Yasufumi Suzuki , Masahiro Motobayashi
IPC: G06F40/186 , G06T7/70 , G06K9/00 , G06F40/103 , G06F16/35
Abstract: A computing machine managing a template manages a document format, a template, and a cluster generated on the basis of a classification result based on a position of the document in a feature space in such a manner that they correspond to one another; determines whether a cluster to which a target document can belong is present on the basis of a position, in the feature space, of the target document in a case of detecting an opportunity of generating a template of the target document; registers the template of the target document in a case of determining that the cluster to which the target document can belong is not present; generates a cluster corresponding to the registered template; and manages a document format of the target document, the registered template, and the generated cluster in such a manner that they correspond to one another.
-
公开(公告)号:US10552674B2
公开(公告)日:2020-02-04
申请号:US15918830
申请日:2018-03-12
Applicant: HITACHI, LTD.
Inventor: Yasuo Watanabe , Toshio Okochi , Hiroshi Shinjo , Masahiro Motobayashi , Yasufumi Suzuki
Abstract: A computer, which is configured to extract an attribute being a character string indicating a feature of a paper-based document, the computer stores template information dictionary information. The computer is configured to: execute character recognition processing on image data on the paper-based document; extract an attribute corresponding to each of the at least one type of attribute, which is defined in each of the plurality of templates, through use of a result of the character recognition processing and the plurality of templates; calculate a score regarding the extracted attribute for each of the plurality of templates; select one of the plurality of templates that has the highest extraction accuracy of the attribute based on the score; and generate output information through use of the selected template.
-
公开(公告)号:US10783366B2
公开(公告)日:2020-09-22
申请号:US16117198
申请日:2018-08-30
Applicant: HITACHI, LTD.
Inventor: Yasufumi Suzuki , Hiroshi Shinjo , Ryosuke Odate , Masahiro Motobayashi
Abstract: A computer that extracts an attribute which is a text string contained in an predetermined examination target document stores template information for managing a plurality of templates in which a type of attribute is defined, executes a text recognition process on image data of the document, extracts an attribute corresponding to the type of attribute using a result of the text recognition process and the plurality of templates, selects a template based on the extracted attribute, generates output information that includes the attribute extracted using the selected template and is used for the examination; determines a type of confirmation operation performed on the output information, before the examination, based on a comparison result between an evaluation value indicating credibility of the output information and a threshold, and corrects the determined type of confirmation operation based on the text string contained in the document.
-
公开(公告)号:US20190138804A1
公开(公告)日:2019-05-09
申请号:US16117198
申请日:2018-08-30
Applicant: HITACHI, LTD.
Inventor: Yasufumi Suzuki , Hiroshi Shinjo , Ryosuke Odate , Masahiro Motobayashi
CPC classification number: G06K9/00442 , G06K9/00993 , G06K9/03 , G06K2209/01
Abstract: A computer that extracts an attribute which is a text string contained in an predetermined examination target document stores template information for managing a plurality of templates in which a type of attribute is defined, executes a text recognition process on image data of the document, extracts an attribute corresponding to the type of attribute using a result of the text recognition process and the plurality of templates, selects a template based on the extracted attribute, generates output information that includes the attribute extracted using the selected template and is used for the examination; determines a type of confirmation operation performed on the output information, before the examination, based on a comparison result between an evaluation value indicating credibility of the output information and a threshold, and corrects the determined type of confirmation operation based on the text string contained in the document.
-
-
-