-
公开(公告)号:US11816710B2
公开(公告)日:2023-11-14
申请号:US17653097
申请日:2022-03-01
Applicant: Google LLC
Inventor: Yang Xu , Jiang Wang , Shengyang Dai
IPC: G06V30/142 , G06Q30/04 , G06V30/412 , G06V30/414
CPC classification number: G06Q30/04 , G06V30/412 , G06V30/414
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
-
公开(公告)号:US20200273078A1
公开(公告)日:2020-08-27
申请号:US16802864
申请日:2020-02-27
Applicant: Google LLC
Inventor: Yang Xu , Jiang Wang , Shengyang Dai
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
-
公开(公告)号:US20220309549A1
公开(公告)日:2022-09-29
申请号:US17653097
申请日:2022-03-01
Applicant: Google LLC
Inventor: Yang Xu , Jiang Wang , Shengyang Dai
IPC: G06Q30/04 , G06V30/412 , G06V30/414
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
-
公开(公告)号:US11288719B2
公开(公告)日:2022-03-29
申请号:US16802864
申请日:2020-02-27
Applicant: Google LLC
Inventor: Yang Xu , Jiang Wang , Shengyang Dai
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
-
-
-