Patent search ap:("Google LLC") AND inv:"Yang Xu" Page 1

1.

发明授权
Identifying key-value pairs in documents 有权

公开(公告)号：US11816710B2

公开(公告)日：2023-11-14

申请号：US17653097

申请日：2022-03-01

Applicant: Google LLC

Inventor： Yang Xu , Jiang Wang , Shengyang Dai

IPC: G06V30/142 , G06Q30/04 , G06V30/412 , G06V30/414

CPC classification number: G06Q30/04 , G06V30/412 , G06V30/414

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

2.

发明申请
IDENTIFYING KEY-VALUE PAIRS IN DOCUMENTS 审中-公开

公开(公告)号：US20200273078A1

公开(公告)日：2020-08-27

申请号：US16802864

申请日：2020-02-27

Applicant: Google LLC

Inventor： Yang Xu , Jiang Wang , Shengyang Dai

IPC: G06Q30/04 , G06N3/08 , G06K9/00

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

3.

发明申请
IDENTIFYING KEY-VALUE PAIRS IN DOCUMENTS 有权

公开(公告)号：US20220309549A1

公开(公告)日：2022-09-29

申请号：US17653097

申请日：2022-03-01

Applicant: Google LLC

Inventor： Yang Xu , Jiang Wang , Shengyang Dai

IPC: G06Q30/04 , G06V30/412 , G06V30/414

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method includes: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair including key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

4.

发明授权
Identifying key-value pairs in documents 有权

公开(公告)号：US11288719B2

公开(公告)日：2022-03-29

申请号：US16802864

申请日：2020-02-27

Applicant: Google LLC

Inventor： Yang Xu , Jiang Wang , Shengyang Dai

IPC: G06Q30/04 , G06K9/00 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.

Patent Agency Ranking