-
公开(公告)号:US20230095673A1
公开(公告)日:2023-03-30
申请号:US17888300
申请日:2022-08-15
Applicant: Oracle International Corporation
Inventor: Yakupitiyage Don Thanuja Samodhye Dharmasiri , Xu Zhong , Ahmed Ataallah Ataallah Abobakr , Hongtao Yang , Budhaditya Saha , Shaoke Xu , Shashi Prasad Suravarapu , Mark Edward Johnson , Thanh Long Duong
IPC: G06V10/82 , G06V30/412 , G06V30/148
Abstract: Techniques for extracting key information from a document using machine-learning models in a chatbot system is disclosed herein. In one particular aspect, a method is provided that includes receiving a set of data, which includes key fields, within a document at a data processing system that includes a table detection module, a key information extraction module, and a table extraction module. Text information and corresponding location data are extracted via optical character recognition. The table detection module detects whether one or more tables are present in the document and, if applicable, a location of each of the tables. The key information extraction module extracts text from the key fields. The table extraction module extracts each of the tables based on input from the optical character recognition and the table detection module. Extraction results include the text from the key fields and each of the tables can be output.
-
公开(公告)号:US12217497B2
公开(公告)日:2025-02-04
申请号:US17888300
申请日:2022-08-15
Applicant: Oracle International Corporation
Inventor: Yakupitiyage Don Thanuja Samodhye Dharmasiri , Xu Zhong , Ahmed Ataallah Ataallah Abobakr , Hongtao Yang , Budhaditya Saha , Shaoke Xu , Shashi Prasad Suravarapu , Mark Edward Johnson , Thanh Long Duong
IPC: G06V10/82 , G06V30/148 , G06V30/412
Abstract: Techniques for extracting key information from a document using machine-learning models in a chatbot system is disclosed herein. In one particular aspect, a method is provided that includes receiving a set of data, which includes key fields, within a document at a data processing system that includes a table detection module, a key information extraction module, and a table extraction module. Text information and corresponding location data are extracted via optical character recognition. The table detection module detects whether one or more tables are present in the document and, if applicable, a location of each of the tables. The key information extraction module extracts text from the key fields. The table extraction module extracts each of the tables based on input from the optical character recognition and the table detection module. Extraction results include the text from the key fields and each of the tables can be output.
-
公开(公告)号:US20250157209A1
公开(公告)日:2025-05-15
申请号:US19002208
申请日:2024-12-26
Applicant: Oracle International Corporation
Inventor: Yakupitiyage Don Thanuja Samodhye Dharmasiri , Xu Zhong , Ahmed Ataallah Ataallah Abobakr , Hongtao Yang , Budhaditya Saha , Shaoke Xu , Shashi Prasad Suravarapu , Mark Edward Johnson , Thanh Long Duong
IPC: G06V10/82 , G06V30/148 , G06V30/412
Abstract: Techniques for extracting key information from a document using machine-learning models in a chatbot system is disclosed herein. In one particular aspect, a method is provided that includes receiving a set of data, which includes key fields, within a document at a data processing system that includes a table detection module, a key information extraction module, and a table extraction module. Text information and corresponding location data are extracted via optical character recognition. The table detection module detects whether one or more tables are present in the document and, if applicable, a location of each of the tables. The key information extraction module extracts text from the key fields. The table extraction module extracts each of the tables based on input from the optical character recognition and the table detection module. Extraction results include the text from the key fields and each of the tables can be output.
-
-