Patent search ipc:G06V30/414 Page 1

1.

发明申请
AUTOMATIC IMAGE CAPTURE SYSTEM BASED ON A DETERMINATION AND VERIFICATION OF A PHYSICAL OBJECT SIZE IN A CAPTURED IMAGE 有权

公开(公告)号：US20250061738A1

公开(公告)日：2025-02-20

申请号：US18935835

申请日：2024-11-04

Applicant: Capital One Services, LLC

Inventor： Jason PRIBBLE , Daniel Alan JARVIS , Nicholas CAPURSO

IPC: G06V30/414 , G06T7/62 , G06V30/142

Abstract: Described herein are method, system, and non-transitory computer-readable medium embodiments for capturing an image of a first object. A method can include determining that at least one image parameter value of the first object is within a threshold value based on an outline of the first object and at least one environmental feature. The environmental feature can include at least one of: an angle with respect to the first object and a camera, a distance between the first object and the camera, or a second object other than the first object in an image frame. The method can include triggering the camera to capture an image of the first object.

2.

发明授权
Multi-segment text search using machine learning model for text similarity 有权

公开(公告)号：US12230049B2

公开(公告)日：2025-02-18

申请号：US18205867

申请日：2023-06-05

Applicant: Cognition IP Technology Inc.

Inventor： Bryant Lee , Andrew Tjang , Andrew Perry Chu , Uday Pulleti

IPC: G06F17/00 , G06F40/226 , G06V30/413 , G06V30/414 , G06V30/418 , G06F16/93

Abstract: Systems and methods may be provided for performing a search on an input text block. The input text block may be split into a plurality of input text segments. A text similarity algorithm may be used to find similar stored text segments to each of the plurality of input text segments.

3.

发明授权
Rights mapping system and method 有权

公开(公告)号：US12230048B1

公开(公告)日：2025-02-18

申请号：US18137590

申请日：2023-04-21

Applicant: Thomson Reuters Enterprise Centre GmbH

Inventor： Nicholas E. Vandivere

IPC: G06V30/414 , G06F16/901 , G06F18/214 , G06Q50/163 , G06V30/412 , G06V30/413 , G06V30/19

Abstract: A method and system can include processing title and title opinion document images to generate text information. Trained models may generate data objects representative of period of time during which certain rights to a property exist. The trained models may also generate rules for modifying the data objects and interrelating the data objects to each other. In some examples, a confidence level can be generated and will reflect a likelihood of a data object including correct information. The modified and interrelated data objects may be used to generate a navigable interface which includes a current title status for a property and a navigable chain of title reflecting historical rights to the property.

4.

发明申请
ANNOTATION ALIGNMENT FOR CHARACTER RECOGNITION IN DOCUMENTS 有权

公开(公告)号：US20250054325A1

公开(公告)日：2025-02-13

申请号：US18231652

申请日：2023-08-08

Applicant: SAP SE

Inventor： Xiang Yu , Christoph Meyer

IPC: G06V30/14 , G06F40/284 , G06V10/70 , G06V30/19 , G06V30/414

Abstract: Systems and processes for aligning weakly-annotated data to recognized characters in a document are provided. In a method for aligning annotation data to recognized characters, annotation words and character recognition tokens are received, and a search algorithm is performed to align the annotation words to the tokens in a stepwise manner. At each step, an annotation word is aligned to one or more tokens, and a cost of each respective alignment is calculated. Once all annotation words are aligned, a full set of annotation word-token pairs corresponding to the annotation is selected based on a total cost of alignment for that set. A bounding box enclosing the tokens in the selected full set is generated and output to a target application.

5.

发明授权
Method for comparing content of two document files, and method for training a graph neural network structure to implement the same 有权

公开(公告)号：US12217521B2

公开(公告)日：2025-02-04

申请号：US17567192

申请日：2022-01-03

Applicant: FOXIT SOFTWARE INC.

Inventor： Po-Fang Hsu , Chi-Ching Wei

IPC: G06V30/414 , G06F40/205 , G06F40/279 , G06N3/08 , G06V30/196

Abstract: A method for comparing content of two document files each having a plurality of content blocks is provided. The method is to be implemented by an electronic device and includes the steps of: performing, for the each of the content blocks in each of the document files, a pre-process operation so as to obtain a plurality of properties associated with the content block; comparing, for each content block from one of the document files, the properties thereof with the properties of each of the plurality of content blocks of the other one of the document files; and generating a comparison result based on the operations of the comparing.

6.

发明授权
Method and apparatus for performing structured extraction on text, device and storage medium 有权

公开(公告)号：US12211304B2

公开(公告)日：2025-01-28

申请号：US17200448

申请日：2021-03-12

Applicant: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventor： Yulin Li , Xiameng Qin , Chengquan Zhang , Junyu Han , Errui Ding , Tian Wu , Haifeng Wang

IPC: G06F16/901 , G06N3/047 , G06N5/04 , G06V10/22 , G06V10/80 , G06V30/262 , G06V30/414 , G06V10/24

Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.

7.

发明授权
Machined learning supporting document data extraction 有权

公开(公告)号：US12190620B2

公开(公告)日：2025-01-07

申请号：US17160082

申请日：2021-01-27

Applicant: Automation Anywhere, Inc.

Inventor： Siddarth Sathi , Vibhas Gejji , Anish Hiranandani , Bruno Gomes Selva , Anjana Prabhakar

IPC: G06V30/416 , G06F16/242 , G06F40/00 , G06F40/177 , G06F40/20 , G06F40/279 , G06N3/045 , G06N20/00 , G06Q10/10 , G06Q40/12 , G06V30/00 , G06V30/148 , G06V30/19 , G06V30/40 , G06V30/412 , G06V30/413 , G06V30/414

Abstract: Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.

8.

发明申请
METHODS AND APPARATUS FOR EXTRACTING DATA FROM A DOCUMENT BY ENCODING IT WITH TEXTUAL AND VISUAL FEATURES AND USING MACHINE LEARNING 有权

公开(公告)号：US20250005952A1

公开(公告)日：2025-01-02

申请号：US18759395

申请日：2024-06-28

Applicant: Greenhouse Software, Inc.

Inventor： Triantafyllos XYLOURIS

IPC: G06V30/414 , G06V30/12

Abstract: An apparatus including a processor caused to receive document images, each including representations of characters. The processor is caused to parse each document image to extract, based on structure type, subsets of characters, to generate a text encoding for that document image. For each document, the processor is caused to extract visual features to generate a visual encoding for that document image, each visual feature associated with a subset of characters. The processor is caused to generate parsed documents, each parsed document uniquely associated with a document image and based on the text and visual encoding for that document image. For each parsed document, the processor is caused to identify sections uniquely associated with section type. The processor is caused to train machine learning models, each machine learning model associated with one section type and trained using a portion of each parsed document associated with that section type.

9.

发明授权
Methods and apparatus for extracting data from a document by encoding it with textual and visual features and using machine learning 有权

公开(公告)号：US12183106B1

公开(公告)日：2024-12-31

申请号：US18759395

申请日：2024-06-28

Applicant: Greenhouse Software, Inc.

Inventor： Triantafyllos Xylouris

IPC: G06V30/414 , G06N20/00 , G06V30/12 , G06V30/18 , G06V30/41 , G06V30/412 , G06V30/416 , G06F40/30

Abstract: An apparatus including a processor caused to receive document images, each including representations of characters. The processor is caused to parse each document image to extract, based on structure type, subsets of characters, to generate a text encoding for that document image. For each document, the processor is caused to extract visual features to generate a visual encoding for that document image, each visual feature associated with a subset of characters. The processor is caused to generate parsed documents, each parsed document uniquely associated with a document image and based on the text and visual encoding for that document image. For each parsed document, the processor is caused to identify sections uniquely associated with section type. The processor is caused to train machine learning models, each machine learning model associated with one section type and trained using a portion of each parsed document associated with that section type.

10.

发明授权
Removal of sensitive data from documents for use as training sets 有权

公开(公告)号：US12182308B2

公开(公告)日：2024-12-31

申请号：US17309198

申请日：2019-11-07

Applicant: SERVICENOW CANADA INC.

Inventor： Archy Otto De Berker , Philippe Guay , Dominique Tourillon , Etienne Marcotte

IPC: G06F21/62 , G06F18/21 , G06F18/214 , G06N3/08 , G06V10/82 , G06V30/19 , G06V30/414 , G06V30/416

Abstract: Systems and methods relating to the replacement or removal of sensitive data in images of documents. An initial image of a document with sensitive data is received at an execution module and changes are made based on the execution module's training. The changes include replacing or effectively removing the sensitive data from the image of the document. The resulting sanitized image is then sent to a user for validation of the changes. The feedback from the user is then used in training the execution module to refine its behaviour when applying changes to other initial images of documents. To train the execution module, training data sets of document images with sensitive data manually tagged by users are used. The execution module thus learns to identify sensitive data and its submodules replace that sensitive data with suitable replacement data. The feedback from the user works to improve the resulting sanitized images from the execution module.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification