-
公开(公告)号:US20250061738A1
公开(公告)日:2025-02-20
申请号:US18935835
申请日:2024-11-04
Applicant: Capital One Services, LLC
Inventor: Jason PRIBBLE , Daniel Alan JARVIS , Nicholas CAPURSO
IPC: G06V30/414 , G06T7/62 , G06V30/142
Abstract: Described herein are method, system, and non-transitory computer-readable medium embodiments for capturing an image of a first object. A method can include determining that at least one image parameter value of the first object is within a threshold value based on an outline of the first object and at least one environmental feature. The environmental feature can include at least one of: an angle with respect to the first object and a camera, a distance between the first object and the camera, or a second object other than the first object in an image frame. The method can include triggering the camera to capture an image of the first object.
-
公开(公告)号:US12230049B2
公开(公告)日:2025-02-18
申请号:US18205867
申请日:2023-06-05
Applicant: Cognition IP Technology Inc.
Inventor: Bryant Lee , Andrew Tjang , Andrew Perry Chu , Uday Pulleti
IPC: G06F17/00 , G06F40/226 , G06V30/413 , G06V30/414 , G06V30/418 , G06F16/93
Abstract: Systems and methods may be provided for performing a search on an input text block. The input text block may be split into a plurality of input text segments. A text similarity algorithm may be used to find similar stored text segments to each of the plurality of input text segments.
-
公开(公告)号:US12230048B1
公开(公告)日:2025-02-18
申请号:US18137590
申请日:2023-04-21
Applicant: Thomson Reuters Enterprise Centre GmbH
Inventor: Nicholas E. Vandivere
IPC: G06V30/414 , G06F16/901 , G06F18/214 , G06Q50/163 , G06V30/412 , G06V30/413 , G06V30/19
Abstract: A method and system can include processing title and title opinion document images to generate text information. Trained models may generate data objects representative of period of time during which certain rights to a property exist. The trained models may also generate rules for modifying the data objects and interrelating the data objects to each other. In some examples, a confidence level can be generated and will reflect a likelihood of a data object including correct information. The modified and interrelated data objects may be used to generate a navigable interface which includes a current title status for a property and a navigable chain of title reflecting historical rights to the property.
-
公开(公告)号:US20250054325A1
公开(公告)日:2025-02-13
申请号:US18231652
申请日:2023-08-08
Applicant: SAP SE
Inventor: Xiang Yu , Christoph Meyer
IPC: G06V30/14 , G06F40/284 , G06V10/70 , G06V30/19 , G06V30/414
Abstract: Systems and processes for aligning weakly-annotated data to recognized characters in a document are provided. In a method for aligning annotation data to recognized characters, annotation words and character recognition tokens are received, and a search algorithm is performed to align the annotation words to the tokens in a stepwise manner. At each step, an annotation word is aligned to one or more tokens, and a cost of each respective alignment is calculated. Once all annotation words are aligned, a full set of annotation word-token pairs corresponding to the annotation is selected based on a total cost of alignment for that set. A bounding box enclosing the tokens in the selected full set is generated and output to a target application.
-
公开(公告)号:US12217521B2
公开(公告)日:2025-02-04
申请号:US17567192
申请日:2022-01-03
Applicant: FOXIT SOFTWARE INC.
Inventor: Po-Fang Hsu , Chi-Ching Wei
IPC: G06V30/414 , G06F40/205 , G06F40/279 , G06N3/08 , G06V30/196
Abstract: A method for comparing content of two document files each having a plurality of content blocks is provided. The method is to be implemented by an electronic device and includes the steps of: performing, for the each of the content blocks in each of the document files, a pre-process operation so as to obtain a plurality of properties associated with the content block; comparing, for each content block from one of the document files, the properties thereof with the properties of each of the plurality of content blocks of the other one of the document files; and generating a comparison result based on the operations of the comparing.
-
6.
公开(公告)号:US12211304B2
公开(公告)日:2025-01-28
申请号:US17200448
申请日:2021-03-12
Inventor: Yulin Li , Xiameng Qin , Chengquan Zhang , Junyu Han , Errui Ding , Tian Wu , Haifeng Wang
IPC: G06F16/901 , G06N3/047 , G06N5/04 , G06V10/22 , G06V10/80 , G06V30/262 , G06V30/414 , G06V10/24
Abstract: Embodiments of the present disclosure provide a method and apparatus for performing a structured extraction on a text, a device and a storage medium. The method may include: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line.
-
公开(公告)号:US12190620B2
公开(公告)日:2025-01-07
申请号:US17160082
申请日:2021-01-27
Applicant: Automation Anywhere, Inc.
Inventor: Siddarth Sathi , Vibhas Gejji , Anish Hiranandani , Bruno Gomes Selva , Anjana Prabhakar
IPC: G06V30/416 , G06F16/242 , G06F40/00 , G06F40/177 , G06F40/20 , G06F40/279 , G06N3/045 , G06N20/00 , G06Q10/10 , G06Q40/12 , G06V30/00 , G06V30/148 , G06V30/19 , G06V30/40 , G06V30/412 , G06V30/413 , G06V30/414
Abstract: Improved techniques to access content from documents in an automated fashion. The improved techniques permit content within documents to be retrieved and then used by computer systems operating various software programs (e.g., application programs), such as an extraction program. Documents, especially business transaction documents, often have various descriptors (or tables) and values that form key-value pairs. The improved techniques permit key-value pairs within documents to be recognized and extracted from documents. Consequently, RPA systems are able to accurately understand the content of tables within documents so that users and/or software robots can operate on the documents with increased reliability and flexibility.
-
公开(公告)号:US20250005952A1
公开(公告)日:2025-01-02
申请号:US18759395
申请日:2024-06-28
Applicant: Greenhouse Software, Inc.
Inventor: Triantafyllos XYLOURIS
IPC: G06V30/414 , G06V30/12
Abstract: An apparatus including a processor caused to receive document images, each including representations of characters. The processor is caused to parse each document image to extract, based on structure type, subsets of characters, to generate a text encoding for that document image. For each document, the processor is caused to extract visual features to generate a visual encoding for that document image, each visual feature associated with a subset of characters. The processor is caused to generate parsed documents, each parsed document uniquely associated with a document image and based on the text and visual encoding for that document image. For each parsed document, the processor is caused to identify sections uniquely associated with section type. The processor is caused to train machine learning models, each machine learning model associated with one section type and trained using a portion of each parsed document associated with that section type.
-
公开(公告)号:US12183106B1
公开(公告)日:2024-12-31
申请号:US18759395
申请日:2024-06-28
Applicant: Greenhouse Software, Inc.
Inventor: Triantafyllos Xylouris
IPC: G06V30/414 , G06N20/00 , G06V30/12 , G06V30/18 , G06V30/41 , G06V30/412 , G06V30/416 , G06F40/30
Abstract: An apparatus including a processor caused to receive document images, each including representations of characters. The processor is caused to parse each document image to extract, based on structure type, subsets of characters, to generate a text encoding for that document image. For each document, the processor is caused to extract visual features to generate a visual encoding for that document image, each visual feature associated with a subset of characters. The processor is caused to generate parsed documents, each parsed document uniquely associated with a document image and based on the text and visual encoding for that document image. For each parsed document, the processor is caused to identify sections uniquely associated with section type. The processor is caused to train machine learning models, each machine learning model associated with one section type and trained using a portion of each parsed document associated with that section type.
-
公开(公告)号:US12182308B2
公开(公告)日:2024-12-31
申请号:US17309198
申请日:2019-11-07
Applicant: SERVICENOW CANADA INC.
Inventor: Archy Otto De Berker , Philippe Guay , Dominique Tourillon , Etienne Marcotte
IPC: G06F21/62 , G06F18/21 , G06F18/214 , G06N3/08 , G06V10/82 , G06V30/19 , G06V30/414 , G06V30/416
Abstract: Systems and methods relating to the replacement or removal of sensitive data in images of documents. An initial image of a document with sensitive data is received at an execution module and changes are made based on the execution module's training. The changes include replacing or effectively removing the sensitive data from the image of the document. The resulting sanitized image is then sent to a user for validation of the changes. The feedback from the user is then used in training the execution module to refine its behaviour when applying changes to other initial images of documents. To train the execution module, training data sets of document images with sensitive data manually tagged by users are used. The execution module thus learns to identify sensitive data and its submodules replace that sensitive data with suitable replacement data. The feedback from the user works to improve the resulting sanitized images from the execution module.
-
-
-
-
-
-
-
-
-