Content-based object detection, 3D reconstruction, and data extraction from digital images

    公开(公告)号:US11620733B2

    公开(公告)日:2023-04-04

    申请号:US17005147

    申请日:2020-08-27

    申请人: Kofax, Inc.

    摘要: A method of detecting an object depicted in a digital image includes: detecting a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; projecting a location of region(s) of interest of the object based on the plurality of identifying features, where each region of interest depicts content; building and/or selecting an extraction model configured to extract the content based at least in part on: the location of the region(s) of interest, the of identifying feature(s), or both; and extracting the some or all of the content from the digital image using the extraction model. Corresponding system and computer program product embodiments are disclosed. The inventive concepts enable reliable extraction of data from digital images where portions of an object are obscured/missing, and/or depicted on a complex background.

    AUTOMATED DOCUMENT PROCESSING FOR DETECTING, EXTRACTNG, AND ANALYZING TABLES AND TABULAR DATA

    公开(公告)号:US20220405265A1

    公开(公告)日:2022-12-22

    申请号:US17850835

    申请日:2022-06-27

    申请人: Kofax, Inc.

    IPC分类号: G06F16/22

    摘要: According to one embodiment, a computer-implemented method for classifying one or more tables and/or one or more tabular data arrangements depicted in image data includes: training a machine learning model, using a training dataset representing a plurality of different tables and/or tabular data arrangements, based at least in part on a plurality of recognized textual elements within the training dataset; and outputting a trained classification model based on the training, wherein the trained classification model is configured to classify one or more tables and/or one or more tabular data arrangements represented within a test dataset according to: one or more table classifications; one or more tabular data arrangement classifications; and/or one or more column classifications; and classifying the one or more tables and/or the one or more tabular data arrangements represented within the test dataset using the trained classification model. Methods for detecting, extracting, and classifying tables are also disclosed.

    ITERATIVE RECOGNITION-GUIDED THRESHOLDING AND DATA EXTRACTION

    公开(公告)号:US20210383150A1

    公开(公告)日:2021-12-09

    申请号:US17348584

    申请日:2021-06-15

    申请人: Kofax, Inc.

    摘要: Techniques for binarization and extraction of information from image data are disclosed. The inventive concepts include independently binarizing portions of the image data on the basis of individual features, e.g. per connected component, and using multiple different binarization thresholds to obtain the best possible binarization result for each portion of the image data. Determining the quality of each binarization result may be based on attempted recognition and/or extraction of information therefrom. Independently binarized portions may be assembled into a contiguous result. In one embodiment, a method includes: identifying a region of interest within a digital image; generating a plurality of binarized images based on the region of interest using different binarization thresholds; and extracting data from some or all of the plurality of binarized images. The extracted data includes connected components that overlap and/or are obscured by unique background. Corresponding systems and computer program products are disclosed.

    CONTENT-BASED OBJECT DETECTION, 3D RECONSTRUCTION, AND DATA EXTRACTION FROM DIGITAL IMAGES

    公开(公告)号:US20210027431A1

    公开(公告)日:2021-01-28

    申请号:US17005171

    申请日:2020-08-27

    申请人: Kofax, Inc.

    摘要: A computer-implemented method of detecting an object depicted in a digital image includes: detecting a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; projecting a location of region(s) of interest of the object based on the plurality of identifying features, where each region of interest depicts content; building and/or selecting an extraction model configured to extract the content based at least in part on: the location of the region(s) of interest, the of identifying feature(s), or both; and extracting the some or all of the content from the digital image using the extraction model. Corresponding system and computer program product embodiments are disclosed. The inventive concepts enable reliable extraction of data from digital images where portions of an object are obscured/missing, and/or depicted on a complex background.

    Content-based detection and three dimensional geometric reconstruction of objects in image and video data

    公开(公告)号:US10783613B2

    公开(公告)日:2020-09-22

    申请号:US16151090

    申请日:2018-10-03

    申请人: Kofax, Inc.

    摘要: Systems, computer program products, and techniques for detecting and/or reconstructing objects depicted in digital image data within a three-dimensional space are disclosed, according to various exemplary embodiments. The inventive concepts uniquely utilize internal features to accomplish reconstruction, thereby avoiding reliance on reconstructing objects based on information derived from location of edges. The inventive concepts thus provide an improvement over conventional object reconstruction since objects may be detected and/or reconstructed even when edges are obscured or not depicted in the digital image data. In one aspect, reconstructing an object depicted in a digital image includes using a processor to: detect a plurality of identifying features of the object, where the identifying features are located internally with respect to the object; and reconstruct the digital image of the object within a three dimensional coordinate space based at least in part on some or all of the identifying features.

    Content-based detection and three dimensional geometric reconstruction of objects in image and video data

    公开(公告)号:US10127636B2

    公开(公告)日:2018-11-13

    申请号:US15234993

    申请日:2016-08-11

    申请人: Kofax, Inc.

    摘要: Systems, computer program products, and techniques for reconstructing objects depicted in digital image data within a three-dimensional space are disclosed, according to various exemplary embodiments. The inventive concepts uniquely utilize internal features to accomplish reconstruction, thereby avoiding reliance on reconstructing objects based on information derived from location of edges. The inventive concepts thus provide an improvement over conventional object reconstruction since objects may be reconstructed even when edges are obscured or not depicted in the digital image data. In one aspect, a computer-implemented method of reconstructing an object depicted in a digital image includes: detecting a plurality of identifying features of the object, wherein the plurality of identifying features are located internally with respect to the object; and reconstructing the digital image of the object within a three dimensional coordinate space based at least in part on some or all of the plurality of identifying features.

    Systems and methods for generating composite images of long documents using mobile video data

    公开(公告)号:US10108860B2

    公开(公告)日:2018-10-23

    申请号:US15390321

    申请日:2016-12-23

    申请人: Kofax, Inc.

    摘要: According to one embodiment, a system includes a processor and logic in and/or executable by the processor to cause the processor to: initiate a capture operation using an image capture component of the mobile device, the capture operation comprising; capturing video data; and estimating a plurality of motion vectors corresponding to motion of the image capture component during the capture operation; detect a document depicted in the video data; track a position of the detected document throughout the video data; select a plurality of images using the image capture component of the mobile device, wherein the selection is based at least in part on: the tracked position of the detected document; and the estimated motion vectors; and generate a composite image based on at least some of the selected plurality of images.