-
1.
公开(公告)号:US20230215206A1
公开(公告)日:2023-07-06
申请号:US17331463
申请日:2021-05-26
Applicant: Indeed, Inc.
Inventor: Lawrence Thibodeaux , Gokhan Ozer , Chinwei Hu , Jesse Rohwer , Eugene Raether
CPC classification number: G06K9/00463 , G06K9/00469 , G06K9/6253 , G06K9/6261 , G06K9/6259
Abstract: Systems and methods are disclosed for parsing resume documents using computer vision and optical character recognition technology in combination with a user feedback interface system to facilitate user feedback to improve the overall processing quality of the resumes that are imported into computer resume processing systems. In at least one embodiment, the system and method prompt a user to upload an input resume document, which is processed with a first parsing pass to generate initial resume data by extracting a plurality of resume text blocks. Further processing identifies an initial set of bounding blocks and to visually displays the initial resume data for user review and feedback to regroup one or more of the initial set of bounding blocks into a regrouped bounding block. Additional processing consolidates into a group text block each of the resume text blocks corresponding to the regrouped one or more of the initial set of bounding blocks.
-
2.
公开(公告)号:US20190197307A1
公开(公告)日:2019-06-27
申请号:US16132188
申请日:2018-09-14
Applicant: GOOGLE LLC
Inventor: Shih-Hao Yeh , Navid Samadani-McQuirk , Joshua Curtis Hudgins , Jack Cameron Dille
IPC: G06K9/00 , G06F3/0482 , G06K9/62
CPC classification number: G06K9/00463 , G06F3/0482 , G06K9/00442 , G06K9/325 , G06K9/6201 , G06K2209/01 , G06Q30/0207 , G06Q30/0223 , G06Q30/0633
Abstract: Modifying graphical user interfaces based on new data obtained from electronic documents comprises a computing system and an image capturing system of a user. The computing device receives a first digital image comprising a first set of data and extracts the first set of data from the first digital image. The computing device populates a list with a first set of data. The computing device receives a second digital image comprising a second set of data and extracts the second set of data from the second digital image. The computing device then modifies the list based on the second set of data. The computing device searches for third party data to associate with items on the list and takes appropriate action based on the association.
-
公开(公告)号:US20190188465A1
公开(公告)日:2019-06-20
申请号:US16139737
申请日:2018-09-24
Applicant: Capital One Services, LLC
Inventor: Subhashini TRIPURANENI , Joseph R. Barco, JR.
CPC classification number: G06K9/00463 , G06F16/5846 , G06F16/93 , G06K9/00456 , G06K9/00469 , G06K9/03 , G06K9/18 , G06K9/3241 , G06K9/4604 , G06K2209/01
Abstract: A device may receive image data representing a document, the document including: text, and edges. Based on the edges, the device may identify, a segment of interest within the image data and crop the segment of interest to obtain a portion of the image data. In addition, the device may perform optical character recognition on the portion of the image data, the optical character recognition producing recognized text. The device may obtain, based on the recognized text, validation data that includes verification text, and determine whether the recognized text is verified based on the verification text. Based on a result of the determination, the device may perform an action.
-
4.
公开(公告)号:US20190114743A1
公开(公告)日:2019-04-18
申请号:US16229397
申请日:2018-12-21
Applicant: Open Text Corporation
Inventor: Christopher Dale Lund , Sreelatha Samala
CPC classification number: G06T3/4046 , G06K9/00456 , G06K9/00463 , G06K9/2054 , G06K9/325 , G06K9/4628 , G06K9/6274 , G06K2209/01 , G06N3/0454 , G06N3/08 , G06N5/046 , G06T5/009
Abstract: Systems and methods for image modification to increase contrast between text and non-text pixels within the image. In one embodiment, an original document image is scaled to a predetermined size for processing by a convolutional neural network. The convolutional neural network identifies a probability that each pixel in the scaled is text and generates a heat map of these probabilities. The heat map is then scaled back to the size of the original document image, and the probabilities in the heat map are used to adjust the intensities of the text and non-text pixels. For positive text, intensities of text pixels are reduced and intensities of non-text pixels are increased in order to increase the contrast of the text against the background of the image. Optical character recognition may then be performed on the contrast-adjusted image.
-
公开(公告)号:US20190095903A1
公开(公告)日:2019-03-28
申请号:US16012350
申请日:2018-06-19
Applicant: Capital One Services, LLC
Inventor: Dan Givol , Anand Kumar , Patrick Zearfoss
CPC classification number: G06Q20/3276 , G06K9/00463 , G06K9/22 , G06K9/3216 , G06K9/3233 , G06K9/3275 , G06K9/4604 , G06K9/4642 , G06K9/6215 , G06Q20/32 , G06Q20/3223 , G06Q20/347 , G06Q20/36 , G06Q20/363 , G06T1/0007 , G06T7/11 , G06T7/90
Abstract: A method and a system of capturing an image of a card having a magnetic stripe is provided. The method includes obtaining a first image by an imaging device of the card, obtaining a plurality of images of the card via color delta analysis, and obtaining a third image of the card by comparing the first and the plurality of images.
-
公开(公告)号:US20190036866A1
公开(公告)日:2019-01-31
申请号:US15948689
申请日:2018-04-09
Applicant: Upheaval LLC
Inventor: David ISEMINGER
CPC classification number: H04L51/32 , G06F17/248 , G06K9/00463 , G06Q50/00 , H04L67/18 , H04L67/306
Abstract: Methods and systems are provided in which an improved interface implements a synergistic hybrid of user interactions and automatic operations so that user input is elicited sparingly, making it possible to generate customized social media posts with unexpected speed relative to any art-known techniques. A draft post is pre-populated with a first keyword that identifies a machine-recognized aspect of a photograph, for example, and an event descriptor partly based on the capture location. After adding user text, a complete post is then ready for broadcast.
-
公开(公告)号:US20180349693A1
公开(公告)日:2018-12-06
申请号:US15918830
申请日:2018-03-12
Applicant: HITACHI, LTD.
Inventor: Yasuo WATANABE , Toshio OKOCHI , Hiroshi SHINJO , Masahiro MOTOBAYASHI , Yasufumi SUZUKI
IPC: G06K9/00
CPC classification number: G06K9/00469 , G06K9/00463 , G06K9/00483 , G06K9/6202 , G06K2209/01 , G06Q30/04
Abstract: A computer, which is configured to extract an attribute being a character string indicating a feature of a paper-based document, the computer stores template information dictionary information. The computer is configured to: execute character recognition processing on image data on the paper-based document; extract an attribute corresponding to each of the at least one type of attribute, which is defined in each of the plurality of templates, through use of a result of the character recognition processing and the plurality of templates; calculate a score regarding the extracted attribute for each of the plurality of templates; select one of the plurality of templates that has the highest extraction accuracy of the attribute based on the score; and generate output information through use of the selected template.
-
公开(公告)号:US20180330181A1
公开(公告)日:2018-11-15
申请号:US16043010
申请日:2018-07-23
Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
Inventor: Duanduan Yang
CPC classification number: G06K9/346 , G06K9/00463 , G06K9/00859 , G06K9/344 , G06K9/72 , G06K2209/01
Abstract: A method for segmenting an image containing handwritten text into line segments and word segments. The image is horizontally down sampled at a first ratio. Connected regions in the down-sampled image are detected; horizontal neighboring ones are merged to form lines, to segment the original image into line images. Each line image is horizontally down sampled at a second ratio which is smaller than the first ratio. Connected regions in the down-sampled line image are detected to obtain potential word segmentation positions. A path is a way of dividing the line at some or all of the potential word segmentation positions into multiple path segments; for each of all possible paths, word recognition is applied to each path segment to calculate a word recognition score, and an average word recognition score for the path is calculated; the path with the highest score gives the final word segmentation.
-
公开(公告)号:US20180314884A1
公开(公告)日:2018-11-01
申请号:US15499666
申请日:2017-04-27
Applicant: INTUIT INC.
Inventor: Daniel LEE , Vijay YELLAPRAGADA , Shailesh SOLIWAL , Peijun CHIANG
CPC classification number: G06K9/00463 , G06K9/18 , G06K9/3208 , G06K9/723 , G06K2209/01 , G06Q40/123 , G06T7/11 , G06T7/70
Abstract: The present disclosure relates to the extraction of text from an image including a depiction of a document. According to one embodiment, a mobile device receives an image depicting a document. The mobile device identifies a plurality of text areas in the document and identifies a midpoint of each of the plurality of text areas in the document. The mobile device detects one or more lines of text in the document including a plurality of text areas, where the plurality of text areas included in a line of text are associated with a midpoint having a coordinate within a threshold number of pixels on one axis in a two-dimensional space. Based on an orientation of the detected one or more lines of text, the mobile device determines a probable orientation of the document and extracts text from the image based on the determined probable orientation of the document.
-
公开(公告)号:US20180293437A1
公开(公告)日:2018-10-11
申请号:US15482611
申请日:2017-04-07
Applicant: Box, Inc.
Inventor: Andrew Miller Dempsey , James Michael DiZoglio , Oluwatosin Onafowokan
CPC classification number: G06K9/00463 , G06K9/4609 , G06K9/6202 , G06T7/62 , G06T11/60 , G06T2207/10016 , G06T2207/30176 , G06T2207/30242 , G06T2210/12 , H04N5/23222 , H04N5/23293
Abstract: Image processing to reduce or eliminate jitter when visually highlighting a target capture area of an image capture device such as a smart phone or camera. A method embodiment commences upon receiving a sequence of one or more video frames taken by the image capture device. A routine to identify the largest polygons in the video frames is applied to the sequence. Uncertainty in identification of the largest polygons is reduced or eliminated by applying intra-frame filters to the polygons. Visible jitter that can arise from inter-frame uncertainty when selecting and highlighting target capture areas can be reduced or eliminated by applying inter-frame filters so as to retain selection of a particular target capture area even in the presence of certain capture device movements. A jitter-free representation of the target capture area is visually displayed on a display screen of the image capture device.
-
-
-
-
-
-
-
-
-