-
公开(公告)号:US11810380B2
公开(公告)日:2023-11-07
申请号:US17075675
申请日:2020-10-20
Applicant: NIELSEN CONSUMER LLC
Inventor: Roberto Arroyo , Jose Javier Yebes Torres , Aitor Aller Beascoechea , Francisco Javier Delgado del Hoyo , Dayron Rizo Rodriguez , Ravindra Gadde
IPC: G06V30/413 , G06F16/583 , G06F40/18 , G06F40/58 , G06V30/412 , G06V30/414 , G06N3/08
CPC classification number: G06V30/413 , G06F16/5846 , G06F40/18 , G06F40/58 , G06N3/08 , G06V30/412 , G06V30/414
Abstract: Methods, apparatus, and articles manufacture to decode documents based on images using artificial intelligence are disclosed. An example apparatus includes a model executor to input an image into a first artificial intelligence (AI)-based model to generate detected columns of text in the image; and input the image into a second AI-based model to classify the detected columns into categories; a cell identifier to identify rows or cells in the detected columns; and a report generator to: link information corresponding to the rows or cells in the detected columns with corresponding categories; and generating a report based on the linked information.
-
22.
公开(公告)号:US20220189190A1
公开(公告)日:2022-06-16
申请号:US17598792
申请日:2019-03-28
Applicant: Nielsen Consumer LLC
Inventor: Roberto Arroyo , Javier Tovar Velasco , Francisco Javier Delgado Del Hoyo , Diego González Serrador , Emilio Almazán , Antonio Hurtado
IPC: G06V30/413 , G06V30/416 , G06V10/82
Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to analyze characteristics of text of interest using a computing system. An example apparatus includes a text detector to provide text data from a first image, the first image including a first text region of interest and a second text region not of interest, a color-coding generator to generate a plurality of color-coded text-map images, the plurality of color-coded text-map images including color-coded segments with different colors, the color-coded segments corresponding to different text characteristics, and a convolutional neural network (CNN) to determine a first location in the first image as more likely to be the first text region of interest than a second location in the first image corresponding to the second text region that is not of interest based on performing a CNN analysis on the first image and the plurality of color-coded text-map images.
-
公开(公告)号:US20220114821A1
公开(公告)日:2022-04-14
申请号:US17379280
申请日:2021-07-19
Applicant: Nielsen Consumer LLC
Inventor: Roberto Arroyo , David Jiménez , Javier Martínez Cebrián
IPC: G06V30/148 , G06V30/19 , G06V30/14 , G06V20/70 , G06V20/62
Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to categorize image text. An example apparatus includes region detection model training circuitry to identify candidate regions in an input image that include text, and generate bounding boxes around respective ones of the identified candidate regions. The example apparatus also includes mask application circuitry to improve optical character recognition (OCR) by applying a mask to the input image, wherein the mask removes content of the input image except for portions of the input image within the bounding boxes, and OCR circuitry to perform OCR on the masked input image to obtain text data within the bounding boxes.
-
-