Abstract:
Methods and apparatus for retrieving PDF documents, performing text extraction operations on portions or all of a retrieved document and supporting search operations in a manner that allows search results to be quickly provided for at least portions of a PDF document being viewed are described. The methods and apparatus are particularly useful in applications, such as many applications executed by a browser, where the application is limited to a single processing thread and thus must perform all or many processing operations sequentially. By prioritizing document pages which are being viewed for text extraction even before a search is initiated and by performing text extraction in small periods of time and storing the results, in many cases a user can be provided with text search results for a page being viewed in relatively little time and without the program, e.g., java script browser application, appearing non-responsive.
Abstract:
Methods and apparatus for identifying lines in an image are described. An image to be processed is divided into a plurality of tiles, and processing is performed on a per tile basis. Lines are identified in tiles and a weight is assigned to each line based on among other things, the length of the line. Quantized first and second parameter values, e.g., values defining where lines enters and leave an area, are used in defining the identified lines. A set of lines is selected based on the weight information and output or used in image processing the image including the lines.
Abstract:
An input SVG file to be processed is accessed. Reusable symbols in the input SVG are identified, e.g., which satisfy a symbol size requirement. A set of symbols are selected from among the identified reusable symbols for conversion to glyphs of a custom binary font, e.g., based on symbol occurrence frequency. A binary font file is created corresponding to set the selected identified symbols in the SVG input file. An SVG output file is created including: binary font glyph definitions corresponding to the converted identified symbols, definitions of symbols from the SVG input file which have not been converted to glyphs and information indicating where the glyphs and symbols, which were not converted, are to be placed on an output display page. The generated SVG output file is a more efficient SVG file than the input SVG file. Different custom binary font files are created for different SVG input pages.
Abstract:
Methods and apparatus for managing resource utilization in a distributed system are described. Devices, e.g., servers, which use resources, e.g., processing cores, act as individual policy enforcement points. Individual servers retrieve and maintain local copies of resource lease records which are stored in a centralized data storage system. The individual server compares locally stored lease records to the retrieved lease records to check for any tampering in the centralized data storage and multiple states are supported to take into consideration transitory conditions and/or communications delays. Verification states include, e.g., a Pending Active state and a Pending Inactive State, in addition to an Active state and Inactive state, to delay licensing enforcement to account for centralized storage system eventual consistency delays.
Abstract:
Image binarization methods and apparatus are described. A set of input image pixel values, e.g., a set of grayscale values corresponding to an input image, is processed to determine whether to recommend to use local binarization thresholds or a global binarization threshold. Edges including edge pixels are identified. A first histogram corresponding to edge pixel values and a second histogram corresponding to image pixel values are generated, subjected to one or more smoothing operations, and truncated, based on information derived from the edge histogram. Characteristics of the histograms including, e.g., minima, maxima, points of inflection, and hidden peaks, are determined, evaluated, and used to decide between local binarization thresholds and a global threshold. Based on the recommendation, a global threshold is used or local thresholds are used to process the set of input image pixel values and generate a corresponding set of bi-level values.
Abstract:
Methods and apparatus for detecting labels included in a document or other binarized image, and for extracting and/or using information associated with a label, are described. A nodal structure modeling objects, e.g., characters, character strings or words, which make up various label aliases are described. The nodal structure is used to generate a score for portions of a binarized document with the scores being used to determine the presence or absence of one or more label aliases. When a label alias is determined to be present, information is extracted from the document and used as information corresponding to a label to which the identified label alias corresponds. Multiple different label aliases may correspond to a single label allowing multiple different aliases to be used to identify the same information. The label aliases and information extraction can be and sometimes used to extract information from scanned forms.
Abstract:
Methods and apparatus for determining which displayed content, e.g., text, of a PDF document is selected and then taking an action based on the text selection, e.g., highlighting, coping, or editing the selected text. The methods and apparatus take into consideration potentially differences between text position and/or size differences between rendered and displayed page content and text position and/or size information obtained from performing a text extraction operation on the content of a PDF document. By combining extracted text information with post-render information corresponding to individual DOM elements in synthesized text elements and then storing the information in an easily access data structure, accurate identification of selected text characters can be made in a relatively fast manner using a browser implemented application.
Abstract:
Methods and apparatus for determining which displayed content, e.g., text, of a PDF document is selected and then taking an action based on the text selection, e.g., highlighting, coping, or editing the selected text. The methods and apparatus take into consideration potentially differences between text position and/or size differences between rendered and displayed page content and text position and/or size information obtained from performing a text extraction operation on the content of a PDF document. By combining extracted text information with post-render information corresponding to individual DOM elements in synthesized text elements and then storing the information in an easily access data structure, accurate identification of selected text characters can be made in a relatively fast manner using a browser implemented application.
Abstract:
Methods and apparatus for training neural networks to identify information table partitions are described. Also described are methods and apparatus of using a trained neural network to process an image and provide partition information in an easy to use format. The format of the partition information is one which is simple to interpret, easy to communicate and uses values which facilitate successful training and recognition of partitions in tables whether the partitions be implicitly defined by data arrangement or explicitly define using lines. An image is treated as including a predetermined number of row and column portions. The neural network generates for each predetermined portion a partition present indicator value and a partition location value. The partition present value in some embodiments is a value in the range of 0 to 1 and the partition location value in some embodiments is a value in the range of −1 to +1.
Abstract:
Various features relate to processing a scanned image to facilitate accurate locator pattern identification and/or detection of the corner locations of the locator pattern. In some embodiments to facilitate the identification of corner points the scanned image is processed to reduce the effect of noise and/or other damage on the subsequent location identification process. Individual white pixels which have black pixels on four sides are converted to black as part of the processing while multiple white pixels adjacent each other are left unaltered. In some embodiments processing does not alter the color of black pixels. Corner points of the locator pattern are identified through additional processing and identification of line segments satisfying an expected black, white, black, white, black segment portion ratio.