Abstract:
According to one embodiment, a data processor includes an image acquisition module, a degradation evaluation module, a first output module, and a display module. When the degradation evaluation module evaluates the possibility as high, the first output module is configured to output a first trigger for performing a process for detecting the image area. When the evaluation finds the possibility low, the first output module is configured to output a command for displaying the image as it is on a display.
Abstract:
A method for parsing the text of a complex graphical image comprises obtaining a series of blocks of text from the image. Those blocks of text are used to generate location scores, size scores, and length scores. The scores are weighted and linearly summed, and the block with the highest resulting sum is identified as the most likely to be the desired text block.
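The weighted linear sum described above can be sketched as follows. This is a minimal illustration, not the method claimed in the abstract: the `TextBlock` fields, the weight values, and the three scoring functions are all assumptions, since the abstract names the score types but does not define them.

```python
from dataclasses import dataclass


@dataclass
class TextBlock:
    text: str
    x: float       # left edge, in pixels
    y: float       # top edge, in pixels
    height: float  # glyph height, in pixels


# Hypothetical weights; the abstract does not give values.
WEIGHTS = {"location": 0.5, "size": 0.3, "length": 0.2}


def location_score(block, image_height):
    # Assumption: blocks nearer the top of the image score higher.
    return 1.0 - block.y / image_height


def size_score(block, max_height):
    # Assumption: larger text scores higher, normalized to the tallest block.
    return block.height / max_height


def length_score(block, ideal_len=20):
    # Assumption: the score peaks at ideal_len characters and falls off linearly.
    return max(0.0, 1.0 - abs(len(block.text) - ideal_len) / ideal_len)


def best_block(blocks, image_height):
    """Return the block whose weighted sum of the three scores is highest."""
    max_height = max(b.height for b in blocks)

    def total(b):
        return (WEIGHTS["location"] * location_score(b, image_height)
                + WEIGHTS["size"] * size_score(b, max_height)
                + WEIGHTS["length"] * length_score(b))

    return max(blocks, key=total)
```

With these assumed scores, a tall heading near the top of a chart outranks a small caption near the bottom.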
Abstract:
The current document is directed to methods and systems for identifying symbols corresponding to symbol images in a scanned-document image or other text-containing image, with the symbols corresponding to Chinese or Japanese characters, to Korean morpho-syllabic blocks, or to symbols of other languages that use a large number of symbols for writing and printing. In one implementation, the methods and systems create and store a decision tree, the nodes of which include classifiers, each of which recognizes the symbol that corresponds to a symbol image. Inputting a symbol image to the decision tree and processing it through one or more nodes of the tree returns the symbol corresponding to the symbol image.
Abstract:
A multi-orientation text detection method and associated system is disclosed that utilizes orientation-variant glyph features to determine a text line in an image regardless of an orientation of the text line. Glyph features are determined for each glyph in an image with respect to a neighboring glyph. The glyph features are provided to a learned classifier that outputs a glyph pair score for each neighboring glyph pair. Each glyph pair score indicates a likelihood that the corresponding pair of neighboring glyphs form part of a same text line. The glyph pair scores are used to identify candidate text lines, which are then ranked to select a final set of text lines in the image.
Abstract:
A word segmentation method for processing a document image applies clustering analysis to the spacing segments of a line. The spacing segments are generated by thresholding a one-dimensional vertical projection profile of the line. Taking advantage of the bimodal distribution of spacing lengths in text lines, a k-means clustering algorithm, with the number of clusters preset to two, classifies the spacing segments as either character spacing or word spacing. In addition, k-means++ initialization is used to improve the performance of the cluster analysis. Clustering results such as the cluster centers and compactness are used to prune single-word text lines, single table items, and similar cases. The locations of the word spacing segments are then used to segment the line of text into words.
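The pipeline above (threshold the projection profile, cluster the gap lengths into two groups, keep the wide group) can be sketched in pure Python. Several points here are assumptions rather than details from the abstract: the profile is a list of per-column ink counts, the two centers are initialized deterministically from the minimum and maximum gap length instead of with k-means++, and the single-word pruning test is a hypothetical `min_ratio` check on the two cluster centers.

```python
def spacing_segments(profile, threshold=0):
    """Return (start, length) for each interior run of columns with ink <= threshold."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value <= threshold:
            if start is None:
                start = i
        elif start is not None:
            if start > 0:  # a run touching the left edge is margin, not spacing
                runs.append((start, i - start))
            start = None
    return runs  # a run touching the right edge is never closed, so it is dropped


def two_means(lengths, iterations=20):
    """1-D k-means with k = 2; centers start at the min and max (not k-means++)."""
    lo, hi = float(min(lengths)), float(max(lengths))
    for _ in range(iterations):
        small = [x for x in lengths if abs(x - lo) <= abs(x - hi)]
        large = [x for x in lengths if abs(x - lo) > abs(x - hi)]
        if not large:  # all gaps have the same length
            break
        new_lo, new_hi = sum(small) / len(small), sum(large) / len(large)
        if (new_lo, new_hi) == (lo, hi):
            break
        lo, hi = new_lo, new_hi
    return lo, hi


def word_gaps(profile, min_ratio=2.0):
    """Return the (start, length) gaps classified as word spacing."""
    runs = spacing_segments(profile)
    if not runs:
        return []
    lo, hi = two_means([length for _, length in runs])
    if hi < min_ratio * lo:  # hypothetical pruning rule: centers too close, single word
        return []
    return [(s, n) for s, n in runs if abs(n - hi) < abs(n - lo)]
```

For a profile such as `[3, 3, 3, 0, 2, 2, 2, 0, 0, 0, 0, 5, 5, 5, 0, 1, 1, 1]`, the one-column gaps fall in the character-spacing cluster and the four-column gap is returned as the word boundary.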
Abstract:
A method for cutting out, from a gray-scale image generated by capturing an image of a character string, each character in the character string for recognition, includes a first step of repeating projection processing for projecting a highest or lowest gray level in a line along a direction crossing the character string in the gray-scale image, onto an axis along the character string, with the lowest gray level selected when a character in the gray-scale image is darker than a background, the highest gray level selected when the character in the gray-scale image is brighter than the background, and a projection target position moved along the character string.
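The projection step can be illustrated with a short sketch. It assumes the image is a list of pixel rows with the character string running horizontally, so the "line along a direction crossing the character string" is an image column; the function name and the `dark_text` flag are illustrative and do not come from the abstract.

```python
def extreme_projection(image, dark_text=True):
    """Project the lowest (dark text) or highest (bright text) gray level
    of each column onto the axis that runs along the character string."""
    pick = min if dark_text else max
    # zip(*image) walks the columns one by one, i.e. the projection target
    # position moves along the string.
    return [pick(column) for column in zip(*image)]


image = [
    [200, 200, 200, 200, 200],
    [200,  40, 200,  60, 200],
    [200, 200, 200,  30, 200],
]
profile = extreme_projection(image)  # → [200, 40, 200, 30, 200]
```

Columns that contain any character pixel show up as dips in the profile, while columns that hold only background stay at the background level.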
Abstract:
The character recognition apparatus recognizes characters from a read document original and corrects the character string obtained as the recognition result word by word, with the space character as a separator. The apparatus includes a circumscribed rectangle formation portion, which forms a circumscribed rectangle for each character of a recognized alphabetic character string; a fixed-pitch font determination portion, which determines whether a font is a fixed-pitch font based on the distance between the center lines, in the width direction, of adjacent circumscribed rectangles; an excess space determination portion, which, in the case of a fixed-pitch font, determines that a space character in the character string is excess when its width is narrower than a predetermined width; and a portion for deleting the space character determined to be excess from the character string.
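A minimal sketch of the two decisions described above follows. The data layout is an assumption: each recognized character, including each space inserted by the recognizer, is a `(char, left, right)` tuple of horizontal box edges. The `tolerance` and `min_space_width` parameters are hypothetical stand-ins for the thresholds the abstract leaves unspecified.

```python
def is_fixed_pitch(chars, tolerance=1.0):
    """Treat the font as fixed-pitch when the center-to-center distances of
    directly adjacent character boxes vary by no more than `tolerance`."""
    dists = []
    for (c1, l1, r1), (c2, l2, r2) in zip(chars, chars[1:]):
        if c1 != " " and c2 != " ":
            dists.append((l2 + r2) / 2 - (l1 + r1) / 2)
    return len(dists) >= 2 and max(dists) - min(dists) <= tolerance


def remove_excess_spaces(chars, min_space_width):
    """For fixed-pitch text, drop spaces narrower than min_space_width."""
    if not is_fixed_pitch(chars):
        return "".join(c for c, _, _ in chars)
    return "".join(c for c, left, right in chars
                   if not (c == " " and right - left < min_space_width))


# A narrow "i" in a fixed-pitch font leaves a gap that was misread as a space.
will = [("w", 0, 8), ("i", 13, 15), (" ", 15, 22), ("l", 23, 25), ("l", 33, 35)]
cleaned = remove_excess_spaces(will, min_space_width=9)  # → "will"
```

A genuine word space in the same font spans roughly a full pitch, so it stays above the width threshold and is kept.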
Abstract:
A character recognition apparatus may include an imaging element configured to read a character string placed on an information recording medium; an image memory configured to store image data of the character string; and a character segmenting unit configured to segment a character constituting the character string. The character segmenting unit may include a minimum intensity curve creating unit configured to detect a minimum intensity value among light intensity values, and create a minimum intensity curve of the image data according to the minimum intensity value of each pixel row; a character segmenting position detecting unit configured to calculate a space between the characters neighboring in the created minimum intensity curve, in order to detect a character segmenting position between the characters; and a character segmenting process unit configured to segment each character according to the detected character segmenting position between the characters.
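One way to picture the segmenting position detection is sketched below. It takes a minimum intensity curve as a plain list, one value per position along the string, and places a cut at the midpoint of each background run between characters. The `background_level` threshold and the midpoint rule are assumptions; the abstract only says that the space between neighboring characters is calculated from the curve.

```python
def segment_positions(curve, background_level):
    """Return the midpoint of every interior run of background-level values."""
    cuts, start = [], None
    for i, value in enumerate(curve):
        if value >= background_level:
            if start is None:
                start = i
        elif start is not None:
            if start > 0:  # skip the leading margin
                cuts.append((start + i - 1) // 2)
            start = None
    return cuts  # the trailing margin is never closed, so it yields no cut


curve = [250, 40, 35, 250, 250, 250, 60, 250, 30, 250]
cuts = segment_positions(curve, background_level=200)  # → [4, 7]
```

Each returned index separates two neighboring characters, so three characters in the curve yield two cut positions.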
Abstract:
A computer-implemented method for selecting at least one segmentation parameter for optical character recognition is provided. The method can include receiving an image having a character string that includes one or more characters. The method can also include receiving a character string identifying each of the one or more characters. The method can also include automatically generating at least one segmentation parameter. The method can also include performing segmentation on the image having the character string using the at least one segmentation parameter. The method can also include determining if a resultant segmentation satisfies one or more criteria and if the resultant segmentation satisfies the one or more criteria, selecting the at least one segmentation parameter.
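The generate, segment, and check loop can be sketched as a small search over candidate parameters. Everything specific here is an assumption made for illustration: the image is reduced to a per-column darkness profile, the two parameters are a hypothetical `ink_threshold` and `min_gap`, and the only acceptance criterion is that the number of segments equals the number of characters in the supplied string.

```python
from itertools import product


def segment(profile, ink_threshold, min_gap):
    """Split a column profile into (start, end) spans, end exclusive.
    Columns darker than ink_threshold count as ink; background gaps
    shorter than min_gap are bridged."""
    spans, start, gap = [], None, 0
    for i, value in enumerate(profile):
        if value < ink_threshold:
            if start is None:
                start = i
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                spans.append((start, i - gap + 1))
                start, gap = None, 0
    if start is not None:
        spans.append((start, len(profile) - gap))
    return spans


def select_parameters(profile, expected_text,
                      thresholds=(64, 128, 192), min_gaps=(1, 2, 3)):
    """Return the first candidate whose segmentation yields one span per character."""
    for ink_threshold, min_gap in product(thresholds, min_gaps):
        spans = segment(profile, ink_threshold, min_gap)
        if len(spans) == len(expected_text):  # the only criterion in this sketch
            return {"ink_threshold": ink_threshold, "min_gap": min_gap}
    return None


# "AB" where the first character has a one-column break in its stroke.
profile = [255, 30, 30, 255, 30, 255, 255, 255, 40, 40, 255]
params = select_parameters(profile, "AB")
```

With `min_gap=1` the broken stroke splits into two spans and the candidate is rejected; `min_gap=2` bridges the break and gives one span per character, so that candidate is selected.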