Abstract:
A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components included in the binary image data and detects circumscribing bounding boxes of the connected components. Predetermined connected components are removed from all of the connected components based on the sizes of the detected circumscribing bounding boxes and bounding box black pixel ratios. By using the connected components that remain after removing the unnecessary connected components, a histogram is generated by specifying the sizes of the circumscribing bounding boxes as classes and numbers of the connected components as the frequencies of occurrence. A determining section determines whether the input image data is document image data or non-document image data based on information related to the generated histogram and the total black pixel ratio.
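The pipeline in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the patented method. The function names, the size and fill thresholds, and the final decision rule (a low total black pixel ratio plus a histogram concentrated in one size class) are all assumptions made for the example, since the abstract does not state them.

```python
from collections import Counter, deque

def connected_components(binary):
    """Return (top, left, bottom, right, black_count) for each 8-connected black region."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] != 1 or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            top, left, bottom, right, count = y, x, y, x, 0
            while queue:
                cy, cx = queue.popleft()
                count += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right, count))
    return boxes

def size_histogram(binary, min_side=2, max_side=6, min_fill=0.2):
    """Drop components by box size and box black-pixel ratio, then count box sizes.

    The thresholds are hypothetical; the abstract only says that both criteria are used.
    """
    hist = Counter()
    for top, left, bottom, right, count in connected_components(binary):
        height, width = bottom - top + 1, right - left + 1
        fill = count / (height * width)          # bounding box black pixel ratio
        side = max(height, width)                # histogram class: box size
        if min_side <= side <= max_side and fill >= min_fill:
            hist[side] += 1
    return hist

def total_black_ratio(binary):
    return sum(map(sum, binary)) / (len(binary) * len(binary[0]))

def looks_like_document(binary, max_black=0.5, min_peak_share=0.5):
    """Assumed rule: text pages are mostly white and have many same-sized components."""
    hist = size_histogram(binary)
    if not hist:
        return False
    peak_share = max(hist.values()) / sum(hist.values())
    return total_black_ratio(binary) <= max_black and peak_share >= min_peak_share

rows = [
    ".............",
    ".###.#.#.###.",
    ".#.#.###.#...",
    ".###.#.#.###.",
    ".............",
    "......#......",   # an isolated speck that the size filter removes
]
page = [[1 if ch == "#" else 0 for ch in row] for row in rows]
print(dict(size_histogram(page)))   # {3: 3}
print(looks_like_document(page))    # True
```

The three glyph-like components fall into the same size class, so the histogram has a single dominant peak, while the one-pixel speck is removed before counting.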
Abstract:
A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components contained in the binarized image data and detects the circumscribing bounding box of each connected component. Based on the sizes of the detected circumscribing bounding boxes and the numbers of black pixels they contain, predetermined connected components are removed. A determining section generates an edge map from the remaining connected components and performs a two-dimensional fast Fourier transform on it to generate spectral data. The determining section also performs a two-dimensional fast Fourier transform on template images to generate spectral data. Based on these pieces of spectral data, the determining section determines whether or not the input image data contains a circular shape.
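One plausible way to compare the two sets of spectral data is cross-correlation in the frequency domain. The abstract does not say how the spectra are compared, so that choice is an assumption. The sketch below also uses a naive discrete Fourier transform from the standard library instead of a fast Fourier transform: it yields the same spectrum, only more slowly. It skips edge-map generation and treats the input as already being an edge map. The names `ring` and `match_score` are hypothetical.

```python
import cmath
import math

def dft1(seq, inverse=False):
    """Naive 1-D discrete Fourier transform (same values an FFT would produce)."""
    n = len(seq)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(v * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                  for t, v in enumerate(seq))
        out.append(acc / n if inverse else acc)
    return out

def dft2(grid, inverse=False):
    """2-D transform: transform every row, then every column."""
    rows = [dft1(row, inverse) for row in grid]
    cols = [dft1(col, inverse) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]

def ring(size, cy, cx, radius):
    """Binary image of a one-pixel-wide circle outline."""
    return [[1 if abs(math.hypot(y - cy, x - cx) - radius) < 0.5 else 0
             for x in range(size)] for y in range(size)]

def match_score(edge_map, template):
    """Peak of the circular cross-correlation, normalised by the template energy.

    Returns (score, (dy, dx)); a score near 1.0 means the template shape
    appears in the edge map, shifted by (dy, dx).
    """
    f_spec, t_spec = dft2(edge_map), dft2(template)
    product = [[a * b.conjugate() for a, b in zip(f_row, t_row)]
               for f_row, t_row in zip(f_spec, t_spec)]
    corr = dft2(product, inverse=True)
    energy = sum(v * v for row in template for v in row)
    peak, where = max((corr[y][x].real, (y, x))
                      for y in range(len(corr)) for x in range(len(corr[0])))
    return peak / energy, where

template = ring(8, 3, 3, 2)
circle_image = ring(8, 4, 5, 2)                       # same circle, shifted by (1, 2)
line_image = [[1 if y == 4 else 0 for _ in range(8)] for y in range(8)]

circle_score, shift = match_score(circle_image, template)
line_score, _ = match_score(line_image, template)
print(round(circle_score, 3), shift, round(line_score, 3))
```

A decision rule could then threshold the score, for example treating anything above 0.8 as containing a circle. That threshold is illustrative and would need tuning on real data, as would the set of template radii.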
Abstract:
A headline-region initial processing section clips a headline-region image from an image document, divides the image into individual character images, and extracts features of the individual character images. Based on the features, a candidate-character-sequence generating section selects N character images (N being an integer greater than 1) as candidate characters, in order of degree of matching, from a font-feature dictionary that stores features of individual character images, and generates an M×N index matrix, where M is the number of characters in the extracted character sequence. Based on the index matrix, a document-name generating section generates a meaningful document name for the image document. An image-document-DB management section manages the accumulated image documents using the document name. This provides an image document processing device and an image document processing method that each automatically generate and manage a meaningful document name representing the contents of the image document, without user operation.
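The M×N index matrix can be illustrated with a toy example. The sketch below assumes that features are short numeric vectors and that "degree of matching" means smallest Euclidean distance. Neither is stated in the abstract, and the dictionary contents and the function name `build_index_matrix` are made up for illustration.

```python
import math

def build_index_matrix(char_features, dictionary, n):
    """Return an M x N matrix; row i lists the N best-matching characters for image i.

    char_features: one feature vector per segmented character image (M entries).
    dictionary:    maps each known character to its stored feature vector.
    """
    matrix = []
    for feature in char_features:
        # Rank every dictionary character by distance to this image's features.
        ranked = sorted(dictionary, key=lambda ch: math.dist(feature, dictionary[ch]))
        matrix.append(ranked[:n])
    return matrix

# Hypothetical two-dimensional features for a handful of characters.
font_features = {
    "c": (10, 0), "e": (9, 2), "a": (0, 10),
    "u": (2, 8), "t": (5, 9), "f": (6, 7),
}
# Features extracted from three character images (M = 3).
headline = [(9, 1), (1, 10), (5, 8)]

index_matrix = build_index_matrix(headline, font_features, n=2)
print(index_matrix)   # [['e', 'c'], ['a', 'u'], ['t', 'f']]
```

Each row keeps the runner-up candidate as well as the best match, which is what lets a later stage swap in a different character if the first column does not form a meaningful name.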
Abstract:
An image of a character string composed of M characters is clipped from a document image, and the image is divided into separate characters. Image features of each character image are extracted. Based on the image features, N character images (N being an integer greater than 1) are selected as candidate characters, in descending order of degree of similarity, from a character image feature dictionary that stores the image features of character images on a per-character basis, and a first index matrix of M×N cells is prepared. A candidate character string composed of the candidate characters constituting the first column of the first index matrix is subjected to a lexical analysis according to a language model, whereby a second index matrix having a character string that makes sense is prepared. In the language model, statistics are gathered first, and the lexical analysis is then performed.
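The step from the first index matrix to the second can be sketched as a best-path search over the candidates. The abstract does not specify the language model, so this example assumes a character bigram model whose statistics are counted from a small word list, with a crude add-one log score and a penalty for lower-ranked candidates. The names `train_bigrams`, `adjust_index_matrix`, and `rank_penalty` are hypothetical.

```python
import math
from collections import Counter

def train_bigrams(words):
    """Gather statistics: count adjacent character pairs, with '^' marking word start."""
    counts = Counter()
    for word in words:
        padded = "^" + word
        counts.update(zip(padded, padded[1:]))
    return counts

def adjust_index_matrix(matrix, bigrams, rank_penalty=0.5):
    """Pick the most plausible character per row and move it to the first column."""
    def score(prev, ch, rank):
        # Add-one smoothed log count, minus a penalty for lower-ranked candidates.
        return math.log(bigrams[(prev, ch)] + 1) - rank_penalty * rank

    # best[ch] = (total score, path) for the best path ending in ch at the current row.
    best = {"^": (0.0, [])}
    for row in matrix:
        nxt = {}
        for rank, ch in enumerate(row):
            total, path = max(
                ((s + score(prev, ch, rank), p) for prev, (s, p) in best.items()),
                key=lambda item: item[0],
            )
            nxt[ch] = (total, path + [ch])
        best = nxt
    chosen = max(best.values(), key=lambda item: item[0])[1]
    return [[ch] + [c for c in row if c != ch] for row, ch in zip(matrix, chosen)]

bigrams = train_bigrams(["the", "then", "this", "line"])
first = [["l", "t"], ["h", "n"], ["e", "c"]]      # first column reads "lhe"
second = adjust_index_matrix(first, bigrams)
print(second)   # [['t', 'l'], ['h', 'n'], ['e', 'c']]
```

Here the first column changes from "lhe" to "the" because the pairs "t-h" and "h-e" are frequent in the training words, which outweighs the rank penalty on "t".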
Abstract:
An image of a character string composed of M characters is clipped from a document image, the image is divided character by character, and image features of each character image are extracted. On the basis of the image features, N character images (N being an integer greater than 1) are selected as candidate characters, in descending order of degree of similarity, from a character image feature dictionary that stores the image features of character images on a per-character basis, and a first index matrix of M×N cells is prepared. A candidate character string composed of the candidate characters constituting the first column of the first index matrix is subjected to a lexical analysis according to a predetermined language model, whereby a second index matrix, adjusted so that it contains a character string that makes sense, is prepared for use in searching.
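The abstract says the index matrix is used for searching but does not describe the search itself. One simple reading, offered here purely as an assumption, is that a keyword matches a run of consecutive rows when each of its characters appears among that row's candidates, with earlier columns weighted more heavily. The function `find_keyword` and its scoring are hypothetical.

```python
def find_keyword(index_matrix, keyword):
    """Return (start row, score) of the best match, or None if no window matches."""
    best = None
    for start in range(len(index_matrix) - len(keyword) + 1):
        score = 0
        for offset, ch in enumerate(keyword):
            row = index_matrix[start + offset]
            if ch not in row:
                break                                # this window cannot match
            score += len(row) - row.index(ch)        # earlier column, higher weight
        else:
            if best is None or score > best[1]:
                best = (start, score)
    return best

# A six-row index matrix whose first column reads "thecat".
matrix = [["t", "l"], ["h", "n"], ["e", "c"], ["c", "e"], ["a", "u"], ["t", "f"]]
print(find_keyword(matrix, "cat"))   # (3, 6)
print(find_keyword(matrix, "cut"))   # (3, 5)
print(find_keyword(matrix, "dog"))   # None
```

Because lower-ranked candidates still count, a query can hit a string whose top recognition result was wrong, at the cost of a lower score.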
Abstract:
A handwriting recognition method and a handwriting recognition device are provided for conveniently recognizing a character sequence continuously input by a user. The method comprises the steps of: calculating various features of the input character sequence, including single-character recognition accuracy features and spatial geometry features of different stroke combinations in the input character sequence; calculating segmentation reliabilities of the respective stroke combinations in different segmentation patterns by using a probabilistic model whose coefficients are estimated by a parameter estimation method through training on samples; recognizing characters in different writing patterns by using a multiple-template matching method when performing single-character recognition of the stroke combinations; searching for the best segmentation path; and conducting post-processing to optimize the recognition results. The method and device have the advantages of simple structure, low hardware requirements, fast recognition speed, and high recognition accuracy, and can be implemented in an embedded system.
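The search for the best segmentation path can be sketched as dynamic programming over stroke boundaries. The example assumes the segmentation reliabilities have already been computed and are given as a table of probabilities. The function name `best_segmentation`, the `max_group` limit, and the sample numbers are all illustrative. The probabilistic model, the template matching, and the post-processing from the abstract are not modelled.

```python
import math

def best_segmentation(num_strokes, reliability, max_group=3):
    """Split strokes 0..num_strokes-1 into characters with maximum joint reliability.

    reliability[(i, j)] is the assumed probability that strokes i..j-1 form one
    character. Returns the list of (start, end) groups on the best path.
    """
    # best[k] = (log-probability, groups) of the best way to segment strokes 0..k-1.
    best = [(-math.inf, [])] * (num_strokes + 1)
    best[0] = (0.0, [])
    for end in range(1, num_strokes + 1):
        for start in range(max(0, end - max_group), end):
            p = reliability.get((start, end), 0.0)
            if p <= 0.0 or best[start][0] == -math.inf:
                continue                          # group impossible or prefix unreachable
            total = best[start][0] + math.log(p)
            if total > best[end][0]:
                best[end] = (total, best[start][1] + [(start, end)])
    return best[num_strokes][1]

# Hypothetical reliabilities for four strokes.
reliability = {
    (0, 1): 0.3, (0, 2): 0.9, (1, 2): 0.4, (1, 3): 0.2,
    (2, 3): 0.6, (2, 4): 0.8, (3, 4): 0.7,
}
print(best_segmentation(4, reliability))   # [(0, 2), (2, 4)]
```

Summing log-probabilities keeps the comparison numerically stable for long inputs, and limiting the group size bounds the work at each boundary, which suits the low hardware requirements the abstract mentions.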