摘要:
More accurate retrieval of original documents is conducted by adaptively evaluating retrieval results, which are obtained by retrieving attributes, in accordance with layout information. To achieve this, there is provided an information processing method for retrieving image data that is similar to an entered document image, the method including a step (S402) of segmenting the entered document image into a plurality of areas on a per-attribute basis; a step of calculating degree of similarity, for every area obtained by segmentation, using a retrieval step suited to the attribute; and a step (S406) of calculating overall degree of similarity in which the degree of similarity calculated for every area obtained by segmentation is weighted.
摘要:
To accurately extract embedded information from a document image using line spacing watermark, an image processing apparatus for extracting watermark information includes an input unit which inputs a document image as image data, an image reduction unit which generates, from the image data, reduced image data reduced in the first direction, a detection unit which scans the reduced image data in the second direction and detects the length of a blank region as line spacing information, and an extraction unit which extracts watermark information embedded in the document image based on the line spacing information.
摘要:
An information hiding system such as a digital-watermark information system has an embedding side and an extracting side for recognizing an identical character string in which digital-watermark information is embedded, and the extracting side reliably extracts all digital-watermark information from the character string. In order to realize this, an image processing method of the present invention includes a determining step of determining whether a predetermined distance between each target character and an adjoining character can be maintained if any digital-watermark information which may be embedded in the target character is actually embedded; and a setting step of setting an object which is to be embedded with digital-watermark information according to the determination result.
摘要:
Distance d1 between the right edges of A1 and B2, and distance d2 between the right edges of A3 and B4 are calculated. If data to be embedded is 1, one or a combination of a process for increasing the size of B2 in the column direction or decreasing the size of B4 in the column and a process for moving the position of B2 toward B3 or moving the position of B4 toward B3 is executed to meet d1>d2. If data to be embedded is 0, one or a combination of a process for decreasing the size of B2 in the column direction or increasing the size of B4 in the column direction, and a process for moving the position of B2 toward B1 or moving the position of B4 toward B5 is executed to meet d1
摘要:
To accurately extract embedded information from a document image using line spacing watermark, an image processing apparatus for extracting watermark information includes an input unit which inputs a document image as image data, an image reduction unit which generates, from the image data, reduced image data reduced in the first direction, a detection unit which scans the reduced image data in the second direction and detects the length of a blank region as line spacing information, and an extraction unit which extracts watermark information embedded in the document image based on the line spacing information.
摘要:
An apparatus receives a document image and digital-watermark information and determines an embedding capacity based on a number of the letters in the document image. The apparatus determines whether or not the entire digital-watermark information is capable of being embedded in the document image based on the determined embedding capacity and embeds the digital-watermark information in the document image based on a result of the determination of whether or not the entire digital-watermark information is capable of being embedded in the document image.
摘要:
A method for embedding a digital watermark includes a step of inputting digital watermark information; a step of inputting an image; a step of dividing the image into a plurality of areas; a step of ordering the plurality of areas according to a predetermined ordering criterion; a step of embedding the digital watermark information over the plurality of areas that have been ordered; and a step of outputting an image with the digital watermark information embedded therein.
摘要:
An apparatus receives a document image and digital-watermark information and determines an embedding capacity based on a number of the letters in the document image. The apparatus determines whether or not the entire digital-watermark information is capable of being embedded in the document image based on the determined embedding capacity and embeds the digital-watermark information in the document image based on a result of the determination of whether or not the entire digital-watermark information is capable of being embedded in the document image.
摘要:
An apparatus for embedding a digital watermark in a document image detects circumscribing outer shapes of characters in the document image and sets a plurality of reference lines that extend in the column direction and are spaced apart in the row direction by a basic pitch. The outer shapes include a first outer shape, a second outer shape that neighbors the first outer shape, and a third outer shape that neighbors the second outer shape, and the reference lines include a first reference line located between the first outer shape and the second outer shape, and a second reference line located between the second outer shape and the third outer shape. Control is performed for at least one of the second and third outer shapes so that a distance between the first reference line and an edge of the second outer shape is different from a distance between the second reference line and an edge of the third outer shape, in accordance with digital watermark information to be embedded.
摘要:
It is required to protect the copyrights and the like of partial images which form respective parts of an image obtained by reading an image, exchanged using a print as a medium, by an image scanner or the like. Input image data is divided into a plurality of image regions having different features, digital watermarks, which are embedded in the detected image regions by embedding methods corresponding to the features of the image regions, are extracted, and the availability of the input image is checked on the basis of the extracted digital watermarks.