Abstract:
A fast, raster-based, enhanced data compression technique for printing compound documents pre-processes images in the original page description form of the data, before the page data is rasterized, which greatly improves compressibility. Unscaled image data is filtered before it is rasterized to the final printing resolution. The filtering specifically enables a separate, near-lossless compression algorithm to operate on the rasterized page description with high compression ratios. The technique thus enables a system to compress compound documents using a relatively fast and simple algorithm with near-lossless print quality.
Abstract:
A system and method for compressing and decompressing data using a plurality of data compression mechanisms. Representative samples of each block of data are tested to select an appropriate one of the data compression mechanisms to apply to the block. The block is then compressed using the selected one of the mechanisms and the compressed block is provided with an identifier of the selected mechanism. For decompression, the identifier is examined to select an appropriate one of the data decompression mechanisms to apply to the block. The block is then decompressed using the selected one of the mechanisms.
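The selection step described above can be sketched as follows. This is only an illustration under stated assumptions: the codec registry `CODECS`, the one-byte identifier, and the sampling helper `sample` are hypothetical, and Python's standard `zlib`, `bz2`, and `lzma` modules stand in for whatever mechanisms the system actually uses.

```python
import bz2
import lzma
import zlib

# Hypothetical registry: one-byte identifier -> (compress, decompress).
CODECS = {
    0: (lambda b: b, lambda b: b),          # stored (no compression)
    1: (zlib.compress, zlib.decompress),
    2: (bz2.compress, bz2.decompress),
    3: (lzma.compress, lzma.decompress),
}


def sample(block, pieces=4, size=256):
    """Take a few evenly spaced slices as a representative sample."""
    if len(block) <= pieces * size:
        return block
    step = len(block) // pieces
    return b"".join(block[i * step:i * step + size] for i in range(pieces))


def compress_block(block):
    """Test each mechanism on the sample, then compress the whole block."""
    probe = sample(block)
    best_id = min(CODECS, key=lambda cid: len(CODECS[cid][0](probe)))
    # Prefix the compressed block with the identifier of the chosen mechanism.
    return bytes([best_id]) + CODECS[best_id][0](block)


def decompress_block(tagged):
    """Read the identifier and apply the matching decompression mechanism."""
    return CODECS[tagged[0]][1](tagged[1:])
```

Testing only a sample keeps the selection cheap, at the cost of occasionally picking a mechanism that is not the best one for the full block.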
Abstract:
A data compression and decompression apparatus and method providing a high compression ratio. The data compressing apparatus includes, for example, an obtaining section, a first coding section, a first controlling section, a judging section, a second coding section, and a second controlling section. Characters are obtained one after another from a compression target. A code corresponding to the character is then output when the character and a predetermined number of characters are not identical. Subsequently, the number of characters obtained is counted while the character and the predetermined number of characters are identical, and a code corresponding to the number of characters counted is output when the character obtained is not identical with the predetermined number of characters.
Abstract:
A peripheral unit converts an input data flow to page-arranged outputs and includes a random access memory capacity that is insufficient in size to accommodate an entire page of raster data. The peripheral unit also includes a processor and a control memory that holds a plurality of data compression procedures, each procedure exhibiting a different performance characteristic. The peripheral unit performs a method for compressing portions of the input data flow that includes the steps of: allocating the random access memory to portions of the input data flow; determining when an insufficient amount of random access memory is available for such allocation; employing a first data compression procedure on the input data flow portions to produce a compressed data portion; testing the compressed data portion to determine if a level of compression has been achieved that exceeds a threshold and, if not, employing succeeding data compression procedures and repeating the test for each procedure against a threshold, whereby the compression procedure that first enables a threshold level of compression to be achieved is the compression procedure employed to compress the data flow portion. Improved compression methods and techniques for handling input data flows with both integral and independent image descriptors are also described.
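The "try each procedure until the threshold is met" loop can be sketched as below. The procedure list `PROCEDURES`, its ordering, and the single shared threshold `min_ratio` are assumptions made for illustration; the abstract does not name the procedures, and returning `None` when no procedure qualifies is likewise an assumption.

```python
import bz2
import zlib

# Ordered cheapest first; names and ordering are illustrative assumptions.
PROCEDURES = [
    ("zlib-1", lambda d: zlib.compress(d, 1)),
    ("zlib-9", lambda d: zlib.compress(d, 9)),
    ("bz2", bz2.compress),
]


def compress_to_threshold(data, min_ratio, procedures=PROCEDURES):
    """Return (name, compressed) for the first procedure meeting min_ratio."""
    for name, proc in procedures:
        packed = proc(data)
        ratio = len(data) / max(len(packed), 1)
        if ratio >= min_ratio:
            return name, packed
    # Assumption: report failure when no procedure reaches the threshold.
    return None
```

Because the list is ordered by cost, the cheapest procedure that satisfies the memory constraint wins, and slower procedures run only when the faster ones fall short.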
Abstract:
A method and apparatus for variable length coding/decoding of image data vary the length of an escape sequence during variable length coding, thereby reducing the amount of transmitted data. The method includes the steps of: loading an initial potential maximum run length; determining the number of bits necessary to express the potential maximum run length; coding a run length using the necessary number of bits; and decreasing the potential maximum run length by an amount equal to the previously coded run length to arrive at a new potential maximum run length, repeating until the end of the block to be encoded has been reached.
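The shrinking field width can be illustrated with a small sketch. It models only the run-length field; the rest of the escape sequence is omitted. The function names, the default `max_run=63`, and passing the number of runs to the decoder are assumptions made for this example.

```python
def encode_runs(runs, max_run=63):
    """Code each run with just enough bits for the current maximum run."""
    out = []
    for run in runs:
        if not 0 <= run <= max_run:
            raise ValueError("run exceeds the remaining maximum")
        width = max_run.bit_length()   # bits needed to express max_run
        if width:
            out.append(format(run, f"0{width}b"))
        max_run -= run                 # new potential maximum run length
    return "".join(out)


def decode_runs(bits, count, max_run=63):
    """Mirror the encoder: track the same maximum to know each field width."""
    runs, pos = [], 0
    for _ in range(count):
        width = max_run.bit_length()
        run = int(bits[pos:pos + width], 2) if width else 0
        pos += width
        runs.append(run)
        max_run -= run
    return runs
```

For the runs 40, 20, 3 the field widths are 6, 5, and 2 bits, so the three runs take 13 bits instead of the 18 a fixed 6-bit field would need.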
Abstract:
A fast calculation circuit for a cyclic redundancy check code of compressed data having a compressed area and an uncompressed area, the compressed area containing at least a data byte representative of the content of consecutive same data bytes and a data byte representative of the number of consecutive same data bytes. The fast calculation circuit includes a cyclic redundancy check code calculation unit for calculating a cyclic redundancy check code of data in the uncompressed area by using a generating polynomial; a fast calculation matrix unit for storing beforehand a plurality of cyclic redundancy check code calculation results of data contents and consecutive data byte numbers; and a detection unit for detecting the compressed area; wherein when the detection unit detects the compressed area, a cyclic redundancy check code of data in the detected compressed area is calculated by the fast calculation matrix unit, and when the detection unit does not detect the compressed area, a cyclic redundancy check code of data in the uncompressed area is calculated by the cyclic redundancy check code calculation unit.
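One possible software analogue of the precomputed matrix is sketched below. It is not taken from the abstract: it relies on the fact that a CRC register update is linear over GF(2), so the effect of a run of identical bytes can be looked up instead of recomputed. The polynomial `0x1021`, the initial value `0xFFFF`, the limit `MAX_RUN`, and the `("raw", ...)` / `("run", ...)` item layout are all assumptions chosen for illustration.

```python
POLY = 0x1021   # CRC-16/CCITT polynomial, chosen only for illustration
MAX_RUN = 32    # longest run the tables cover (hypothetical limit)


def step(crc, byte):
    """Advance a 16-bit CRC register by one data byte, bit by bit."""
    crc ^= byte << 8
    for _ in range(8):
        crc = ((crc << 1) ^ POLY) if crc & 0x8000 else (crc << 1)
        crc &= 0xFFFF
    return crc


def crc_bytes(data, crc=0xFFFF):
    """Ordinary byte-by-byte CRC, used for the uncompressed area."""
    for b in data:
        crc = step(crc, b)
    return crc


# RUN_TABLE[n][b]: register after n copies of byte b, starting from 0.
# ZERO_TABLE[n][i]: register after n zero bytes, starting from 1 << i.
RUN_TABLE = [[0] * 256]
ZERO_TABLE = [[1 << i for i in range(16)]]
for _ in range(MAX_RUN):
    RUN_TABLE.append([step(RUN_TABLE[-1][b], b) for b in range(256)])
    ZERO_TABLE.append([step(z, 0) for z in ZERO_TABLE[-1]])


def crc_run(crc, byte, count):
    """CRC update for `count` copies of `byte` using table lookups only."""
    result = RUN_TABLE[count][byte]
    for i in range(16):
        if (crc >> i) & 1:
            result ^= ZERO_TABLE[count][i]
    return result


def crc_compressed(items, crc=0xFFFF):
    """Items are ("raw", data) or ("run", byte, count) with count <= MAX_RUN."""
    for item in items:
        if item[0] == "run":
            crc = crc_run(crc, item[1], item[2])
        else:
            crc = crc_bytes(item[1], crc)
    return crc
```

The result matches the CRC of the fully expanded data, but a run costs at most 17 table lookups regardless of its length.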
Abstract:
The improved data compression system concurrently processes both strings of repeated characters and textual substitution of input character strings. In this system, the performance of data compression techniques based on textual substitution is improved by the use of a compact representation for identifying instances in which a character in the input data stream is repeated. This is accomplished by nesting a run length encoding system in the textual substitution system. This structure adds the recognition of runs of a repeated character before the processor performs the textual substitution data compression operation. A further performance improvement is obtained by expanding the alphabet of symbols stored in the compressor's dictionary to include both the characters of the input data stream and repeat counts which indicate the repetition of a character. The handling of these repeat counts by the textual substitution based compression technique is either no different from the handling of characters, or certain modifications are made in the handling of repeat counts.
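A minimal sketch of the nesting, assuming LZW as the textual substitution method (the abstract does not name one). The alphabet is expanded to 512 symbols: 0..255 are literal bytes and `256 + n` is a hypothetical repeat-count symbol meaning "repeat the previous byte n more times". Here the dictionary treats repeat counts exactly like characters.

```python
ALPHABET = 512  # 0..255 literal bytes; 256+n = repeat previous byte n times


def rle_symbols(data):
    """Replace each run with its byte followed by one repeat-count symbol."""
    out, i = [], 0
    while i < len(data):
        j = i
        while j + 1 < len(data) and data[j + 1] == data[i] and j - i < 255:
            j += 1
        out.append(data[i])
        if j > i:
            out.append(256 + (j - i))
        i = j + 1
    return out


def symbols_to_bytes(symbols):
    out = bytearray()
    for s in symbols:
        if s < 256:
            out.append(s)
        else:
            out.extend(out[-1:] * (s - 256))
    return bytes(out)


def lzw_encode(symbols):
    table = {(s,): s for s in range(ALPHABET)}
    codes, cur = [], ()
    for s in symbols:
        nxt = cur + (s,)
        if nxt in table:
            cur = nxt
        else:
            codes.append(table[cur])
            table[nxt] = len(table)   # learn the new string
            cur = (s,)
    if cur:
        codes.append(table[cur])
    return codes


def lzw_decode(codes):
    table = {s: (s,) for s in range(ALPHABET)}
    out, prev = [], None
    for c in codes:
        # A code not yet in the table is the "string + its first symbol" case.
        entry = table[c] if c in table else prev + (prev[0],)
        out.extend(entry)
        if prev is not None:
            table[len(table)] = prev + (entry[0],)
        prev = entry
    return out


def compress(data):
    return lzw_encode(rle_symbols(data))


def decompress(codes):
    return symbols_to_bytes(lzw_decode(codes))
```

Because a long run collapses to two symbols before the dictionary stage sees it, the dictionary is not filled with entries that merely spell out ever-longer runs of one character.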
Abstract:
A television system (for example) may transmit a stream of signals in the form of successive binary words. The signal transmission channel may be used much more efficiently if these binary words are compressed by a reduction of the signal bits before transmission or expanded after transmission in order to reconstitute the original signal. Regardless of why the signals are compressed, the invention improves the transmission quality by sending a combination of fixed length and variable length binary words. This is done by providing a fixed length signal converter in parallel with a variable length signal converter. A decision-making circuit separates the successive binary words responsive to logical decision-making rules relating to an average length word. A selector switch selects between the outputs of the two converters responsive to the logical decisions. Stated another way, one converter is used for signal words which are longer than average and the other converter is used for signal words which are shorter than average so that the combination causes all signals to have an average number of bits which is much less than the original number of bits.
Abstract:
A method and system for compacting text data to be transmitted over communications lines, thereby reducing the data volume and transmission time. Transmitting and receiving text processing systems are provided with identical library memories containing text strings such as words commonly used in correspondence. Each word in a document to be communicated is compared to the transmitting system's word library and, if found in the library, only the library address is transmitted. If the word is not found in the library, then it is added to the transmitting system's library, sent, and added to the receiving system's library. The receiving system reconstructs the document by using the received addresses to access the appropriate words from its library and place them in the document. The system combines this word match encoding with character match encoding and facsimile run length encoding for communicating words not found in the system library.
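The word-library exchange can be sketched as follows. The class `Endpoint`, the message tuples `("addr", n)` and `("word", w)`, and whitespace tokenization are assumptions made for illustration; the character match and facsimile run length encoding applied to unknown words are left out, so new words are sent as plain strings here.

```python
class Endpoint:
    """One side of the link, holding its copy of the shared word library."""

    def __init__(self, seed_words):
        self.words = list(seed_words)
        self.index = {w: i for i, w in enumerate(self.words)}

    def learn(self, word):
        self.index[word] = len(self.words)
        self.words.append(word)


def transmit(sender, text):
    """Send a library address for known words, the word itself otherwise."""
    messages = []
    for word in text.split():
        if word in sender.index:
            messages.append(("addr", sender.index[word]))
        else:
            sender.learn(word)              # keep the sender's library in step
            messages.append(("word", word))
    return messages


def receive(receiver, messages):
    """Rebuild the document, adding newly received words to the library."""
    out = []
    for kind, value in messages:
        if kind == "addr":
            out.append(receiver.words[value])
        else:
            receiver.learn(value)
            out.append(value)
    return " ".join(out)
```

Both libraries grow in the same order, so an address assigned to a new word by the sender refers to the same word at the receiver the next time it appears.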