摘要:
In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint within a window of symbols at an offset in the input data, determining whether the offset is to be designated as a cut point and segmenting the input data as indicated by the set of cut points. For each segment so identified, the encoder determines whether the segment is to be a referenced segment or an unreferenced segment, replacing the segment data of each referenced segment with a reference label and storing a reference binding in a persistent segment store for each referenced segment, if needed. Hierarchically, the process can be repeated by grouping references into groups, replacing the grouped references with a group label, storing a binding between the grouped references and group label, if one is not already present, and repeating the process. The number of levels of hierarchy can be fixed in advanced or it can be determined from the content encoded.
摘要:
A method of lossless data compression is provided which uses a grammar transform to sequentially construct a sequence of greedy context-dependent grammars from which an original data sequence can be recovered incrementally. The data sequence is encoded using any one of a sequential context-dependent method, an improved sequential context-dependent method, and a hierarchical context-dependent method.
摘要:
An EBCOT codec (1) is provided which includes a bit modeling unit (11), arithmetic codec (12), input FIFO memory (13), output FIFO memory (14) and a controller (16). The input and output FIFO memories (13) and (14) have a function to control the bit length of to-be-stored data correspondingly to that of supplied data. For the coding, the input FIFO memory (13) is supplied with a wavelet transform coefficient of 16 bits, and for the decoding, it is supplied with a code data of 8 bits. For the coding, the output FIFO memory (14) outputs 8-bit code data, and for the decoding, it outputs wavelet transform coefficients of 16 bits. Thus, the circuit scale can be reduced and data transfer speed be improved.
摘要:
In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint within a window of symbols at an offset in the input data, determining whether the offset is to be designated as a cut point and segmenting the input data as indicated by the set of cut points. For each segment so identified, the encoder determines whether the segment is to be a referenced segment or an unreferenced segment, replacing the segment data of each referenced segment with a reference label and storing a reference binding in a persistent segment store for each referenced segment, if needed. Hierarchically, the process can be repeated by grouping references into groups, replacing the grouped references with a group label, storing a binding between the grouped references and group label, if one is not already present, and repeating the process. The number of levels of hierarchy can be fixed in advanced or it can be determined from the content encoded.
摘要:
A data compressor includes a matrix of AND-gates corresponding to a respective plurality of strings. An AND-gate has inputs responsive, respectively, to a representation of a prefix code and a representation of a fetched character for energizing the AND-gate output. The AND-gate outputs are coupled, respectively, to the inputs of a matrix switch and the matrix switch outputs have respective string codes assigned thereto. The matrix switch is controllable for coupling any one of the matrix switch inputs to a selected one of the matrix switch outputs. Energization of an AND-gate output coupled to a matrix switch output provides a representation of the code assigned thereto. A prefix decoder responsive to the provided representations of codes assigned to the matrix switch outputs provides decoder outputs to the prefix code inputs of the AND-gates. A character decoder responsive to fetched characters provides decoder outputs to the character inputs of the AND-gates.
摘要:
A method for compressing an input sequence of data portions is disclosed. The input sequence is compressed using a Lempel-Ziv technique to generate an output codestream. The codestream includes an ordered sequence of codewords corresponding to and separate from a stream of at least one sequence of non-matchable portions in the input sequence. Each codeword includes three data items denoting a length of a non-matchable sequence receding a matchable first sequence, the offset associated therewith and the length of the matchable first sequence. The codewords are used to reference sequences of data portions which previously appeared when decompressing the output codestream to allow the input sequence to be rebuilt. A program storage device and a compressing system for providing the above method are also disclosed.
摘要:
A character string of which a start point is each address of character string data in an input buffer is rearranged in the predetermined order, so that a rank list is generated. Next, the location of the matching candidate of a character string to be encoded is obtained on the basis of the rank list. Then, the character string to be encoded is compared with a matching candidate, thereby obtaining a matching length. Further, a code is generated using the location of the matching candidate and the matching length, and the code is output as compression data.
摘要:
A symbol string detection unit detects the second symbol string matching the first symbol string having a predetermined length n from input character strings. A matching length detection unit detects a matching length k between the third symbol string following the first symbol string and the fourth symbol string following the second symbol string. A coding unit codes an input symbol string based on the symbol string detected by the symbol string detection unit and the matching length k detected by the matching length detection unit.
摘要:
An intelligence bearing signal in the form of a string of digitized analog signals is communicated from a source site to a destination site. The signal is in the form of a string of digitized analog signal samples. A sub-string dictionary, a linked list and an ID list are provided at the source site and the destination site, and are used to compress the intelligence bearing signal for faster transmission.
摘要:
A data compressor for compressing an input stream of data characters into an output stream of compressed codes includes a plurality of AND-gates corresponding to a respective plurality of codes to be assigned to strings. Each string comprises a prefix string, having an associated prefix code, and an extension character. An AND-gate has a prefix code input and a character input for enabling the AND-gate, the energized output of an AND-gate providing a representation of the code corresponding thereto. The compressor includes a first matrix switch for selectively coupling the provided representations of codes corresponding to the AND-gates to the prefix code inputs of the AND-gates and a second matrix switch for selectively coupling representations of data characters fetched from the input stream to the character inputs of the AND-gates. Data characters are sequentially fetched from the input stream so as to sequentially enable AND-gates until a last data character is fetched that does not result in an enabled AND-gate. The code is output that corresponds to the last enabled AND-gate, thereby providing the stream of compressed codes.