Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation
    1.
    Granted invention patent (status: active)

    Publication No.: US06828925B2

    Publication date: 2004-12-07

    Application No.: US10731687

    Filing date: 2003-12-08

    IPC classification: H03M7/34

    Abstract: In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint within a window of symbols at an offset in the input data, determining whether the offset is to be designated as a cut point and segmenting the input data as indicated by the set of cut points. For each segment so identified, the encoder determines whether the segment is to be a referenced segment or an unreferenced segment, replacing the segment data of each referenced segment with a reference label and storing a reference binding in a persistent segment store for each referenced segment, if needed. Hierarchically, the process can be repeated by grouping references into groups, replacing the grouped references with a group label, storing a binding between the grouped references and group label, if one is not already present, and repeating the process. The number of levels of hierarchy can be fixed in advance or it can be determined from the content encoded.
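
The segmentation step in this abstract can be illustrated with a minimal Python sketch. Several details are assumptions that the source does not state: the fingerprint is a truncated SHA-1 of the window (a production encoder would more likely use a rolling hash), an offset becomes a cut point when its fingerprint is divisible by the target segment size, segments shorter than `min_ref` stay unreferenced, and reference labels are truncated content hashes. All function and parameter names are hypothetical, and the hierarchical grouping of references is not shown.

```python
import hashlib


def cut_points(data: bytes, window: int = 16, target: int = 64) -> list[int]:
    """Return offsets chosen as cut points from the content alone."""
    cuts = []
    for end in range(window, len(data)):
        # Fingerprint of the `window` bytes that end at this offset (assumed: SHA-1).
        digest = hashlib.sha1(data[end - window:end]).digest()
        fingerprint = int.from_bytes(digest[:4], "big")
        # Assumed cut rule: on average one offset in `target` qualifies.
        if fingerprint % target == 0:
            cuts.append(end)
    return cuts


def segment(data: bytes, window: int = 16, target: int = 64) -> list[bytes]:
    """Split the input at the cut points."""
    bounds = [0] + cut_points(data, window, target) + [len(data)]
    return [data[a:b] for a, b in zip(bounds, bounds[1:]) if b > a]


def encode_segments(data: bytes, store: dict, min_ref: int = 8) -> list[tuple]:
    """Replace each referenced segment with a label and record its binding."""
    tokens = []
    for seg in segment(data):
        if len(seg) < min_ref:
            tokens.append(("raw", seg))          # unreferenced segment
        else:
            label = hashlib.sha1(seg).hexdigest()[:16]  # assumed label scheme
            store.setdefault(label, seg)         # store the binding only if new
            tokens.append(("ref", label))
    return tokens


def decode_segments(tokens: list[tuple], store: dict) -> bytes:
    """Rebuild the input by resolving labels through the segment store."""
    return b"".join(store[v] if kind == "ref" else v for kind, v in tokens)
```

Because each cut decision depends only on the bytes inside the window, inserting data near the start of the input shifts the later cut points without changing the segments between them, so those segments resolve to bindings already in the store.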

    Method for lossless data compression using greedy sequential context-dependent grammar transform
    2.
    Granted invention patent (status: active)

    Publication No.: US06801141B2

    Publication date: 2004-10-05

    Application No.: US10438357

    Filing date: 2003-05-14

    Applicant: En-Hui Yang, Da-ke He

    Inventor: En-Hui Yang, Da-ke He

    IPC classification: H03M7/34

    CPC classification: H03M7/4006, H03M7/3084

    Abstract: A method of lossless data compression is provided which uses a grammar transform to sequentially construct a sequence of greedy context-dependent grammars from which an original data sequence can be recovered incrementally. The data sequence is encoded using any one of a sequential context-dependent method, an improved sequential context-dependent method, and a hierarchical context-dependent method.

    Entropy encoder/decoder
    3.
    Granted invention patent (status: expired)

    Publication No.: US06765510B2

    Publication date: 2004-07-20

    Application No.: US10371630

    Filing date: 2003-02-20

    IPC classification: H03M7/34

    CPC classification: G06T9/005, H03M7/4006

    Abstract: An EBCOT codec (1) is provided which includes a bit modeling unit (11), an arithmetic codec (12), an input FIFO memory (13), an output FIFO memory (14) and a controller (16). The input and output FIFO memories (13) and (14) can adjust the bit length of the data to be stored to match that of the supplied data. For coding, the input FIFO memory (13) is supplied with 16-bit wavelet transform coefficients, and for decoding, it is supplied with 8-bit code data. For coding, the output FIFO memory (14) outputs 8-bit code data, and for decoding, it outputs 16-bit wavelet transform coefficients. Thus, the circuit scale can be reduced and the data transfer speed can be improved.

    Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation
    4.

    Publication No.: US06667700B1

    Publication date: 2003-12-23

    Application No.: US10285330

    Filing date: 2002-10-30

    IPC classification: H03M7/34

    Abstract: In a coding system, input data within a system is encoded. The input data might include sequences of symbols that repeat in the input data or occur in other input data encoded in the system. The encoding includes determining a target segment size, determining a window size, identifying a fingerprint within a window of symbols at an offset in the input data, determining whether the offset is to be designated as a cut point and segmenting the input data as indicated by the set of cut points. For each segment so identified, the encoder determines whether the segment is to be a referenced segment or an unreferenced segment, replacing the segment data of each referenced segment with a reference label and storing a reference binding in a persistent segment store for each referenced segment, if needed. Hierarchically, the process can be repeated by grouping references into groups, replacing the grouped references with a group label, storing a binding between the grouped references and group label, if one is not already present, and repeating the process. The number of levels of hierarchy can be fixed in advance or it can be determined from the content encoded.

    Matrix implemented data compression apparatus and method
    5.
    Granted invention patent (status: active)

    Publication No.: US06608570B1

    Publication date: 2003-08-19

    Application No.: US10195795

    Filing date: 2002-07-15

    Applicant: Albert B. Cooper

    Inventor: Albert B. Cooper

    IPC classification: H03M7/34

    CPC classification: H03M7/3088

    Abstract: A data compressor includes a matrix of AND-gates corresponding to a respective plurality of strings. An AND-gate has inputs responsive, respectively, to a representation of a prefix code and a representation of a fetched character for energizing the AND-gate output. The AND-gate outputs are coupled, respectively, to the inputs of a matrix switch and the matrix switch outputs have respective string codes assigned thereto. The matrix switch is controllable for coupling any one of the matrix switch inputs to a selected one of the matrix switch outputs. Energization of an AND-gate output coupled to a matrix switch output provides a representation of the code assigned thereto. A prefix decoder responsive to the provided representations of codes assigned to the matrix switch outputs provides decoder outputs to the prefix code inputs of the AND-gates. A character decoder responsive to fetched characters provides decoder outputs to the character inputs of the AND-gates.

    System, method and computer readable medium for compressing a data sequence
    6.
    Granted invention patent (status: expired)

    Publication No.: US06501395B1

    Publication date: 2002-12-31

    Application No.: US10120026

    Filing date: 2002-04-10

    IPC classification: H03M7/34

    CPC classification: H03M7/30, H03M7/3084

    Abstract: A method for compressing an input sequence of data portions is disclosed. The input sequence is compressed using a Lempel-Ziv technique to generate an output codestream. The codestream includes an ordered sequence of codewords corresponding to and separate from a stream of at least one sequence of non-matchable portions in the input sequence. Each codeword includes three data items denoting a length of a non-matchable sequence preceding a matchable first sequence, the offset associated therewith and the length of the matchable first sequence. The codewords are used to reference sequences of data portions which previously appeared when decompressing the output codestream to allow the input sequence to be rebuilt. A program storage device and a compressing system for providing the above method are also disclosed.
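
The three-item codeword in this abstract can be sketched in Python as a tuple `(literal_run, offset, match_length)` kept apart from a stream of literal bytes. This is an illustrative sketch only: the brute-force match search, the `min_match` and `window` parameters, and the `(run, 0, 0)` codeword for trailing literals are assumptions, not details taken from the patent.

```python
def lz_compress(data: bytes, min_match: int = 3, window: int = 255):
    """Return (codewords, literals) with codewords = (run, offset, length)."""
    codewords, literals = [], bytearray()
    i = run = 0
    while i < len(data):
        best_len = best_off = 0
        # Brute-force search of the preceding `window` bytes for the longest match.
        for j in range(max(0, i - window), i):
            k = 0
            while i + k < len(data) and data[j + k] == data[i + k]:
                k += 1
            if k > best_len:
                best_len, best_off = k, i - j
        if best_len >= min_match:
            # `run` literals precede this match in the separate literal stream.
            codewords.append((run, best_off, best_len))
            run = 0
            i += best_len
        else:
            literals.append(data[i])
            run += 1
            i += 1
    if run:
        codewords.append((run, 0, 0))  # assumed marker: trailing literals, no match
    return codewords, bytes(literals)


def lz_decompress(codewords, literals: bytes) -> bytes:
    """Rebuild the input from the codewords and the literal stream."""
    out, pos = bytearray(), 0
    for run, offset, length in codewords:
        out += literals[pos:pos + run]
        pos += run
        for _ in range(length):
            out.append(out[-offset])  # byte-by-byte copy allows overlapping matches
    return bytes(out)
```

For the input `b"abcabcabcabc"` this sketch yields one codeword, `(3, 3, 9)`, and the literal stream `b"abc"`: three unmatched bytes followed by a nine-byte match at offset 3.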

    Apparatus for repeatedly compressing a data string and a method thereof
    7.

    Publication No.: US06392567B1

    Publication date: 2002-05-21

    Application No.: US09765421

    Filing date: 2001-01-22

    Applicant: Noriko Satoh

    Inventor: Noriko Satoh

    IPC classification: H03M7/34

    Abstract: Character strings, each starting at a different address of the character string data in an input buffer, are rearranged in a predetermined order to generate a rank list. Next, the location of the matching candidate of a character string to be encoded is obtained on the basis of the rank list. Then, the character string to be encoded is compared with the matching candidate, thereby obtaining a matching length. Further, a code is generated using the location of the matching candidate and the matching length, and the code is output as compression data.
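
One way to read the rank list is as a sorted list of suffix start addresses. The Python sketch below assumes the "predetermined order" is lexicographic and that matching candidates are the nearest rank-list neighbours that start earlier in the buffer; neither point is stated in the abstract, and the function names are hypothetical.

```python
def rank_list(buf: bytes) -> list[int]:
    """Start addresses of all suffixes, sorted lexicographically (assumed order)."""
    return sorted(range(len(buf)), key=lambda i: buf[i:])


def match_length(buf: bytes, cand: int, pos: int) -> int:
    """Number of equal bytes starting at `cand` and `pos` (with cand < pos)."""
    k = 0
    while pos + k < len(buf) and buf[cand + k] == buf[pos + k]:
        k += 1
    return k


def best_candidate(buf: bytes, ranks: list[int], pos: int) -> tuple[int, int]:
    """Return (matching length, location) for the string starting at `pos`."""
    r = ranks.index(pos)
    best = (0, -1)  # no candidate found yet
    for step in (-1, 1):
        q = r + step
        # Skip neighbours that start at or after `pos`; a decoder has not seen them.
        while 0 <= q < len(ranks) and ranks[q] >= pos:
            q += step
        if 0 <= q < len(ranks):
            cand = ranks[q]
            best = max(best, (match_length(buf, cand, pos), cand))
    return best
```

In a sorted suffix list, the shared prefix with a given entry can only shrink as the rank distance grows, so checking the nearest earlier-starting neighbour on each side is enough to find the longest earlier match. For `b"abracadabra"`, the string starting at address 7 matches the candidate at address 0 with length 4.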

    Encoding and decoding apparatus using context
    8.
    Granted invention patent (status: active)

    Publication No.: US06778103B2

    Publication date: 2004-08-17

    Application No.: US10226292

    Filing date: 2002-08-23

    Applicant: Noriko Satoh

    Inventor: Noriko Satoh

    IPC classification: H03M7/34

    CPC classification: H03M7/40, H03M7/3084

    Abstract: A symbol string detection unit detects the second symbol string matching the first symbol string having a predetermined length n from input character strings. A matching length detection unit detects a matching length k between the third symbol string following the first symbol string and the fourth symbol string following the second symbol string. A coding unit codes an input symbol string based on the symbol string detected by the symbol string detection unit and the matching length k detected by the matching length detection unit.
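
The context-matching idea can be sketched in Python as follows. The n symbols before the current position play the role of the first symbol string, the most recent earlier occurrence of the same context is the second, and k counts how many following symbols agree. The token format `(k, literal)`, the table update policy, and the function names are assumptions made for illustration, not the patent's coding scheme.

```python
def context_encode(data: bytes, n: int = 2) -> list[tuple]:
    """Emit (k, literal) tokens: k symbols predicted by context, then one literal."""
    table, out, i = {}, [], 0
    while i < len(data):
        k = 0
        if i >= n:
            ctx = data[i - n:i]          # first symbol string (current context)
            j = table.get(ctx)           # position following an earlier occurrence
            table[ctx] = i
            if j is not None:
                # Compare the symbols after both occurrences of the context.
                while i + k < len(data) and data[j + k] == data[i + k]:
                    k += 1
        i += k
        literal = data[i] if i < len(data) else None
        out.append((k, literal))
        i += 1
    return out


def context_decode(tokens: list[tuple], n: int = 2) -> bytes:
    """Mirror the encoder's table updates to resolve each matching length k."""
    table, out = {}, bytearray()
    for k, literal in tokens:
        i = len(out)
        if i >= n:
            ctx = bytes(out[i - n:i])
            j = table.get(ctx)
            table[ctx] = i
            for t in range(k):
                out.append(out[j + t])
        if literal is not None:
            out.append(literal)
    return bytes(out)
```

No offset is transmitted: the decoder finds the second symbol string from the context it has already rebuilt, so only k and the next literal need to be coded.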

    System and method for compressing an intelligence bearing signal and communicating the compressed signal from a source site to a destination site
    9.
    Granted invention patent (status: expired)

    Publication No.: US06724326B1

    Publication date: 2004-04-20

    Application No.: US10345834

    Filing date: 2003-01-16

    Applicant: John F. Remillard

    Inventor: John F. Remillard

    IPC classification: H03M7/34

    CPC classification: H03M7/30, H03M7/3084

    Abstract: An intelligence bearing signal in the form of a string of digitized analog signal samples is communicated from a source site to a destination site. A sub-string dictionary, a linked list and an ID list are provided at the source site and the destination site, and are used to compress the intelligence bearing signal for faster transmission.

    Data compressor utilizing switched input coincidence elements
    10.
    Granted invention patent (status: active)

    Publication No.: US06674374B1

    Publication date: 2004-01-06

    Application No.: US10351210

    Filing date: 2003-01-25

    Applicant: Albert B. Cooper

    Inventor: Albert B. Cooper

    IPC classification: H03M7/34

    CPC classification: G06T9/005, H03M7/3084

    Abstract: A data compressor for compressing an input stream of data characters into an output stream of compressed codes includes a plurality of AND-gates corresponding to a respective plurality of codes to be assigned to strings. Each string comprises a prefix string, having an associated prefix code, and an extension character. An AND-gate has a prefix code input and a character input for enabling the AND-gate, the energized output of an AND-gate providing a representation of the code corresponding thereto. The compressor includes a first matrix switch for selectively coupling the provided representations of codes corresponding to the AND-gates to the prefix code inputs of the AND-gates and a second matrix switch for selectively coupling representations of data characters fetched from the input stream to the character inputs of the AND-gates. Data characters are sequentially fetched from the input stream so as to sequentially enable AND-gates until a last data character is fetched that does not result in an enabled AND-gate. The code is output that corresponds to the last enabled AND-gate, thereby providing the stream of compressed codes.
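
The prefix-code and extension-character scheme described here resembles LZW-style dictionary compression. The Python sketch below is a software analogue only, assuming an LZW-style table update: a dictionary keyed by `(prefix_code, character)` stands in for the AND-gates, and a successful lookup stands in for an enabled gate. The initial 256 single-byte codes and the rule for assigning new codes are assumptions, not taken from the abstract.

```python
def lzw_compress(data: bytes) -> list[int]:
    """Emit the code of the longest known string, then extend the table."""
    table = {}        # (prefix_code, char) -> code, one entry per "gate"
    next_code = 256   # assumed: codes 0..255 denote single bytes
    out = []
    if not data:
        return out
    prefix = data[0]
    for ch in data[1:]:
        key = (prefix, ch)
        if key in table:
            prefix = table[key]       # a gate is enabled: keep extending
        else:
            out.append(prefix)        # no gate enabled: output last matched code
            table[key] = next_code    # assign the next code to the new string
            next_code += 1
            prefix = ch
    out.append(prefix)
    return out


def lzw_decompress(codes: list[int]) -> bytes:
    """Rebuild the strings by replaying the same table growth."""
    if not codes:
        return b""
    strings = {i: bytes([i]) for i in range(256)}
    next_code = 256
    prev = strings[codes[0]]
    out = bytearray(prev)
    for code in codes[1:]:
        # A code not yet in the table can only be prev + first byte of prev.
        entry = strings[code] if code in strings else prev + prev[:1]
        out += entry
        strings[next_code] = prev + entry[:1]
        next_code += 1
        prev = entry
    return bytes(out)
```

With this sketch, `b"ABABABA"` compresses to the four codes `[65, 66, 256, 258]`.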
