COMPRESSING XML DOCUMENTS USING STATISTICAL TREES GENERATED FROM THOSE DOCUMENTS
    1.
    发明申请
    COMPRESSING XML DOCUMENTS USING STATISTICAL TREES GENERATED FROM THOSE DOCUMENTS 审中-公开
    使用从这些文档生成的统计数据来压缩XML文档

    公开(公告)号:US20100049727A1

    公开(公告)日:2010-02-25

    申请号:US12194599

    申请日:2008-08-20

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/2252 G06F17/2247

    摘要: Compressing data from a markup language document such as an XML document includes the steps of creating from the document a path based statistical tree built according to a given set of rules, and compressing the document by using the statistical tree. In an embodiment, the statistical tree includes a multitude of paths, and a single bit represents each of said paths. Also, the document may include both enumerated data and non-enumerated data, and the enumerated data is compressed by using the statistical tree. In an embodiment, the document includes a multitude of document nodes, and the step of creating the path based statistical tree includes the step of forming said tree with a multitude of tree nodes, each of the tree nodes representing one of the document nodes.

    摘要翻译: 从诸如XML文档的标记语言文档压缩数据包括从文档创建根据给定的一组规则构建的基于路径的统计树并且使用统计树来压缩文档的步骤。 在一个实施例中,统计树包括多个路径,并且单个比特表示每个所述路径。 此外,文档可以包括枚举数据和非枚举数据,并且使用统计树来压缩枚举的数据。 在一个实施例中,文档包括多个文档节点,并且创建基于路径的统计树的步骤包括用多个树节点形成所述树的步骤,每个树节点表示文档节点之一。

    DICTIONARY FOR TEXTUAL DATA COMPRESSION AND DECOMPRESSION
    2.
    发明申请
    DICTIONARY FOR TEXTUAL DATA COMPRESSION AND DECOMPRESSION 有权
    文本数据压缩和解密的词典

    公开(公告)号:US20090271180A1

    公开(公告)日:2009-10-29

    申请号:US12108914

    申请日:2008-04-24

    IPC分类号: G06F17/21 H03M7/00

    CPC分类号: H03M7/3088 G06F17/2735

    摘要: A dictionary for compressing and decompressing textual data has a number of keys. Each key is associated with an identifier. The keys include static word or phrase keys, where each static word or phrase key lists one or more unchanging words in a particular order. The keys further include dynamic phrase keys, where each dynamic phrase key lists a number of words and one or more placeholders in a particular order, and each placeholder denotes a place where a word or phrase other than the words of the dynamic phrase key is to be inserted. At least one of the dynamic phrase keys may identify one or more of the words by identifiers for corresponding static words or phrase keys. At least one of the static word or phrase keys may identify one or more of the words by identifiers for corresponding other static words or phrase keys.

    摘要翻译: 用于压缩和解压缩文本数据的字典具有多个键。 每个密钥都与一个标识符相关联。 密钥包括静态字或短语键,其中每个静态词或短语键以特定顺序列出一个或多个不变的单词。 密钥还包括动态短语键,其中每个动态短语键以特定顺序列出多个单词和一个或多个占位符,并且每个占位符表示除动态短语键的单词之外的单词或短语为 被插入。 动态短语键中的至少一个可以通过用于对应的静态词或短语键的标识符识别一个或多个词。 静态词或短语键中的至少一个可以通过相应的其他静态词或短语键的标识符识别一个或多个词。