OZIP compression and decompression

    Publication number: US10437781B2

    Publication date: 2019-10-08

    Application number: US15640286

    Filing date: 2017-06-30

    Abstract: A method, apparatus, and system for OZIP, a data compression and decompression codec, is provided. OZIP utilizes a fixed size static dictionary, which may be generated from a random sampling of input data to be compressed. Compression by direct token encoding to the static dictionary streamlines the encoding and avoids expensive conditional branching, facilitating hardware implementation and high parallelism. By bounding token definition sizes and static dictionary sizes to hardware architecture constraints such as word size or processor cache size, hardware implementation can be made fast and cost effective. For example, decompression may be accelerated by using SIMD instruction processor extensions. A highly granular block mapping in optional stored metadata allows compressed data to be accessed quickly at random, bypassing the processing overhead of dynamic dictionaries. Thus, OZIP can support low latency random data access for highly random workloads, such as for OLTP systems.
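
The abstract's main ideas (a static dictionary built from sampled input, direct fixed-size token encoding, and a block map for random access) can be illustrated with a minimal sketch. This is not the patented codec. The 16-bit token width, the entry length cap, the block granularity, the greedy longest-match strategy, and every function name below are assumptions chosen for illustration only.

```python
import struct
from bisect import bisect_right
from collections import Counter

TOKEN_FMT = "<H"     # assumption: every token is a fixed 16-bit dictionary index
TOKEN_SIZE = 2
MAX_LEN = 8          # assumption: cap on the byte length of a dictionary entry
BLOCK_TOKENS = 16    # assumption: number of tokens covered by one block-map entry


def build_dictionary(sample: bytes, extra: int = 1024) -> list[bytes]:
    """Static dictionary: all 256 single bytes plus frequent n-grams of the sample."""
    counts = Counter(
        sample[i:i + n]
        for n in range(2, MAX_LEN + 1)
        for i in range(len(sample) - n + 1)
    )
    entries = [bytes([b]) for b in range(256)]   # guarantees any input is encodable
    limit = min(extra, (1 << 16) - 256)          # keep indexes within 16 bits
    entries += [gram for gram, _ in counts.most_common(limit)]
    return entries


def compress(data: bytes, entries: list[bytes]) -> tuple[bytes, list[int]]:
    """Greedy longest-match encoding; returns (payload, block_map)."""
    index = {entry: i for i, entry in enumerate(entries)}
    tokens, block_map, pos = [], [], 0
    while pos < len(data):
        if len(tokens) % BLOCK_TOKENS == 0:
            block_map.append(pos)                # uncompressed offset of this block
        for n in range(min(MAX_LEN, len(data) - pos), 0, -1):
            token = index.get(data[pos:pos + n])
            if token is not None:                # n == 1 always matches
                tokens.append(token)
                pos += n
                break
    payload = b"".join(struct.pack(TOKEN_FMT, t) for t in tokens)
    return payload, block_map


def decompress(payload: bytes, entries: list[bytes]) -> bytes:
    """Decoding is one table lookup per fixed-size token."""
    return b"".join(entries[t] for (t,) in struct.iter_unpack(TOKEN_FMT, payload))


def read_at(payload: bytes, block_map: list[int], entries: list[bytes],
            offset: int, length: int) -> bytes:
    """Random access: decode only from the block that contains `offset`."""
    block = bisect_right(block_map, offset) - 1
    base = block_map[block]
    start = block * BLOCK_TOKENS * TOKEN_SIZE
    out = bytearray()
    for (t,) in struct.iter_unpack(TOKEN_FMT, payload[start:]):
        out += entries[t]
        if base + len(out) >= offset + length:
            break
    skip = offset - base
    return bytes(out[skip:skip + length])


if __name__ == "__main__":
    text = b"the quick brown fox jumps over the lazy dog. " * 50
    dictionary = build_dictionary(text[:200])    # dictionary from a sample only
    packed, blocks = compress(text, dictionary)
    print(len(text), len(packed), len(blocks))
    print(read_at(packed, blocks, dictionary, 1000, 20))
```

Because every token has the same width and the dictionary never changes during decoding, a reader can jump to any block boundary and start decoding there without replaying earlier data. That property is what the abstract contrasts with dynamic dictionaries.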

    Combined Row and Columnar Storage for In-Memory Databases for OLTP and Analytics Workloads
    Invention application (under examination, published)

    Publication number: US20150088813A1

    Publication date: 2015-03-26

    Application number: US14097575

    Filing date: 2013-12-05

    CPC classification number: G06F17/30292 G06F17/30289 G06F17/30584

    Abstract: Columns of a table are stored in either row-major format or column-major format in an in-memory DBMS. For a given table, one set of columns is stored in column-major format; another set of columns for a table are stored in row-major format. This way of storing columns of a table is referred to herein as dual-major format. In addition, a row in a dual-major table is updated “in-place”, that is, updates are made directly to column-major columns without creating an interim row-major form of the column-major columns of the row. Users may submit database definition language (“DDL”) commands that declare the row-major columns and column-major columns of a table.
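
The dual-major layout described in this abstract can be pictured with a small in-memory sketch. It is an illustration under stated assumptions, not the claimed DBMS design: the class `DualMajorTable` and its methods are hypothetical, and the constructor arguments merely stand in for the DDL declaration of row-major and column-major columns, whose syntax the abstract does not give.

```python
class DualMajorTable:
    """Toy table: some columns stored row-major, the rest column-major."""

    def __init__(self, row_major, column_major):
        # Hypothetical stand-in for a DDL declaration of the two column sets.
        self.row_major = list(row_major)
        self.column_major = list(column_major)
        self._slot = {name: i for i, name in enumerate(self.row_major)}
        self.rows = []                                         # one list per row
        self.columns = {name: [] for name in self.column_major}  # one list per column

    def insert(self, record):
        """Split a record across both stores and return its row id."""
        self.rows.append([record[name] for name in self.row_major])
        for name in self.column_major:
            self.columns[name].append(record[name])
        return len(self.rows) - 1

    def update(self, row_id, name, value):
        """Update in place; a column-major value is overwritten directly,
        with no interim row-major copy of the row."""
        if name in self.columns:
            self.columns[name][row_id] = value
        else:
            self.rows[row_id][self._slot[name]] = value

    def fetch(self, row_id):
        """Reassemble one full row (the OLTP-style access path)."""
        record = dict(zip(self.row_major, self.rows[row_id]))
        for name in self.column_major:
            record[name] = self.columns[name][row_id]
        return record

    def scan(self, name):
        """Return one column-major column (the analytics-style access path)."""
        return self.columns[name]


if __name__ == "__main__":
    orders = DualMajorTable(row_major=["id", "customer"], column_major=["amount"])
    orders.insert({"id": 1, "customer": "ann", "amount": 10})
    second = orders.insert({"id": 2, "customer": "bob", "amount": 25})
    orders.update(second, "amount", 40)
    print(orders.fetch(second), sum(orders.scan("amount")))
```

Point lookups touch one row list plus one slot per column-major column, while an aggregate over `amount` reads a single contiguous list. That is the trade-off the dual-major format is meant to balance.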

    OZIP compression and decompression

    Publication number: US09697221B2

    Publication date: 2017-07-04

    Application number: US14337113

    Filing date: 2014-07-21

    Abstract: A method, apparatus, and system for OZIP, a data compression and decompression codec, is provided. OZIP utilizes a fixed size static dictionary, which may be generated from a random sampling of input data to be compressed. Compression by direct token encoding to the static dictionary streamlines the encoding and avoids expensive conditional branching, facilitating hardware implementation and high parallelism. By bounding token definition sizes and static dictionary sizes to hardware architecture constraints such as word size or processor cache size, hardware implementation can be made fast and cost effective. For example, decompression may be accelerated by using SIMD instruction processor extensions. A highly granular block mapping in optional stored metadata allows compressed data to be accessed quickly at random, bypassing the processing overhead of dynamic dictionaries. Thus, OZIP can support low latency random data access for highly random workloads, such as for OLTP systems.
