Storing compression units in relational tables
    1.
    发明授权
    Storing compression units in relational tables 有权
    将压缩单元存储在关系表中

    公开(公告)号:US08645337B2

    公开(公告)日:2014-02-04

    申请号:US12769205

    申请日:2010-04-28

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: A database server stores compressed units in data blocks of a database. A table (or data from a plurality of rows thereof) is first compressed into a “compression unit” using any of a wide variety of compression techniques. The compression unit is then stored in one or more data block rows across one or more data blocks. As a result, a single data block row may comprise compressed data for a plurality of table rows, as encoded within the compression unit. Storage of compression units in data blocks maintains compatibility with existing data block-based databases, thus allowing the use of compression units in preexisting databases without modification to the underlying format of the database. The compression units may, for example, co-exist with uncompressed tables. Various techniques allow a database server to optimize access to data in the compression unit, so that the compression is virtually transparent to the user.

    摘要翻译: 数据库服务器将压缩单位存储在数据库的数据块中。 使用各种各样的压缩技术中的任一种,首先将表(或其多行的数据)压缩为“压缩单位”。 然后,压缩单元被存储在跨越一个或多个数据块的一个或多个数据块行。 结果,单个数据块行可以包括在压缩单元内编码的多个表行的压缩数据。 数据块中的压缩单元的存储与现有的基于数据块的数据库保持兼容,从而允许在预先存在的数据库中使用压缩单元,而无需修改数据库的底层格式。 压缩单元可以例如与未压缩的表共存。 各种技术允许数据库服务器优化对压缩单元中的数据的访问,使得压缩对于用户实际上是透明的。

    Techniques For Automatic Data Placement With Compression And Columnar Storage
    2.
    发明申请
    Techniques For Automatic Data Placement With Compression And Columnar Storage 有权
    用于压缩和柱状存储的自动数据放置技术

    公开(公告)号:US20120296883A1

    公开(公告)日:2012-11-22

    申请号:US13475669

    申请日:2012-05-18

    IPC分类号: G06F17/30

    摘要: For automatic data placement of database data, a plurality of access-tracking data is maintained. The plurality of access-tracking data respectively corresponds to a plurality of data rows that are managed by a database server. While the database server is executing normally, it is automatically determined whether a data row, which is stored in first one or more data blocks, has been recently accessed based on the access-tracking data that corresponds to that data row. After determining that the data row has been recently accessed, the data row is automatically moved from the first one or more data blocks to one or more hot data blocks that are designated for storing those data rows, from the plurality of data rows, that have been recently accessed.

    摘要翻译: 对于数据库数据的自动数据放置,维持多个访问跟踪数据。 多个访问跟踪数据分别对应于由数据库服务器管理的多个数据行。 当数据库服务器正常执行时,根据与该数据行相对应的访问跟踪数据,自动确定是否最近访问存储在第一个一个或多个数据块中的数据行。 在确定最近访问了数据行之后,数据行被自动从第一个或多个数据块移动到指定用于存储来自多个数据行的那些数据行的一个或多个热数据块,这些数据行具有 最近被访问。

    STRUCTURE OF HIERARCHICAL COMPRESSED DATA STRUCTURE FOR TABULAR DATA
    3.
    发明申请
    STRUCTURE OF HIERARCHICAL COMPRESSED DATA STRUCTURE FOR TABULAR DATA 审中-公开
    用于数据数据的分层压缩数据结构的结构

    公开(公告)号:US20120143833A1

    公开(公告)日:2012-06-07

    申请号:US13371354

    申请日:2012-02-10

    IPC分类号: G06F7/00

    CPC分类号: G06F16/221

    摘要: A highly flexible and extensible structure is provided for physically storing tabular data. The structure, referred to as a compression unit, may be used to store tabular data that logically resides in any type of table-like structure. According to one embodiment, compression units are recursive. Thus, a compression unit may have a “parent” compression unit to which it belongs, and may have one or more “child” compression units that belong to it. In one embodiment, compression units include metadata that indicates how the tabular data is stored within them. The metadata for a compression unit may indicate, for example, whether the data is stored in row-major or column major-format the order of the columns within the compression unit (which may differ from the logical order of the columns dictated by the definition of their logical container), a compression technique for the compression unit, the child compression units (if any), etc.

    摘要翻译: 提供了高度灵活和可扩展的结构,用于物理存储表格数据。 称为压缩单元的结构可用于存储逻辑上位于任何类型的类似桌面结构的表格数据。 根据一个实施例,压缩单元是递归的。 因此,压缩单元可以具有它所属的“父”压缩单元,并且可以具有属于它的一个或多个“子”压缩单元。 在一个实施例中,压缩单元包括指示表格数据如何被存储在其中的元数据。 压缩单元的元数据例如可以指示数据是以行主列还是列主格式存储在压缩单元内的列的顺序(其可以不同于由定义指定的列的逻辑顺序 的逻辑容器),压缩单元的压缩技术,子压缩单元(如果有的话)等等

    STORING COMPRESSION UNITS IN RELATIONAL TABLES
    4.
    发明申请
    STORING COMPRESSION UNITS IN RELATIONAL TABLES 有权
    在关系表中存储压缩单位

    公开(公告)号:US20100281004A1

    公开(公告)日:2010-11-04

    申请号:US12769205

    申请日:2010-04-28

    IPC分类号: G06F17/30

    摘要: A database server stores compressed units in data blocks of a database. A table (or data from a plurality of rows thereof) is first compressed into a “compression unit” using any of a wide variety of compression techniques. The compression unit is then stored in one or more data block rows across one or more data blocks. As a result, a single data block row may comprise compressed data for a plurality of table rows, as encoded within the compression unit. Storage of compression units in data blocks maintains compatibility with existing data block-based databases, thus allowing the use of compression units in preexisting databases without modification to the underlying format of the database. The compression units may, for example, co-exist with uncompressed tables. Various techniques allow a database server to optimize access to data in the compression unit, so that the compression is virtually transparent to the user.

    摘要翻译: 数据库服务器将压缩单位存储在数据库的数据块中。 使用各种各样的压缩技术中的任一种,首先将表(或其多行的数据)压缩为“压缩单元”。 然后,压缩单元被存储在跨越一个或多个数据块的一个或多个数据块行。 结果,单个数据块行可以包括在压缩单元内编码的多个表行的压缩数据。 数据块中的压缩单元的存储与现有的基于数据块的数据库保持兼容,从而允许在预先存在的数据库中使用压缩单元,而无需修改数据库的底层格式。 压缩单元可以例如与未压缩的表共存。 各种技术允许数据库服务器优化对压缩单元中的数据的访问,使得压缩对于用户实际上是透明的。

    DDL and DML support for hybrid columnar compressed tables
    5.
    发明授权
    DDL and DML support for hybrid columnar compressed tables 有权
    DDL和DML支持混合柱状压缩表

    公开(公告)号:US08583692B2

    公开(公告)日:2013-11-12

    申请号:US12871882

    申请日:2010-08-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30595

    摘要: Techniques for storing and manipulating tabular data are provided. According to one embodiment, a user may control whether tabular data is stored in row-level or column-major format. Furthermore, the user may control the level of data compression to achieve an optimal balance between query performance and compression ratios. Tabular data from within the same table may be stored in both column-major and row-major format and compressed at different levels. In addition, tabular data can migrate between column-major format and row-major format in response to various events. For example, in response to a request to update or lock a row stored in column-major format, the row may be migrated and subsequently stored into row-major format. In one embodiment, table partitions are used to enhance data compression techniques. For example, compression tests are performed on a representative table partition, and a compression map is generated and applied to other table partitions.

    摘要翻译: 提供了存储和操作表格数据的技术。 根据一个实施例,用户可以控制表格数据是以行主列还是列主格式存储。 此外,用户可以控制数据压缩的水平,以实现查询性能和压缩比之间的最佳平衡。 来自同一表格的表格数据可以以列主和行主格式存储,并在不同级别进行压缩。 此外,响应于各种事件,表格数据可以在列主格式和行主格式之间迁移。 例如,响应于更新或锁定以列主格式存储的行的请求,该行可以被迁移并且随后被存储为行主格式。 在一个实施例中,表分区用于增强数据压缩技术。 例如,压缩测试在代表性的表格分区上执行,生成压缩映射并将其应用于其他表格分区。

    DDL and DML support for hybrid columnar compressed tables

    公开(公告)号:US08521784B2

    公开(公告)日:2013-08-27

    申请号:US12871882

    申请日:2010-08-30

    IPC分类号: G06F17/30

    摘要: Techniques for storing and manipulating tabular data are provided. According to one embodiment, a user may control whether tabular data is stored in row-level or column-major format. Furthermore, the user may control the level of data compression to achieve an optimal balance between query performance and compression ratios. Tabular data from within the same table may be stored in both column-major and row-major format and compressed at different levels. In addition, tabular data can migrate between column-major format and row-major format in response to various events. For example, in response to a request to update or lock a row stored in column-major format, the row may be migrated and subsequently stored into row-major format. In one embodiment, table partitions are used to enhance data compression techniques. For example, compression tests are performed on a representative table partition, and a compression map is generated and applied to other table partitions.

    Query and Exadata Support for Hybrid Columnar Compressed Data
    7.
    发明申请
    Query and Exadata Support for Hybrid Columnar Compressed Data 有权
    查询和Exadata支持混合柱状压缩数据

    公开(公告)号:US20120054225A1

    公开(公告)日:2012-03-01

    申请号:US12871824

    申请日:2010-08-30

    IPC分类号: G06F17/30

    摘要: A method and apparatus is provided for optimizing queries received by a database system that relies on an intelligent data storage server to manage storage for the database system. Storing compression units in hybrid columnar format, the storage manager evaluates simple predicates and only returns data blocks containing rows that satisfy those predicates. The returned data blocks are not necessarily stored persistently on disk. That is, the storage manager is not limited to returning disc block images. The hybrid columnar format enables optimizations that provide better performance when processing typical database workloads including both fetching rows by identifier and performing table scans.

    摘要翻译: 提供了一种方法和装置,用于优化由数据库系统接收的依赖于智能数据存储服务器来管理数据库系统的存储的查询。 以混合列格式存储压缩单元,存储管理器将评估简单谓词,并仅返回包含满足这些谓词的行的数据块。 返回的数据块不一定存储在磁盘上。 也就是说,存储管理器不限于返回盘块图像。 混合柱状格式可以实现在处理典型数据库工作负载时提供更好性能的优化,包括通过标识符提取行和执行表扫描。

    Techniques for automatic data placement with compression and columnar storage
    8.
    发明授权
    Techniques for automatic data placement with compression and columnar storage 有权
    使用压缩和柱状存储自动数据放置的技术

    公开(公告)号:US08959057B2

    公开(公告)日:2015-02-17

    申请号:US13475669

    申请日:2012-05-18

    IPC分类号: G06F17/30

    摘要: For automatic data placement of database data, a plurality of access-tracking data is maintained. The plurality of access-tracking data respectively corresponds to a plurality of data rows that are managed by a database server. While the database server is executing normally, it is automatically determined whether a data row, which is stored in first one or more data blocks, has been recently accessed based on the access-tracking data that corresponds to that data row. After determining that the data row has been recently accessed, the data row is automatically moved from the first one or more data blocks to one or more hot data blocks that are designated for storing those data rows, from the plurality of data rows, that have been recently accessed.

    摘要翻译: 对于数据库数据的自动数据放置,维持多个访问跟踪数据。 多个访问跟踪数据分别对应于由数据库服务器管理的多个数据行。 当数据库服务器正常执行时,根据与该数据行相对应的访问跟踪数据,自动确定是否最近访问存储在第一个一个或多个数据块中的数据行。 在确定最近访问了数据行之后,数据行被自动从第一个或多个数据块移动到指定用于存储来自多个数据行的那些数据行的一个或多个热数据块,这些数据行具有 最近被访问。

    Query and exadata support for hybrid columnar compressed data
    9.
    发明授权
    Query and exadata support for hybrid columnar compressed data 有权
    查询和exadata支持混合柱状压缩数据

    公开(公告)号:US08832142B2

    公开(公告)日:2014-09-09

    申请号:US12871824

    申请日:2010-08-30

    IPC分类号: G06F17/30

    摘要: A method and apparatus is provided for optimizing queries received by a database system that relies on an intelligent data storage server to manage storage for the database system. Storing compression units in hybrid columnar format, the storage manager evaluates simple predicates and only returns data blocks containing rows that satisfy those predicates. The returned data blocks are not necessarily stored persistently on disk. That is, the storage manager is not limited to returning disc block images. The hybrid columnar format enables optimizations that provide better performance when processing typical database workloads including both fetching rows by identifier and performing table scans.

    摘要翻译: 提供了一种方法和装置,用于优化由数据库系统接收的依赖于智能数据存储服务器来管理数据库系统的存储的查询。 以混合列格式存储压缩单元,存储管理器将评估简单谓词,并仅返回包含满足这些谓词的行的数据块。 返回的数据块不一定存储在磁盘上。 也就是说,存储管理器不限于返回盘块图像。 混合柱状格式可以实现在处理典型数据库工作负载时提供更好性能的优化,包括通过标识符提取行和执行表扫描。

    Compression analyzer
    10.
    发明授权
    Compression analyzer 有权
    压缩分析仪

    公开(公告)号:US08356060B2

    公开(公告)日:2013-01-15

    申请号:US12769508

    申请日:2010-04-28

    IPC分类号: G06F7/00

    CPC分类号: H03M7/30 G06F17/30595

    摘要: Techniques are described herein for automatically selecting the compression techniques to be used on tabular data. A compression analyzer gives users high-level control over the selection process without requiring the user to know details about the specific compression techniques that are available to the compression analyzer. Users are able to specify, for a given set of data, a “balance point” along the spectrum between “maximum performance” and “maximum compression”. The point thus selected is used by the compression analyzer in a variety of ways. For example, in one embodiment, the compression analyzer uses the user-specified balance point to determine which of the available compression techniques qualify as “candidate techniques” for the given set of data. The compression analyzer selects the compression technique to use on a set of data by actually testing the candidate compression techniques against samples from the set of data. After testing the candidate compression techniques against the samples, the resulting compression ratios are compared. The compression technique to use on the set of data is then selected based, in part, on the compression ratios achieved during the compression tests performed on the sample data.

    摘要翻译: 这里描述了用于自动选择要在表格数据上使用的压缩技术的技术。 压缩分析仪为用户提供了对选择过程的高级控制,而不需要用户了解有关压缩分析器可用的特定压缩技术的细节。 用户可以为给定的数据集指定沿最大性能和最大压缩之间的平衡点。 所选择的点由压缩分析器以各种方式使用。 例如,在一个实施例中,压缩分析器使用用户指定的平衡点来确定哪些可用的压缩技术被鉴定为用于给定的一组数据的候选技术。 压缩分析仪通过对来自该组数据的样本实际测试候选压缩技术来选择对一组数据使用的压缩技术。 在针对样品测试候选压缩技术之后,比较所得到的压缩比。 然后,部分地基于在对样本数据执行的压缩测试期间实现的压缩比来选择在该组数据上使用的压缩技术。