Query and Exadata Support for Hybrid Columnar Compressed Data
    11.
    发明申请
    Query and Exadata Support for Hybrid Columnar Compressed Data 有权
    查询和Exadata支持混合柱状压缩数据

    公开(公告)号:US20120054225A1

    公开(公告)日:2012-03-01

    申请号:US12871824

    申请日:2010-08-30

    IPC分类号: G06F17/30

    摘要: A method and apparatus is provided for optimizing queries received by a database system that relies on an intelligent data storage server to manage storage for the database system. Storing compression units in hybrid columnar format, the storage manager evaluates simple predicates and only returns data blocks containing rows that satisfy those predicates. The returned data blocks are not necessarily stored persistently on disk. That is, the storage manager is not limited to returning disc block images. The hybrid columnar format enables optimizations that provide better performance when processing typical database workloads including both fetching rows by identifier and performing table scans.

    摘要翻译: 提供了一种方法和装置,用于优化由数据库系统接收的依赖于智能数据存储服务器来管理数据库系统的存储的查询。 以混合列格式存储压缩单元,存储管理器将评估简单谓词,并仅返回包含满足这些谓词的行的数据块。 返回的数据块不一定存储在磁盘上。 也就是说,存储管理器不限于返回盘块图像。 混合柱状格式可以实现在处理典型数据库工作负载时提供更好性能的优化,包括通过标识符提取行和执行表扫描。

    Query and exadata support for hybrid columnar compressed data
    12.
    发明授权
    Query and exadata support for hybrid columnar compressed data 有权
    查询和exadata支持混合柱状压缩数据

    公开(公告)号:US08832142B2

    公开(公告)日:2014-09-09

    申请号:US12871824

    申请日:2010-08-30

    IPC分类号: G06F17/30

    摘要: A method and apparatus is provided for optimizing queries received by a database system that relies on an intelligent data storage server to manage storage for the database system. Storing compression units in hybrid columnar format, the storage manager evaluates simple predicates and only returns data blocks containing rows that satisfy those predicates. The returned data blocks are not necessarily stored persistently on disk. That is, the storage manager is not limited to returning disc block images. The hybrid columnar format enables optimizations that provide better performance when processing typical database workloads including both fetching rows by identifier and performing table scans.

    摘要翻译: 提供了一种方法和装置,用于优化由数据库系统接收的依赖于智能数据存储服务器来管理数据库系统的存储的查询。 以混合列格式存储压缩单元,存储管理器将评估简单谓词,并仅返回包含满足这些谓词的行的数据块。 返回的数据块不一定存储在磁盘上。 也就是说,存储管理器不限于返回盘块图像。 混合柱状格式可以实现在处理典型数据库工作负载时提供更好性能的优化,包括通过标识符提取行和执行表扫描。

    Techniques for compression and processing optimizations by using data transformations
    13.
    发明授权
    Techniques for compression and processing optimizations by using data transformations 有权
    使用数据转换进行压缩和处理优化的技术

    公开(公告)号:US08239421B1

    公开(公告)日:2012-08-07

    申请号:US12871862

    申请日:2010-08-30

    IPC分类号: G06F17/20

    CPC分类号: H03M7/30 H03M7/3084

    摘要: Described herein are compression and processing optimizations by using data transformation techniques. In example embodiments, a byte-wise differential transformation is applied to columnar data represented as a list of length-value pairs to determine a list of delta pairs that is subsequently compressed and stored on persistent storage. A length separation transformation is applied to separate a list of length-value pairs into a length array and a corresponding data value array, where these two arrays are subsequently compressed and stored separately on persistent storage. A native number transformation is applied to a set of number values to remove the lengths stored in the number values, where the transformed set is stored on persistent storage instead of the original set of number values. A native datetime-type transformation is applied to a set of datetime values to generate an encoding that is used to encode the set of datetime values into an encoded set that is stored on persistent storage instead of the original set.

    摘要翻译: 这里描述的是使用数据变换技术的压缩和处理优化。 在示例实施例中,将逐字节差分变换应用于表示为长度值对列表的列数据,以确定随后压缩并存储在持久存储器上的增量对列表。 应用长度分离变换将长度值对列表分隔成长度数组和对应的数据值数组,其中这两个数组随后被压缩并分别存储在持久存储器上。 原始数字变换被应用于一组数字值以去除存储在数字值中的长度,其中变换的集合存储在永久存储器上而不是原始的数字集合。 本机datetime类型转换应用于一组datetime值,以生成用于将datetime值集合编码为存储在永久存储而不是原始集合的编码集中的编码。

    Techniques For Automatic Data Placement With Compression And Columnar Storage
    14.
    发明申请
    Techniques For Automatic Data Placement With Compression And Columnar Storage 有权
    用于压缩和柱状存储的自动数据放置技术

    公开(公告)号:US20120296883A1

    公开(公告)日:2012-11-22

    申请号:US13475669

    申请日:2012-05-18

    IPC分类号: G06F17/30

    摘要: For automatic data placement of database data, a plurality of access-tracking data is maintained. The plurality of access-tracking data respectively corresponds to a plurality of data rows that are managed by a database server. While the database server is executing normally, it is automatically determined whether a data row, which is stored in first one or more data blocks, has been recently accessed based on the access-tracking data that corresponds to that data row. After determining that the data row has been recently accessed, the data row is automatically moved from the first one or more data blocks to one or more hot data blocks that are designated for storing those data rows, from the plurality of data rows, that have been recently accessed.

    摘要翻译: 对于数据库数据的自动数据放置,维持多个访问跟踪数据。 多个访问跟踪数据分别对应于由数据库服务器管理的多个数据行。 当数据库服务器正常执行时,根据与该数据行相对应的访问跟踪数据,自动确定是否最近访问存储在第一个一个或多个数据块中的数据行。 在确定最近访问了数据行之后,数据行被自动从第一个或多个数据块移动到指定用于存储来自多个数据行的那些数据行的一个或多个热数据块,这些数据行具有 最近被访问。

    Techniques for automatic data placement with compression and columnar storage
    15.
    发明授权
    Techniques for automatic data placement with compression and columnar storage 有权
    使用压缩和柱状存储自动数据放置的技术

    公开(公告)号:US08959057B2

    公开(公告)日:2015-02-17

    申请号:US13475669

    申请日:2012-05-18

    IPC分类号: G06F17/30

    摘要: For automatic data placement of database data, a plurality of access-tracking data is maintained. The plurality of access-tracking data respectively corresponds to a plurality of data rows that are managed by a database server. While the database server is executing normally, it is automatically determined whether a data row, which is stored in first one or more data blocks, has been recently accessed based on the access-tracking data that corresponds to that data row. After determining that the data row has been recently accessed, the data row is automatically moved from the first one or more data blocks to one or more hot data blocks that are designated for storing those data rows, from the plurality of data rows, that have been recently accessed.

    摘要翻译: 对于数据库数据的自动数据放置,维持多个访问跟踪数据。 多个访问跟踪数据分别对应于由数据库服务器管理的多个数据行。 当数据库服务器正常执行时,根据与该数据行相对应的访问跟踪数据,自动确定是否最近访问存储在第一个一个或多个数据块中的数据行。 在确定最近访问了数据行之后,数据行被自动从第一个或多个数据块移动到指定用于存储来自多个数据行的那些数据行的一个或多个热数据块,这些数据行具有 最近被访问。

    System and method for data de-duplication
    16.
    发明授权
    System and method for data de-duplication 有权
    用于重复数据删除的系统和方法

    公开(公告)号:US09465823B2

    公开(公告)日:2016-10-11

    申请号:US11584782

    申请日:2006-10-19

    IPC分类号: G06F17/30 G06F11/14

    摘要: Disclosed are methods, systems, and computer program products for processing a file which include using a computer system that is programmed for performing a process of receiving the file in response to a request for storing the file, determining whether a database already contains the file, and storing the file in the database if the database does not already contain the file. The process may alternatively include receiving the file in response to a request for storing the file, determining whether a database already contains the file, and storing the file without storing the received file if the database already contains the file. The process may also alternatively include receiving the file in response to a request for storing the file in a database, separating the file into a plurality of portions, and storing the plurality of portions so each of the plurality of portions can be individually accessed.

    摘要翻译: 公开了用于处理文件的方法,系统和计算机程序产品,其包括使用被编程为响应于存储文件的请求执行接收文件的处理的计算机系统,确定数据库是否已经包含文件, 并且如果数据库尚未包含文件,则将该文件存储在数据库中。 该过程可以替代地包括响应于存储文件的请求而接收文件,确定数据库是否已经包含该文件,以及如果数据库已经包含该文件,则存储该文件而不存储接收到的文件。 该过程还可以包括响应于将文件存储在数据库中的请求而接收文件,将文件分离成多个部分,并且存储多个部分,使得多个部分中的每一个可以被单独访问。

    System and method for data de-duplication
    17.
    发明申请
    System and method for data de-duplication 有权
    用于重复数据删除的系统和方法

    公开(公告)号:US20080098083A1

    公开(公告)日:2008-04-24

    申请号:US11584782

    申请日:2006-10-19

    IPC分类号: G06F15/16

    摘要: A method of processing a file includes receiving a request to store the file, receiving the file, determining whether a database already contains the file, and storing the file in the database if the database does not already contain the file. A method of processing a file includes receiving a request to store the file, receiving the file, determining whether a database already contains the file, and satisfying the request to store the file without storing the received file if the database already contains the file. A method of processing data from a file includes receiving a request to store the file in a database, receiving the file, separating the file into a plurality of portions, and storing the plurality of portions such that each of the plurality of portions can be individually accessed.

    摘要翻译: 处理文件的方法包括:如果数据库尚未包含文件,则接收存储文件的请求,接收文件,确定数据库是否已经包含该文件,以及将该文件存储在数据库中。 处理文件的方法包括接收文件的存储请求,接收文件,确定数据库是否已经包含文件,如果数据库已经包含该文件,则满足存储文件的请求而不存储所接收的文件。 一种从文件处理数据的方法包括:接收将文件存储在数据库中的请求,接收文件,将文件分离成多个部分,并存储多个部分,使得多个部分中的每一个可以单独地 访问。

    ON-LINE TRANSACTION PROCESSING (OLTP) COMPRESSION AND RE-COMPRESSION OF DATABASE DATA

    公开(公告)号:US20130159246A1

    公开(公告)日:2013-06-20

    申请号:US13763663

    申请日:2013-02-09

    IPC分类号: G06F17/30

    摘要: A computer is programmed to compress data of a database in response to database modification language (DML) statements generated by on-line transaction processing (OLTP) systems. In several embodiments, data that is initially added to a database block is left uncompressed until a predetermined condition is satisfied, which happens infrequently (relative to OLTP transactions on the block). When satisfied, the computer automatically compresses all uncompressed data in the block, which increases the amount of unused space in the block. New data is thereafter added uncompressed to the partially compressed block, until satisfaction of a predetermined condition whereby the partially compressed block is again compressed, i.e. re-compressed. Adding of new data to a partially compressed block and its compression are repeated unless another predetermined condition is met, in response to which the block is not further re-compressed, thereby to recognize a limit on the benefit from compression.

    Failover and resume when using ordered sequences in a multi-instance database environment

    公开(公告)号:US09910893B2

    公开(公告)日:2018-03-06

    申请号:US13309300

    申请日:2011-12-01

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/3048

    摘要: An approach is disclosed for implementing failover and resume when using ordered sequences in a multi-instance database environment. The approach commences by instantiating a first database instance initially to serve as an active instance, then instantiating a second database instance to serve as an instance of one or more passive instances. The active database establishes mastership over a sequence and then processes requests for the ‘next’ symbol by accessing a shared sequence cache only after accessing a first instance semaphore. The active instance and the passive instance perform a protocol such that upon passive database detection of a failure of the active database, one of the passive database instances takes over mastership of the sequence cache, and then proceeds to satisfy sequence value requests. The particular order is observed in spite of the failure.

    Reducing sequence cache latch contention in a database system
    20.
    发明授权
    Reducing sequence cache latch contention in a database system 有权
    减少数据库系统中的序列缓存锁定争用

    公开(公告)号:US09141609B2

    公开(公告)日:2015-09-22

    申请号:US13309394

    申请日:2011-12-01

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30 G06F17/30348

    摘要: In a database system having a plurality of concurrently executing session processes, the method commences by establishing a master list of sequences, the master list comprising a plurality of sequence objects which in turn define a sequence of values used for numbering and other identification within the database system. To reduce sequence cache latch access contention, multiple tiers of latches are provided. Methods of the system provide a first tier having a first tier “global” latch to serialize access to the master list. A second tier of latches is provided, the second tier having multiple second tier latches to serialize access to corresponding allocated sequences of values such that at any point in time, only one of the concurrently executing session processes is granted access to the allocated sequence.

    摘要翻译: 在具有多个同时执行的会话处理的数据库系统中,该方法通过建立序列的主列表开始,主列表包括多个序列对象,这些序列对象又定义用于数据库中的编号和其他识别的值序列 系统。 为了减少序列高速缓存锁存器访问争用,提供多层锁存器。 系统的方法提供具有第一层“全局”锁存器的第一层以串行化对主列表的访问。 提供了第二层锁存器,第二层具有多个第二层锁存器,以串行化对相应分配的值序列的访问,使得在任何时间点,只允许一个并发执行的会话进程访问所分配的序列。