System and method for adaptively loading input data into a multi-dimensional clustering table
    11.
    发明授权
    System and method for adaptively loading input data into a multi-dimensional clustering table 失效
    将输入数据自适应地加载到多维聚类表中的系统和方法

    公开(公告)号:US07080206B2

    公开(公告)日:2006-07-18

    申请号:US10425351

    申请日:2003-04-29

    IPC分类号: G06F12/00

    摘要: A system and associated method load an input data stream into a multi-dimensional clustering (MDC) table or other structure containing data clustered along one or more dimensions, by assembling blocks of data in a partial block cache in which each partial block is associated with a distinct logical cell. A minimum threshold number of partial blocks may be maintained. Partial blocks may be spilled from the partial block cache to make room for new logical cells. Last partial pages of spilled partial blocks may be stored in a partial page cache to limit I/O if the cell associated with a spilled block is encountered later in the input data stream. Buffers may be reassigned from the partial block cache to the partial page cache if the latter is filled. Parallelism may be employed for efficiency during sorting of input data subsets and during storage of blocks to secondary storage.

    摘要翻译: 系统和相关联的方法通过在每个部分块与其相关联的部分块高速缓存中组合数据块,将输入数据流加载到多维聚类(MDC)表或包含沿着一个或多个维集群的数据的其他结构 一个不同的逻辑单元。 可以维持部分块的最小阈值数。 部分块可能从部分块高速缓存中溢出,为新的逻辑单元腾出空间。 溢出的部分块的最后部分页面可以存储在部分页面高速缓存中,以便在输入数据流中稍后遇到与溢出块相关联的单元格时,限制I / O。 缓冲区可以从部分块缓存重新分配到部分页面缓存(如果后者被填充)。 在排序输入数据子集期间以及在将块存储到次级存储期间,可以采用并行性来实现效率。

    DYNAMIC DATA COMPACTION FOR DATA REDISTRIBUTION
    12.
    发明申请
    DYNAMIC DATA COMPACTION FOR DATA REDISTRIBUTION 有权
    用于数据重新分配的动态数据压缩

    公开(公告)号:US20090063526A1

    公开(公告)日:2009-03-05

    申请号:US11849159

    申请日:2007-08-31

    IPC分类号: G06F17/30

    摘要: A method and system for optimizing data redistribution in a database. In one embodiment, the method includes moving, during a first scan, outgoing records from a sending partition to one or more receiving partitions, where free space is created in the sending partition due to the outgoing records leaving the sending partition. The method also includes filling, during the first scan, some of the free space with remaining records that do not leave the sending partition.

    摘要翻译: 一种用于优化数据库中数据重新分配的方法和系统。 在一个实施例中,该方法包括在第一次扫描期间将输出记录从发送分区移动到一个或多个接收分区,其中由于离开发送分区的传出记录在发送分区中创建可用空间。 该方法还包括在第一次扫描期间填充一些可用空间,剩余的记录不会留下发送分区。

    DATA REDISTRIBUTION IN SHARED NOTHING ARCHITECTURE
    13.
    发明申请
    DATA REDISTRIBUTION IN SHARED NOTHING ARCHITECTURE 审中-公开
    共享架构中的数据重新分配

    公开(公告)号:US20090063807A1

    公开(公告)日:2009-03-05

    申请号:US11847270

    申请日:2007-08-29

    IPC分类号: G06F12/02

    摘要: A system and method for data redistribution. In one embodiment, the method includes dividing data into batches at a sending partition; populating a first data structure with the first pages and the first control information in a first data structure; storing the first data structure in a cache at the sending partition; sending the changes over the network to the receiving partition; receiving a notification that the changes have been successfully stored in the second hard disk at the receiving partition; and storing, in response to the notification, the changes on the first hard disk at the sending partition.

    摘要翻译: 一种用于数据重新分配的系统和方法。 在一个实施例中,该方法包括在发送分区处分批数据; 在第一数据结构中填充具有第一页面和第一控制信息的第一数据结构; 将所述第一数据结构存储在所述发送分区的高速缓存中; 通过网络将更改发送到接收分区; 接收在所述接收分区处已将所述更改成功存储在所述第二硬盘中的通知; 以及响应于所述通知存储所述发送分区上的所述第一硬盘上的改变。