FAST BATCH LOADING AND INCREMENTAL LOADING OF DATA INTO A DATABASE
    1.
    Type: Patent application
    Status: Active

    Publication No.: US20110099155A1

    Publication date: 2011-04-28

    Application No.: US12984284

    Filing date: 2011-01-04

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for pipelined load. The online incremental loading process may also be performed while serving queries.
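
    The abstract does not say how the index profiling phase derives balanced partitions. As a minimal sketch only, assuming an equi-depth split over a sample of index keys, the idea can be illustrated as follows. The function names (profile_split_points, assign_partition, partition_keys) are hypothetical and are not taken from the patent.

```python
import bisect


def profile_split_points(sample_keys, num_partitions):
    """Profile a sample of index keys and return num_partitions - 1 boundary keys.

    Assumption: boundaries are taken at equally spaced ranks of the sorted
    sample (equi-depth), so each partition receives a similar number of keys.
    """
    ordered = sorted(sample_keys)
    n = len(ordered)
    return [ordered[(i * n) // num_partitions] for i in range(1, num_partitions)]


def assign_partition(key, split_points):
    """Return the partition number for a key via binary search on the boundaries."""
    return bisect.bisect_right(split_points, key)


def partition_keys(keys, split_points):
    """Distribute keys into partitions that could then be loaded as pipeline stages."""
    parts = [[] for _ in range(len(split_points) + 1)]
    for key in keys:
        parts[assign_partition(key, split_points)].append(key)
    return parts


if __name__ == "__main__":
    keys = list(range(1000))
    splits = profile_split_points(keys, 4)
    sizes = [len(p) for p in partition_keys(keys, splits)]
    print(splits, sizes)
```

    With skewed keys, the same boundaries computed from a representative sample would still give partitions of similar size, which is what allows each partition to be built and loaded as an independent pipeline stage.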

    FAST BULK LOADING AND INCREMENTAL LOADING OF DATA INTO A DATABASE
    2.
    Type: Patent application
    Status: Active

    Publication No.: US20090319550A1

    Publication date: 2009-12-24

    Application No.: US12144303

    Filing date: 2008-06-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for pipelined load. The online incremental loading process may also be performed while serving queries.

    Fast batch loading and incremental loading of data into a database
    3.
    Type: Granted patent
    Status: Active

    Publication No.: US08165988B2

    Publication date: 2012-04-24

    Application No.: US12984284

    Filing date: 2011-01-04

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for pipelined load. The online incremental loading process may also be performed while serving queries.

    Fast bulk loading and incremental loading of data into a database
    4.
    Type: Granted patent
    Status: Active

    Publication No.: US07895151B2

    Publication date: 2011-02-22

    Application No.: US12144303

    Filing date: 2008-06-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for pipelined load. The online incremental loading process may also be performed while serving queries.

    Methods and systems for real-time continuous updates
    6.
    Type: Granted patent
    Status: Active

    Publication No.: US08458129B2

    Publication date: 2013-06-04

    Application No.: US12144486

    Filing date: 2008-06-23

    IPC classification: G06F7/00

    CPC classification: G06F17/30356 G06F17/30551

    Abstract: Embodiments of the present invention provide fine-grained concurrency control for transactions in the presence of database updates. During operations, each transaction is assigned a snapshot version number or SVN. An SVN refers to a historical snapshot of the database that can be created periodically or on demand. Transactions are thus tied to a particular SVN, such as the SVN current when the transaction was created. Queries belonging to the transactions can access data that is consistent as of a point in time, for example, corresponding to the latest SVN when the transaction was created. At various times, data from the database stored in a memory can be updated using the snapshot data corresponding to an SVN. When a transaction is committed, a snapshot of the database with a new SVN is created based on the data modified by the transaction and the snapshot is synchronized to the memory. When a transaction query requires data from a version of the database corresponding to an SVN, the data in the memory may be synchronized with the snapshot data corresponding to that SVN.
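
    The SVN scheme described in the abstract resembles multi-version snapshot reads. The following is a minimal sketch, not the patented implementation: the class and method names (SnapshotStore, Transaction, begin, commit) are hypothetical, the store keeps every version in memory, and write-write conflict detection is deliberately left out.

```python
class SnapshotStore:
    """Toy versioned key-value store in which every commit creates a new SVN."""

    def __init__(self):
        self.latest_svn = 0
        self._versions = {}  # key -> list of (svn, value), in ascending SVN order

    def begin(self):
        # A transaction is tied to the latest SVN at the time it is created.
        return Transaction(self, self.latest_svn)

    def read(self, key, svn):
        # Return the newest value whose version is not newer than the given SVN.
        for version_svn, value in reversed(self._versions.get(key, [])):
            if version_svn <= svn:
                return value
        return None

    def commit(self, writes):
        # Committing publishes the modified data under a new SVN.
        self.latest_svn += 1
        for key, value in writes.items():
            self._versions.setdefault(key, []).append((self.latest_svn, value))
        return self.latest_svn


class Transaction:
    def __init__(self, store, svn):
        self.store = store
        self.svn = svn
        self.writes = {}

    def get(self, key):
        # A transaction sees its own uncommitted writes, then its snapshot.
        if key in self.writes:
            return self.writes[key]
        return self.store.read(key, self.svn)

    def put(self, key, value):
        self.writes[key] = value

    def commit(self):
        return self.store.commit(self.writes)


if __name__ == "__main__":
    store = SnapshotStore()
    loader = store.begin()
    loader.put("a", 1)
    loader.commit()               # creates SVN 1
    reader = store.begin()        # tied to SVN 1
    updater = store.begin()
    updater.put("a", 2)
    updater.commit()              # creates SVN 2
    print(reader.get("a"), store.begin().get("a"))
```

    The reader created before the second commit keeps seeing the value from its own SVN, while a transaction started afterwards sees the update, which is the point-in-time consistency the abstract describes.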

    METHODS AND SYSTEMS FOR REAL-TIME CONTINUOUS UPDATES
    8.
    Type: Patent application
    Status: Active

    Publication No.: US20090319486A1

    Publication date: 2009-12-24

    Application No.: US12144486

    Filing date: 2008-06-23

    IPC classification: G06F7/06 G06F17/30

    CPC classification: G06F17/30356 G06F17/30551

    Abstract: Embodiments of the present invention provide fine-grained concurrency control for transactions in the presence of database updates. During operations, each transaction is assigned a snapshot version number or SVN. An SVN refers to a historical snapshot of the database that can be created periodically or on demand. Transactions are thus tied to a particular SVN, such as the SVN current when the transaction was created. Queries belonging to the transactions can access data that is consistent as of a point in time, for example, corresponding to the latest SVN when the transaction was created. At various times, data from the database stored in a memory can be updated using the snapshot data corresponding to an SVN. When a transaction is committed, a snapshot of the database with a new SVN is created based on the data modified by the transaction and the snapshot is synchronized to the memory. When a transaction query requires data from a version of the database corresponding to an SVN, the data in the memory may be synchronized with the snapshot data corresponding to that SVN.

    ACCESSING DATA IN COLUMN STORE DATABASE BASED ON HARDWARE COMPATIBLE DATA STRUCTURES
    9.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20110246432A1

    Publication date: 2011-10-06

    Application No.: US13107399

    Filing date: 2011-05-13

    IPC classification: G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column-groups are stored with implicit row IDs (RIDs) and a RID-to-primary key column having both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is utilized that allows for arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.
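
    Three ideas from the abstract fit together naturally: implicit RIDs, fixed-width columns, and arithmetic address derivation. The sketch below is an illustration under stated assumptions rather than the patented layout: FixedWidthColumn, address_of, and reconstruct_row are hypothetical names, the base addresses are made-up values standing in for offsets in a global virtual address space, and Python's struct module stands in for the fixed-width encoding.

```python
import struct


class FixedWidthColumn:
    """One column stored as fixed-width values; the RID is the position, not a stored field."""

    def __init__(self, base_address, fmt):
        self.base_address = base_address  # assumed start of this column in the address space
        self.fmt = fmt                    # struct format of a single value, e.g. "<i"
        self.width = struct.calcsize(fmt)
        self.data = bytearray()

    def append(self, value):
        # New data is appended at the end; the returned RID is implicit in the offset.
        rid = len(self.data) // self.width
        self.data += struct.pack(self.fmt, value)
        return rid

    def address_of(self, rid):
        # Fixed width makes the address a pure arithmetic function of the RID.
        return self.base_address + rid * self.width

    def get(self, rid):
        return struct.unpack_from(self.fmt, self.data, rid * self.width)[0]


def reconstruct_row(columns, rid):
    """Stitch a row back together by reading the same RID from every column."""
    return tuple(column.get(rid) for column in columns)


if __name__ == "__main__":
    ids = FixedWidthColumn(0x1000, "<i")     # 4-byte integers
    prices = FixedWidthColumn(0x8000, "<d")  # 8-byte floats
    for item_id, price in [(1, 9.5), (2, 20.25)]:
        ids.append(item_id)
        prices.append(price)
    print(hex(ids.address_of(1)), hex(prices.address_of(1)))
    print(reconstruct_row([ids, prices], 1))
```

    Because no RID is stored and no per-row pointer is followed, a hardware unit could compute where any value lives from the RID alone, which is the property the abstract attributes to the global virtual address space.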

    ACCESSING DATA IN A COLUMN STORE DATABASE BASED ON HARDWARE COMPATIBLE DATA STRUCTURES
    10.
    Type: Patent application
    Status: Active

    Publication No.: US20090254532A1

    Publication date: 2009-10-08

    Application No.: US12099131

    Filing date: 2008-04-07

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column-groups are stored with implicit row IDs (RIDs) and a RID-to-primary key column having both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is utilized that allows for arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.
