ACCESSING DATA IN COLUMN STORE DATABASE BASED ON HARDWARE COMPATIBLE DATA STRUCTURES
    21.
    Patent application
    Status: Under examination (published)

    Publication No.: US20110246432A1

    Publication date: 2011-10-06

    Application No.: US13107399

    Filing date: 2011-05-13

    IPC classification: G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column groups are stored with implicit row IDs (RIDs) and a RID-to-primary-key column, which provides both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is used that allows arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and a sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.
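
    The implicit RIDs and arithmetic address derivation described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `FixedWidthColumn`, the `base_address` parameter, and the use of Python's `struct` module for fixed-width encoding are all assumptions made for illustration.

```python
import struct


class FixedWidthColumn:
    """Hypothetical fixed-width column: a value's RID is simply its position."""

    def __init__(self, base_address, fmt):
        self.base_address = base_address    # assumed start of the column in a virtual address space
        self.fmt = fmt                      # struct format of one value, e.g. "<I" (4-byte unsigned)
        self.width = struct.calcsize(fmt)   # bytes per value, identical for every row
        self.data = bytearray()

    def append(self, value):
        """Append a value to the end of the column and return its implicit RID."""
        rid = len(self.data) // self.width
        self.data += struct.pack(self.fmt, value)
        return rid

    def address_of(self, rid):
        """Derive the address of a value arithmetically; no lookup structure is needed."""
        return self.base_address + rid * self.width

    def get(self, rid):
        """Read the value stored at the given RID."""
        return struct.unpack_from(self.fmt, self.data, rid * self.width)[0]


column = FixedWidthColumn(base_address=0x1000, fmt="<I")
rids = [column.append(v) for v in (10, 20, 30)]
```

    Because every value has the same width, `address_of(rid)` is a single multiply and add, which is the property that makes this layout convenient for hardware.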

    ACCESSING DATA IN A COLUMN STORE DATABASE BASED ON HARDWARE COMPATIBLE DATA STRUCTURES
    22.
    Patent application
    Status: In force

    Publication No.: US20090254532A1

    Publication date: 2009-10-08

    Application No.: US12099131

    Filing date: 2008-04-07

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column groups are stored with implicit row IDs (RIDs) and a RID-to-primary-key column, which provides both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is used that allows arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and a sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.
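
    As a rough illustration of the tuple reconstruction step, the sketch below stitches one row together from several column groups that share the same implicit RID. The function name `reconstruct_row` and the representation of a column group as a dictionary of lists are assumptions, not details taken from the patent.

```python
def reconstruct_row(column_groups, rid):
    """Rebuild one row by reading position `rid` from every column of every group."""
    row = {}
    for group in column_groups:
        for column_name, values in group.items():
            row[column_name] = values[rid]
    return row


# Two hypothetical column groups; position i in every list belongs to the same row.
customer_group = {"id": [101, 102, 103], "name": ["Ann", "Bo", "Cy"]}
order_group = {"total": [9.5, 20.0, 3.25]}

row = reconstruct_row([customer_group, order_group], rid=1)
```

    The RID acts as the only join key between groups, so no per-row identifier has to be stored in any of them.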

    Processing elements of a hardware accelerated reconfigurable processor for accelerating database operations and queries
    23.
    Patent application
    Status: Under examination (published)

    Publication No.: US20080189251A1

    Publication date: 2008-08-07

    Application No.: US11895997

    Filing date: 2007-08-27

    IPC classification: G06F17/30

    CPC classification: G06F16/2453

    Abstract: Embodiments of the present invention provide processing elements that are capable of performing high-level database operations in hardware based on machine code instructions. These processing elements employ a dataflow architecture that operates on data in hardware without interruption or software. A scanning/indexing processing element may comprise logic that analyzes database column groups stored in local memory, performs parallel field extraction and comparison, and generates a list of row pointers (row IDs or RIDs) referencing those rows whose value(s) satisfy an applied predicate. The scanning/indexing processing element may also be used to project database column groups, search and join index structures, and manipulate in-flight metadata flows by composing, merging, reducing, and modifying multi-dimensional lists of intermediate and final results. Furthermore, a scanning/indexing processing element may be used for joins with indexes, such as a Group Index, which involve the association of each input tuple with potentially many related data components in a one-to-many mapping. An XCAM processing element may comprise logic to perform associative database operations, such as accumulation and aggregation, sieving, sorting, and associative joins.
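
    The scan step in this abstract (apply a predicate to a column, emit the RIDs of matching rows, then merge RID lists) can be modelled in software as below. This is only a sequential sketch of the idea; the hardware described performs the comparisons in parallel, and the names `scan_column` and `intersect_rid_lists` are hypothetical.

```python
import operator


def scan_column(values, compare, constant):
    """Return the RIDs (positions) whose value satisfies compare(value, constant)."""
    return [rid for rid, value in enumerate(values) if compare(value, constant)]


def intersect_rid_lists(left, right):
    """Merge two ascending RID lists, keeping only RIDs present in both (logical AND)."""
    result = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] == right[j]:
            result.append(left[i])
            i += 1
            j += 1
        elif left[i] < right[j]:
            i += 1
        else:
            j += 1
    return result


price = [5, 12, 7, 30, 12]
quantity = [1, 4, 9, 2, 6]

expensive = scan_column(price, operator.gt, 6)    # rows with price > 6
bulk = scan_column(quantity, operator.ge, 4)      # rows with quantity >= 4
matching = intersect_rid_lists(expensive, bulk)   # rows satisfying both predicates
```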

    Methods and systems for real-time continuous updates
    25.
    Granted patent
    Status: In force

    Publication No.: US08458129B2

    Publication date: 2013-06-04

    Application No.: US12144486

    Filing date: 2008-06-23

    IPC classification: G06F7/00

    CPC classification: G06F17/30356 G06F17/30551

    Abstract: Embodiments of the present invention provide fine-grained concurrency control for transactions in the presence of database updates. During operation, each transaction is assigned a snapshot version number (SVN). An SVN refers to a historical snapshot of the database that can be created periodically or on demand. Each transaction is thus tied to a particular SVN, such as the latest SVN at the time the transaction was created. Queries belonging to a transaction can access data that is consistent as of a point in time, for example, the point corresponding to the latest SVN when the transaction was created. At various times, data from the database stored in a memory can be updated using the snapshot data corresponding to an SVN. When a transaction is committed, a snapshot of the database with a new SVN is created based on the data modified by the transaction, and the snapshot is synchronized to the memory. When a transaction query requires data from a version of the database corresponding to an SVN, the data in the memory may be synchronized with the snapshot data corresponding to that SVN.
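
    A toy model may help make the SVN mechanism concrete. It assumes that each commit creates exactly one new SVN and that old versions are kept per key; the class `VersionedStore` and its methods are invented for this sketch and do not reflect how the patent synchronizes snapshots with accelerator memory.

```python
class VersionedStore:
    """Hypothetical key-value store in which every commit creates a new SVN."""

    def __init__(self):
        self.latest_svn = 0
        self.versions = {}   # key -> list of (svn, value), in ascending SVN order

    def begin(self):
        """Start a transaction: it is tied to the latest SVN at creation time."""
        return self.latest_svn

    def commit(self, changes):
        """Apply a transaction's changes as a new snapshot and return its SVN."""
        self.latest_svn += 1
        for key, value in changes.items():
            self.versions.setdefault(key, []).append((self.latest_svn, value))
        return self.latest_svn

    def read(self, key, svn):
        """Return the value of `key` as of snapshot `svn`, or None if it did not exist."""
        result = None
        for version_svn, value in self.versions.get(key, []):
            if version_svn > svn:
                break
            result = value
        return result


store = VersionedStore()
store.commit({"balance": 100})        # creates SVN 1
reader_svn = store.begin()            # this reader is tied to SVN 1
store.commit({"balance": 250})        # creates SVN 2, invisible to the reader
old_view = store.read("balance", reader_svn)
new_view = store.read("balance", store.begin())
```

    The reader keeps seeing the value from its own SVN even after a later commit, which is the point-in-time consistency the abstract describes.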

    METHODS AND SYSTEMS FOR GENERATING QUERY PLANS THAT ARE COMPATIBLE FOR EXECUTION IN HARDWARE
    26.
    Patent application
    Status: Under examination (published)

    Publication No.: US20100005077A1

    Publication date: 2010-01-07

    Application No.: US12168821

    Filing date: 2008-07-07

    IPC classification: G06F17/30

    CPC classification: G06F16/24542

    Abstract: Embodiments of the present invention generate and optimize query plans that are at least partially executable in hardware. When a query is received, it is rewritten and optimized with a bias toward hardware execution of fragments of the query. A template-based algorithm may be employed for transforming a query into fragments and then into query tasks. The various query tasks can then be routed to a hardware accelerator or a software module, or sent back to a database management system for execution. For those tasks routed to the hardware accelerator, the query tasks are compiled into machine code database instructions. In order to optimize query execution, query tasks may be broken into subtasks, rearranged based on available resources of the hardware, pipelined, or branched conditionally.
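
    The three-way routing of query tasks can be sketched as follows. The abstract does not say which operations go where, so the sets `HARDWARE_OPS` and `SOFTWARE_OPS` and the operation names in them are purely hypothetical; only the three destinations come from the abstract.

```python
# Hypothetical operations the accelerator is assumed to support.
HARDWARE_OPS = {"scan", "filter", "sort", "join"}
# Hypothetical operations assumed to be handled by a software module.
SOFTWARE_OPS = {"format_result"}


def route_task(task):
    """Pick a destination for one query task, preferring hardware when possible."""
    op = task["op"]
    if op in HARDWARE_OPS:
        return "hardware"
    if op in SOFTWARE_OPS:
        return "software"
    return "dbms"   # anything else is sent back to the database management system


def route_plan(tasks):
    """Group the operations of a plan by destination, preserving task order."""
    routed = {"hardware": [], "software": [], "dbms": []}
    for task in tasks:
        routed[route_task(task)].append(task["op"])
    return routed


plan = [{"op": "scan"}, {"op": "format_result"}, {"op": "recursive_cte"}, {"op": "sort"}]
routed = route_plan(plan)
```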

    FAST BULK LOADING AND INCREMENTAL LOADING OF DATA INTO A DATABASE
    27.
    Patent application
    Status: In force

    Publication No.: US20090319550A1

    Publication date: 2009-12-24

    Application No.: US12144303

    Filing date: 2008-06-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for a pipelined load. The online incremental loading process may also be performed while serving queries.
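
    One way to picture the index profiling phase is to sample the keys, choose split points at equal quantiles, and assign each incoming key to a partition by binary search. The abstract does not describe the actual profiling method, so this quantile approach and the names `profile_split_points` and `partition_of` are assumptions.

```python
import bisect


def profile_split_points(sample_keys, num_partitions):
    """Choose num_partitions - 1 split points at equal quantiles of the sampled keys."""
    ordered = sorted(sample_keys)
    return [ordered[(i * len(ordered)) // num_partitions] for i in range(1, num_partitions)]


def partition_of(key, split_points):
    """Return the index of the partition that should receive `key`."""
    return bisect.bisect_right(split_points, key)


sample = list(range(100))                       # stand-in for keys seen during profiling
split_points = profile_split_points(sample, 4)

# Count how many sampled keys land in each partition to check the balance.
counts = [0, 0, 0, 0]
for key in sample:
    counts[partition_of(key, split_points)] += 1
```

    With evenly sized partitions, each index partition can be built by a separate pipeline stage without one stage becoming the bottleneck.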

    Methods and systems for hardware acceleration of database operations and queries
    29.
    Granted patent
    Status: In force

    Publication No.: US08244718B2

    Publication date: 2012-08-14

    Application No.: US11895952

    Filing date: 2007-08-27

    IPC classification: G06F7/00

    CPC classification: G06F17/30442

    Abstract: Embodiments of the present invention provide a database system that is optimized by using hardware acceleration. The system may be implemented in several variations to accommodate a wide range of queries and database sizes. In some embodiments, the system may comprise a host system that is coupled to one or more hardware accelerator components. The host system may execute software or provide an interface for receiving queries. The host system analyzes and parses these queries into tasks. The host system may then select some of the tasks and translate them into machine code instructions, which are executed by one or more hardware accelerator components. The tasks executed by hardware accelerators are generally those that are repetitive or processing-intensive. Such tasks may include, for example, indexing, searching, sorting, table scanning, record filtering, and the like.

    Fast batch loading and incremental loading of data into a database
    30.
    Granted patent
    Status: In force

    Publication No.: US08165988B2

    Publication date: 2012-04-24

    Application No.: US12984284

    Filing date: 2011-01-04

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for a pipelined load. The online incremental loading process may also be performed while serving queries.
