Methods and systems for hardware acceleration of database operations and queries
    31.
    Patent application
    Methods and systems for hardware acceleration of database operations and queries (Active)

    Publication No.: US20080183688A1

    Publication date: 2008-07-31

    Application No.: US11895952

    Filing date: 2007-08-27

    IPC classification: G06F17/30

    CPC classification: G06F17/30442

    Abstract: Embodiments of the present invention provide a database system that is optimized by using hardware acceleration. The system may be implemented in several variations to accommodate a wide range of queries and database sizes. In some embodiments, the system may comprise a host system that is coupled to one or more hardware accelerator components. The host system may execute software or provide an interface for receiving queries. The host system analyzes and parses these queries into tasks. The host system may then select some of the tasks and translate them into machine code instructions, which are executed by one or more hardware accelerator components. The tasks executed by hardware accelerators are generally those that are repetitive or processing intensive. Such tasks may include, for example, indexing, searching, sorting, table scanning, record filtering, and the like.

    Calibration of logical cost formulae for queries in a heterogeneous DBMS using synthetic database
    32.
    Granted patent
    Calibration of logical cost formulae for queries in a heterogeneous DBMS using synthetic database (Expired)

    Publication No.: US5412806A

    Publication date: 1995-05-02

    Application No.: US932426

    Filing date: 1992-08-20

    Abstract: A programmable machine system and method for managing electronic data access among multiple different relational databases in a network distributed database environment. The machine is programmed so that it can construct cost-effective access strategies for any of the participating databases absent any DBMS-specific cost models. The system provides query optimization across different database management systems in a network distributed database environment based on a calibrating database that relies only on typical relational database statistics; cost data is developed by running queries in the various databases against the calibrating database. A logical cost model is constructed using the resulting cost data and is used to estimate the cost of a given query based on logical characteristics of the DBMS, the relations, and the query itself. The cost of a complex query is estimated using primitive queries. Optimal query access strategies are thereby designed and used to control execution of the queries across relational databases controlled by two or more different database management systems.
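
    The calibration idea in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each primitive query's cost is linear in the number of rows it touches; the names `LinearCost`, `calibrate`, and `estimate_plan` and the sample figures are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class LinearCost:
    """Hypothetical logical cost formula: cost = fixed + per_row * rows."""
    fixed: float
    per_row: float

    def estimate(self, rows: int) -> float:
        return self.fixed + self.per_row * rows


def calibrate(samples):
    """Least-squares fit of a LinearCost from (rows, measured_cost) pairs.

    The pairs stand in for timings obtained by running one primitive
    query against a synthetic calibrating database of varying size.
    """
    n = len(samples)
    mean_x = sum(rows for rows, _ in samples) / n
    mean_y = sum(cost for _, cost in samples) / n
    sxx = sum((rows - mean_x) ** 2 for rows, _ in samples)
    sxy = sum((rows - mean_x) * (cost - mean_y) for rows, cost in samples)
    per_row = sxy / sxx
    return LinearCost(fixed=mean_y - per_row * mean_x, per_row=per_row)


def estimate_plan(plan, models):
    """Estimate a complex query as the sum of its primitive-query costs.

    plan is a list of (primitive_name, rows) steps; models maps each
    primitive name to its calibrated LinearCost.
    """
    return sum(models[name].estimate(rows) for name, rows in plan)


# Illustrative measurements for a "scan" primitive (made-up numbers).
scan_model = calibrate([(1000, 12.0), (2000, 22.0), (4000, 42.0)])
```

    Summing primitive costs is the simplest possible composition rule; a real optimizer would likely also account for selectivity and for interactions between operators.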

    Methods and systems for run-time scheduling database operations that are executed in hardware
    33.
    Granted patent
    Methods and systems for run-time scheduling database operations that are executed in hardware (Active)

    Publication No.: US09424315B2

    Publication date: 2016-08-23

    Application No.: US12099076

    Filing date: 2008-04-07

    IPC classification: G06F17/30 G06F9/48

    Abstract: Embodiments of the present invention provide a run-time scheduler that schedules tasks for database queries on one or more execution resources in a dataflow fashion. In some embodiments, the run-time scheduler may comprise a task manager, a memory manager, and a hardware resource manager. When a query is received by a host database management system, a query plan is created for that query. The query plan splits a query into various fragments. These fragments are further compiled into a directed acyclic graph of tasks. Unlike conventional scheduling, the dependency arc in the directed acyclic graph is based on page resources. Tasks may comprise machine code that may be executed by hardware to perform portions of the query. These tasks may also be performed in software or relate to I/O.
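
    A minimal sketch of page-based dataflow scheduling follows. It assumes that each task declares the pages it reads and the pages it writes, and that a task becomes runnable once every page it reads is available; the `schedule` function and the page names are hypothetical and are not the patent's actual scheduler.

```python
def schedule(tasks, initial_pages=()):
    """Return one valid execution order for page-dependent tasks.

    tasks maps a task name to (reads, writes), two lists of page ids.
    A task may run once all pages it reads are available; running it
    makes the pages it writes available to later tasks.
    """
    available = set(initial_pages)
    pending = dict(tasks)
    order = []
    progress = True
    while pending and progress:
        progress = False
        for name in list(pending):
            reads, writes = pending[name]
            if set(reads) <= available:
                order.append(name)
                available |= set(writes)
                del pending[name]
                progress = True
    if pending:
        # Remaining tasks wait on pages nobody produces (or on a cycle).
        raise ValueError(f"unsatisfiable page dependencies: {sorted(pending)}")
    return order


plan = {
    "filter": (["p1"], ["p2"]),
    "scan": ([], ["p1"]),
    "sort": (["p2"], ["p3"]),
}
print(schedule(plan))  # ['scan', 'filter', 'sort']
```

    The dependency arcs here are implied by shared page ids rather than declared between tasks, which is one way to read "the dependency arc is based on page resources".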

    Accessing data in column store database based on hardware compatible data structures
    34.
    Granted patent
    Accessing data in column store database based on hardware compatible data structures (Active)

    Publication No.: US09378231B2

    Publication date: 2016-06-28

    Application No.: US13107399

    Filing date: 2011-05-13

    IPC classification: G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column-groups are stored with implicit row ids (RIDs) and a RID-to-primary key column having both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is utilized that allows for arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.
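
    The following sketch illustrates two ideas from this abstract under simplifying assumptions: fixed-width columns with implicit RIDs, where a value's byte offset is derived arithmetically as `rid * width`, and tuple reconstruction by reading the same RID from several columns. The class `FixedWidthColumn` and the function `reconstruct_row` are hypothetical names; the patent's column groups, compression, and global virtual address space are not modeled.

```python
import struct


class FixedWidthColumn:
    """A column stored as a packed array of fixed-width values.

    The row id (RID) is implicit: it is the position of the value in
    the array and is never stored alongside the data.
    """

    def __init__(self, fmt):
        self.fmt = fmt                      # struct format, e.g. "<i"
        self.width = struct.calcsize(fmt)   # bytes per value
        self.data = bytearray()

    def append(self, value):
        """Append a value and return the implicit RID assigned to it."""
        self.data += struct.pack(self.fmt, value)
        return len(self.data) // self.width - 1

    def offset(self, rid):
        """Byte offset of a row, derived arithmetically from its RID."""
        return rid * self.width

    def get(self, rid):
        return struct.unpack_from(self.fmt, self.data, self.offset(rid))[0]


def reconstruct_row(columns, rid):
    """Stitch one row together by reading the same RID from each column."""
    return tuple(col.get(rid) for col in columns)


ids = FixedWidthColumn("<i")     # 4-byte little-endian integers
prices = FixedWidthColumn("<q")  # 8-byte little-endian integers
for i, p in [(10, 100), (11, 200), (12, 300)]:
    ids.append(i)
    prices.append(p)
print(reconstruct_row([ids, prices], 1))  # (11, 200)
```

    Because every value has the same width, no per-row index is needed to locate a value, which is what makes this layout straightforward for fixed-function hardware to address.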

    Accessing data in a column store database based on hardware compatible data structures
    35.
    Granted patent
    Accessing data in a column store database based on hardware compatible data structures (Active)

    Publication No.: US07966343B2

    Publication date: 2011-06-21

    Application No.: US12099131

    Filing date: 2008-04-07

    IPC classification: G06F17/30

    CPC classification: G06F17/30315

    Abstract: Embodiments of the present invention provide one or more hardware-friendly data structures that enable efficient hardware acceleration of database operations. In particular, the present invention employs a column-store format for the database. In the database, column-groups are stored with implicit row ids (RIDs) and a RID-to-primary key column having both column-store and row-store benefits via column hopping and a heap structure for adding new data. Fixed-width column compression allows for easy hardware database processing directly on the compressed data. A global database virtual address space is utilized that allows for arithmetic derivation of any physical address of the data regardless of its location. A word compression dictionary with token compare and sort index is also provided to allow for efficient hardware-based searching of text. A tuple reconstruction process is provided as well that allows hardware to reconstruct a row by stitching together data from multiple column groups.

    Fast bulk loading and incremental loading of data into a database
    36.
    Granted patent
    Fast bulk loading and incremental loading of data into a database (Active)

    Publication No.: US07895151B2

    Publication date: 2011-02-22

    Application No.: US12144303

    Filing date: 2008-06-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30595

    Abstract: Embodiments of the present invention provide for batch and incremental loading of data into a database. In the present invention, the loader infrastructure utilizes machine code database instructions and hardware acceleration to parallelize the load operations with the I/O operations. A large hardware accelerator memory is used as a staging cache for the load process. The load process also comprises an index profiling phase that enables balanced partitioning of the created indexes to allow for pipelined loading. The online incremental loading process may also be performed while serving queries.

    METHODS AND SYSTEMS FOR REAL-TIME CONTINUOUS UPDATES
    37.
    Patent application
    METHODS AND SYSTEMS FOR REAL-TIME CONTINUOUS UPDATES (Active)

    Publication No.: US20090319486A1

    Publication date: 2009-12-24

    Application No.: US12144486

    Filing date: 2008-06-23

    IPC classification: G06F7/06 G06F17/30

    CPC classification: G06F17/30356 G06F17/30551

    Abstract: Embodiments of the present invention provide fine-grained concurrency control for transactions in the presence of database updates. During operations, each transaction is assigned a snapshot version number or SVN. An SVN refers to a historical snapshot of the database that can be created periodically or on demand. Transactions are thus tied to a particular SVN, such as the SVN that was current when the transaction was created. Queries belonging to the transactions can access data that is consistent as of a point in time, for example, corresponding to the latest SVN when the transaction was created. At various times, data from the database stored in a memory can be updated using the snapshot data corresponding to an SVN. When a transaction is committed, a snapshot of the database with a new SVN is created based on the data modified by the transaction and the snapshot is synchronized to the memory. When a transaction query requires data from a version of the database corresponding to an SVN, the data in the memory may be synchronized with the snapshot data corresponding to that SVN.
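
    The snapshot version number scheme can be sketched as a small versioned key-value store. This is an illustration under assumptions, not the patent's implementation: the class `SnapshotStore` and its methods are hypothetical, every commit creates a new SVN, and the synchronization of snapshots to accelerator memory is not modeled.

```python
class SnapshotStore:
    """Toy versioned store in which each commit creates a new SVN."""

    def __init__(self):
        self.latest_svn = 0
        # key -> list of (svn, value) pairs in ascending SVN order
        self.versions = {}

    def begin(self):
        """Start a transaction pinned to the latest SVN at creation time."""
        return self.latest_svn

    def read(self, key, svn):
        """Return the value of key as of snapshot svn, or None if absent."""
        for version_svn, value in reversed(self.versions.get(key, [])):
            if version_svn <= svn:
                return value
        return None

    def commit(self, writes):
        """Apply a transaction's writes under a new SVN and return it."""
        self.latest_svn += 1
        for key, value in writes.items():
            self.versions.setdefault(key, []).append((self.latest_svn, value))
        return self.latest_svn


store = SnapshotStore()
store.commit({"balance": 100})       # creates SVN 1
reader = store.begin()               # reader is pinned to SVN 1
store.commit({"balance": 250})       # creates SVN 2
print(store.read("balance", reader))         # 100, consistent with SVN 1
print(store.read("balance", store.begin()))  # 250, latest snapshot
```

    A reader pinned to an older SVN keeps seeing a consistent view while later commits proceed, which is the point-in-time consistency the abstract describes.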

    METHODS AND SYSTEMS FOR RUN-TIME SCHEDULING DATABASE OPERATIONS THAT ARE EXECUTED IN HARDWARE
    38.
    Patent application
    METHODS AND SYSTEMS FOR RUN-TIME SCHEDULING DATABASE OPERATIONS THAT ARE EXECUTED IN HARDWARE (Active)

    Publication No.: US20090254774A1

    Publication date: 2009-10-08

    Application No.: US12099076

    Filing date: 2008-04-07

    Abstract: Embodiments of the present invention provide a run-time scheduler that schedules tasks for database queries on one or more execution resources in a dataflow fashion. In some embodiments, the run-time scheduler may comprise a task manager, a memory manager, and a hardware resource manager. When a query is received by a host database management system, a query plan is created for that query. The query plan splits a query into various fragments. These fragments are further compiled into a directed acyclic graph of tasks. Unlike conventional scheduling, the dependency arc in the directed acyclic graph is based on page resources. Tasks may comprise machine code that may be executed by hardware to perform portions of the query. These tasks may also be performed in software or relate to I/O.
