Performing database operations using a vectorized approach or a non-vectorized approach

    Publication number: US10671583B2

    Publication date: 2020-06-02

    Application number: US15684943

    Filing date: 2017-08-24

    Abstract: Techniques for performing database operations using vectorized instructions are provided. In one technique, it is determined whether to perform a database operation using one or more vectorized instructions or without using any vectorized instructions. This determination may comprise estimating a first cost of performing the database operation using one or more vectorized instructions and estimating a second cost of performing the database operation without using any vectorized instructions. Multiple factors may be used to determine which approach to follow, such as the number of data elements that fit into a SIMD register, the number of vectorized instructions in the vectorized approach, the number of data movement instructions that move data from a SIMD register to a non-SIMD register and/or vice versa, the size of a cache, and the projected size of a hash table.
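
The cost comparison described in this abstract can be sketched roughly as follows. This is only an illustrative model, not the patented method: the abstract names the factors but not how they are weighted, so the `CostInputs` fields, the per-instruction weights, and the way the cache term enters are all assumptions made for this example.

```python
import math
from dataclasses import dataclass


@dataclass
class CostInputs:
    """Hypothetical inputs; the field names are illustrative assumptions."""
    num_elements: int                # data elements the operation must process
    element_bits: int                # width of one data element
    simd_register_bits: int          # width of one SIMD register
    vector_instrs_per_batch: int     # vectorized instructions per register-full
    moves_per_batch: int             # SIMD <-> non-SIMD register moves per batch
    scalar_instrs_per_element: int   # instructions per element, scalar path
    cache_bytes: int                 # size of the cache
    projected_hash_table_bytes: int  # projected size of the hash table


# Assumed relative weights, chosen only to make the example concrete.
VECTOR_INSTR_COST = 1.0
MOVE_COST = 3.0
SCALAR_INSTR_COST = 1.0
CACHE_MISS_COST = 2.0


def elements_per_register(c: CostInputs) -> int:
    """How many data elements fit into one SIMD register."""
    return max(1, c.simd_register_bits // c.element_bits)


def memory_penalty(c: CostInputs) -> float:
    """Assumption: a hash table larger than the cache costs one miss per element."""
    if c.projected_hash_table_bytes > c.cache_bytes:
        return c.num_elements * CACHE_MISS_COST
    return 0.0


def vectorized_cost(c: CostInputs) -> float:
    batches = math.ceil(c.num_elements / elements_per_register(c))
    per_batch = (c.vector_instrs_per_batch * VECTOR_INSTR_COST
                 + c.moves_per_batch * MOVE_COST)
    return batches * per_batch + memory_penalty(c)


def scalar_cost(c: CostInputs) -> float:
    per_element = c.scalar_instrs_per_element * SCALAR_INSTR_COST
    return c.num_elements * per_element + memory_penalty(c)


def choose_approach(c: CostInputs) -> str:
    """Pick whichever estimated cost is lower."""
    return "vectorized" if vectorized_cost(c) < scalar_cost(c) else "non-vectorized"
```

In this simplified model the cache term is added to both estimates, so it shows where that factor would enter without changing the outcome; a more realistic model would presumably treat the two paths differently. The register-move term is what can tip the decision: with enough SIMD to non-SIMD moves per batch, the scalar path becomes cheaper.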

    Workload aware data placement for join-based query processing in a cluster

    Publication number: US09984081B2

    Publication date: 2018-05-29

    Application number: US14610892

    Filing date: 2015-01-30

    Abstract: A method for distributing tables to a cluster of nodes managed by a database management system (DBMS) is disclosed. Multiple data placement schemes are evaluated based on a query workload set to select a data placement scheme for the cluster of nodes. Tables used in join operations in the workload set are selected for evaluation of data placement schemes. Query execution costs for the workload set are generated by estimating a query execution cost for each data placement scheme for the tables. The data placement scheme with the lowest estimated execution cost for the workload set is selected as the data placement scheme for the cluster of nodes managed by the DBMS.
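
A minimal sketch of the selection loop this abstract describes: enumerate candidate placement schemes for the tables that appear in joins, estimate the workload cost under each, and keep the cheapest. The table names, row counts, the replication limit, and the cost model (a join is free when it can run locally, otherwise it pays for shuffling the smaller table) are all hypothetical and chosen only to make the example concrete.

```python
from itertools import product

# Hypothetical catalog: table name -> row count.
TABLE_ROWS = {"orders": 1_000_000, "customers": 50_000, "items": 200_000}

# Hypothetical workload: (left table, left column, right table, right column, frequency).
WORKLOAD = [
    ("orders", "cust_id", "customers", "id", 10),
    ("orders", "item_id", "items", "id", 3),
]

# Assumption: only tables at or below this size may be replicated to every node.
REPLICATION_LIMIT_ROWS = 100_000


def options_for(table):
    """Placement options for one table: partition on a join column, or replicate."""
    join_cols = sorted(
        {lc for lt, lc, _, _, _ in WORKLOAD if lt == table}
        | {rc for _, _, rt, rc, _ in WORKLOAD if rt == table}
    )
    opts = [("partition", col) for col in join_cols]
    if TABLE_ROWS[table] <= REPLICATION_LIMIT_ROWS:
        opts.append(("replicate", None))
    return opts


def join_cost(scheme, join):
    """Estimated cost of one join: zero if it can run locally, else a shuffle."""
    lt, lc, rt, rc, freq = join
    left, right = scheme[lt], scheme[rt]
    replicated = "replicate" in (left[0], right[0])
    # Assumes both tables use the same hash function and partition count.
    co_partitioned = left == ("partition", lc) and right == ("partition", rc)
    if replicated or co_partitioned:
        return 0
    return freq * min(TABLE_ROWS[lt], TABLE_ROWS[rt])


def workload_cost(scheme):
    return sum(join_cost(scheme, join) for join in WORKLOAD)


def best_scheme():
    """Exhaustively evaluate every scheme and return the cheapest one."""
    tables = sorted(TABLE_ROWS)
    combos = product(*(options_for(t) for t in tables))
    return min((dict(zip(tables, combo)) for combo in combos), key=workload_cost)
```

Exhaustive enumeration is used here only because the example is tiny; the number of schemes grows multiplicatively with the number of tables, so a real system would likely need to prune the search.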

    Method for failure-resilient data placement in a distributed query processing system

    Publication number: US09842148B2

    Publication date: 2017-12-12

    Application number: US14704825

    Filing date: 2015-05-05

    CPC classification number: G06F17/30545

    Abstract: Herein is described a data placement scheme for a distributed query processing system that achieves load balance amongst the nodes of the system. To identify a node on which to place particular data, a supervisor node performs a placement algorithm over the particular data's identifier, where the placement algorithm utilizes two or more hash functions. The supervisor node runs the placement algorithm until a destination node is identified that is available to store the data, or until the supervisor node has run the placement algorithm an established number of times. If no available node is identified using the placement algorithm, then an available destination node is identified for the particular data, and information identifying the data and the selected destination node is included in an exception map. Most data may be located by any node in the system based on the node performing the placement algorithm for the required data.
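
The placement loop in this abstract might look something like the sketch below. The abstract says only that the algorithm uses two or more hash functions; combining them by double hashing (`h1 + attempt * h2`), the SHA-256-based `_hash` helper, the `MAX_ATTEMPTS` value, and the choice of the lowest-numbered available node as the fallback are assumptions made for illustration.

```python
import hashlib

MAX_ATTEMPTS = 4  # assumed "established number of times"


def _hash(data_id: str, salt: str) -> int:
    """One member of a family of hash functions, selected by salt (hypothetical)."""
    digest = hashlib.sha256(f"{salt}:{data_id}".encode()).digest()
    return int.from_bytes(digest[:8], "big")


def candidate_node(data_id: str, attempt: int, num_nodes: int) -> int:
    """Node proposed on a given attempt, combining two hashes by double hashing."""
    start = _hash(data_id, "h1")
    step = 1 + _hash(data_id, "h2") % max(1, num_nodes - 1)
    return (start + attempt * step) % num_nodes


def candidates(data_id: str, num_nodes: int) -> list:
    """All nodes the placement algorithm would try, in order."""
    return [candidate_node(data_id, a, num_nodes) for a in range(MAX_ATTEMPTS)]


def place(data_id: str, available: set, num_nodes: int, exception_map: dict) -> int:
    """Supervisor-side placement: try hashed candidates, then fall back."""
    for node in candidates(data_id, num_nodes):
        if node in available:
            return node
    # No hashed candidate was available: pick any available node and record it.
    fallback = min(available)  # raises ValueError if no node is available
    exception_map[data_id] = fallback
    return fallback


def locate(data_id: str, num_nodes: int, exception_map: dict) -> list:
    """Nodes to probe for the data: the exception entry, else the hashed candidates."""
    if data_id in exception_map:
        return [exception_map[data_id]]
    return candidates(data_id, num_nodes)
```

Because `candidates` depends only on the identifier and the node count, any node can recompute it without consulting the supervisor; only the data that ended up in the exception map needs a lookup.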

    Performing database operations using a vectorized approach or a non-vectorized approach

    Publication number: US20170351670A1

    Publication date: 2017-12-07

    Application number: US15684943

    Filing date: 2017-08-24

    Abstract: Techniques for performing database operations using vectorized instructions are provided. In one technique, it is determined whether to perform a database operation using one or more vectorized instructions or without using any vectorized instructions. This determination may comprise estimating a first cost of performing the database operation using one or more vectorized instructions and estimating a second cost of performing the database operation without using any vectorized instructions. Multiple factors may be used to determine which approach to follow, such as the number of data elements that fit into a SIMD register, the number of vectorized instructions in the vectorized approach, the number of data movement instructions that move data from a SIMD register to a non-SIMD register and/or vice versa, the size of a cache, and the projected size of a hash table.

    Pushdown of sorting and set operations (union, intersection, minus) to a large number of low-power cores in a heterogeneous system
    Type: granted invention patent (in force)

    Publication number: US09135301B2

    Publication date: 2015-09-15

    Application number: US13831279

    Filing date: 2013-03-14

    Abstract: A system and method for performing a query operation on a pair of relations in a database system coupled to a heterogeneous system (HS) is disclosed. Assuming that the pair of relations is partitioned and already loaded into the HS, the database system receives a query on the pair of relations and, based on the type of query operation, computes the cost of performing the query operation on the database alone or the costs of performing the query operation with the assistance of the HS, each of the costs corresponding to a particular algorithm. If the costs indicate that the HS improves the performance of the query operation, then the HS computes portions of the operation and returns the results to the database system. If any parts of the relations are out of sync with the database system, the database system performs operations to maintain transactional consistency.
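
To illustrate how a set operation on partitioned relations can be computed in portions, here is a rough sketch. It is an analogy rather than the patented design: a thread pool stands in for the low-power cores of the HS, relations are plain Python collections, and the cost comparison and out-of-sync handling mentioned in the abstract are left out. All function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

# Set operations named in the patent title, mapped to Python set methods.
SET_OPS = {
    "union": set.union,
    "intersection": set.intersection,
    "minus": set.difference,
}


def partition(rows, num_parts):
    """Hash-partition rows so equal rows always land in the same partition index."""
    parts = [set() for _ in range(num_parts)]
    for row in rows:
        parts[hash(row) % num_parts].add(row)
    return parts


def pushdown_set_op(op, left, right, num_parts=4):
    """Apply a set operation partition by partition, then merge the partial results."""
    combine = SET_OPS[op]
    left_parts = partition(left, num_parts)
    right_parts = partition(right, num_parts)
    # Each worker handles one pair of matching partitions independently.
    with ThreadPoolExecutor(max_workers=num_parts) as pool:
        partials = list(pool.map(combine, left_parts, right_parts))
    # The "database" side merges the partial results into the final answer.
    return set().union(*partials)
```

The partition-wise result equals the global one because both relations are partitioned with the same hash: any row present in both inputs is guaranteed to meet its counterpart in the same partition pair.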

