Method and apparatus to support bitmap filtering in a parallel system
    1.
    发明申请
    Method and apparatus to support bitmap filtering in a parallel system 有权
    支持并行系统中位图过滤的方法和装置

    公开(公告)号:US20060117036A1

    公开(公告)日:2006-06-01

    申请号:US11001363

    申请日:2004-11-30

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30498

    摘要: Techniques are described for combining pieces of information from two sources. The techniques may be used to improve the performance, for example, of hash join operations that are parallelized using slaves distributed across multiple nodes. According to one technique, bitmap filtering operations are performed by the probe-phase producer slaves, rather than the probe-phase consumer slaves. To avoid having to merge separately built bitmap filter chunks, the left-hand rows may be sent to every probe-phase consumer slave. Alternatively, the merge operation may be avoided by distributing the rows of one source based on how the other source has been statically partitioned.

    摘要翻译: 描述了用于组合来自两个来源的信息的技术。 这些技术可以用于改进例如使用分布在多个节点上的从属并行化的散列连接操作的性能。 根据一种技术,位图过滤操作由探针相位生成器从站而不是探针相消费者从站执行。 为了避免必须合并单独构建的位图过滤器块,可以将左侧行发送到每个探针阶段的客户从站。 或者,可以基于如何静态地分割其他源来分发一个源的行来避免合并操作。

    Dynamic performance views with a parallel single cursor model
    2.
    发明申请
    Dynamic performance views with a parallel single cursor model 有权
    具有并行单光标模型的动态性能视图

    公开(公告)号:US20050131878A1

    公开(公告)日:2005-06-16

    申请号:US10898300

    申请日:2004-07-23

    IPC分类号: G06F7/00 G06F17/30

    摘要: A query coordinator handles a multiple-server dynamic performance query by sending remote query slaves (1) first information for generating a complete plan for the query, and (2) second information for participating in the dynamic performance view portion of the query. If the slaves on the remote servers are unable to use the first information to generate an equivalent query (for example, if they reside in a database server that has closed the database), then the slaves on the remote servers use the second information to participate in the dynamic performance view portion of the query.

    摘要翻译: 查询协调器通过发送远程查询从站来处理多服务器动态性能查询(1)用于生成查询的完整计划的第一信息,以及(2)用于参与查询的动态性能视图部分的第二信息。 如果远程服务器上的从站无法使用第一个信息来生成等效查询(例如,如果它们位于已关闭数据库的数据库服务器中),则远程服务器上的从站将使用第二个信息来参与 在查询的动态性能视图部分。

    Techniques for recommending parallel execution of SQL statements
    3.
    发明授权
    Techniques for recommending parallel execution of SQL statements 有权
    推荐并行执行SQL语句的技术

    公开(公告)号:US09256642B2

    公开(公告)日:2016-02-09

    申请号:US12827926

    申请日:2010-06-30

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30445

    摘要: Techniques for automatically recommending parallel execution of a SQL statement. In one set of embodiments, a first determination can be made regarding whether a SQL statement can be executed in parallel. Further, a second determination can be made regarding whether executing the SQL statement in parallel is faster than executing the statement in serial by a predetermined factor. If the first determination and second determination are positive (i.e., the statement can be executed in parallel and parallel execution is faster by the predetermined factor), a recommendation can be provided indicating that the SQL statement should be executed in parallel. In some embodiments, the recommendation can include a report specifying the degree of performance improvement gained from parallel execution, additional system resources consumed by parallel execution, and other statistics pertaining to the recommended parallel execution plan.

    摘要翻译: 自动推荐并行执行SQL语句的技术。 在一组实施例中,可以首先确定是否可以并行执行SQL语句。 此外,可以进行关于并行执行SQL语句是否比以预定因子串行执行语句更快的第二确定。 如果第一确定和第二确定是正的(即,可以并行执行语句并且并行执行以预定因子更快),则可以提供指示应该并行执行SQL语句的建议。 在一些实施例中,该建议可以包括指定从并行执行获得的性能改进程度的报告,并行执行消耗的附加系统资源以及与推荐的并行执行计划有关的其他统计信息。

    TECHNIQUES FOR RECOMMENDING PARALLEL EXECUTION OF SQL STATEMENTS
    4.
    发明申请
    TECHNIQUES FOR RECOMMENDING PARALLEL EXECUTION OF SQL STATEMENTS 有权
    推荐使用SQL语句的并行执行技术

    公开(公告)号:US20120005188A1

    公开(公告)日:2012-01-05

    申请号:US12827926

    申请日:2010-06-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30445

    摘要: Techniques for automatically recommending parallel execution of a SQL statement. In one set of embodiments, a first determination can be made regarding whether a SQL statement can be executed in parallel. Further, a second determination can be made regarding whether executing the SQL statement in parallel is faster than executing the statement in serial by a predetermined factor. If the first determination and second determination are positive (i.e., the statement can be executed in parallel and parallel execution is faster by the predetermined factor), a recommendation can be provided indicating that the SQL statement should be executed in parallel. In some embodiments, the recommendation can include a report specifying the degree of performance improvement gained from parallel execution, additional system resources consumed by parallel execution, and other statistics pertaining to the recommended parallel execution plan.

    摘要翻译: 自动推荐并行执行SQL语句的技术。 在一组实施例中,可以首先确定是否可以并行执行SQL语句。 此外,可以进行关于并行执行SQL语句是否比以预定因子串行执行语句更快的第二确定。 如果第一确定和第二确定是正的(即,可以并行执行语句并且并行执行以预定因子更快),则可以提供指示应该并行执行SQL语句的建议。 在一些实施例中,该建议可以包括指定从并行执行获得的性能改进程度的报告,并行执行消耗的附加系统资源以及与推荐的并行执行计划有关的其他统计信息。

    Executing a parallel single cursor model
    5.
    发明授权
    Executing a parallel single cursor model 有权
    执行并行单光标模型

    公开(公告)号:US07685095B2

    公开(公告)日:2010-03-23

    申请号:US10944175

    申请日:2004-09-16

    IPC分类号: G06F17/30

    摘要: Techniques are provided for executing database statements, or portions thereof, in parallel without using slave SQL to communicate to each slave the operations to be performed by the slave. Techniques are provided for incorporating within a shared cursor the code fragments that govern both sides of the interaction between a query coordinator (QC) and remotely-located slaves. Further, techniques are provided for the QC to communicate with each slave on how and which portions of the execution plan to execute and when. A state-transition engine for slave execution under the control of the query-coordinator is also provided.

    摘要翻译: 提供了用于并行地执行数据库语句或其部分的技术,而不使用从属SQL向从属单元通信由从属单元执行的操作。 提供了在共享游标中并入管理查询协调器(QC)和远程定位的从站之间的交互的两侧的代码片段的技术。 此外,提供了用于QC的技术以与每个从属单元通信以执行执行计划以及何时执行计划的哪些部分。 还提供了在查询协调器的控制下从属执行的状态转换引擎。

    Hybrid optimization strategies in automatic SQL tuning
    6.
    发明申请
    Hybrid optimization strategies in automatic SQL tuning 有权
    自动SQL调优中的混合优化策略

    公开(公告)号:US20100030758A1

    公开(公告)日:2010-02-04

    申请号:US12221131

    申请日:2008-07-30

    IPC分类号: G06F17/30

    摘要: Systems, methods, and other embodiments associated with hybrid optimization strategies in automatic SQL tuning are described. One example method includes receiving a first (e.g., cost-based) execution plan for a user structured query language statement (User SQL) from a first (e.g., cost-based) optimizer. The example method may also include receiving a second (e.g., rules-based) execution plan for the User SQL from a second, different (e.g., rules-based) query optimizer. The method may include identifying a preferred execution plan based on data produced by test executing the execution plans in a reproduced execution environment that reproduces at least a portion of an execution environment in which the user SQL runs. The method may also include controlling a database to execute the User SQL using the preferred execution plan.

    摘要翻译: 描述了与自动SQL调优中的混合优化策略相关联的系统,方法和其他实施例。 一个示例性方法包括从第一(例如基于成本的)优化器接收用户结构化查询语言语句(User SQL)的第一(例如基于成本的)执行计划。 示例性方法还可以包括从第二不同(例如,基于规则的)查询优化器接收针对用户SQL的第二(例如基于规则的)执行计划。 该方法可以包括基于在再现执行用户SQL运行的执行环境的至少一部分的再现执行环境中执行执行计划的测试产生的数据来识别优选执行计划。 该方法还可以包括控制数据库以使用优选的执行计划来执行User SQL。

    Approximating a database statistic
    7.
    发明申请
    Approximating a database statistic 有权
    近似数据库统计

    公开(公告)号:US20080120274A1

    公开(公告)日:2008-05-22

    申请号:US11796102

    申请日:2007-04-25

    IPC分类号: G06F7/00

    摘要: A method and apparatus for approximating a database statistic, such as the number of distinct values (NDV) is provided. To approximate the NDV for a portion of a table, a synopsis of distinct values is constructed. Each value in the portion is mapped to a domain of values. The mapping function is implemented with a uniform hash function, in one embodiment. If the resultant domain value does not exist in the synopsis, the domain value is added to the synopsis. If the synopsis reaches its capacity, a portion of the domain values are discarded from the synopsis. The statistic is approximated based on the number (N) of domain values in the synopsis and the portion of the domain that is represented in the synopsis relative to the size of the domain.

    摘要翻译: 提供了用于近似数据库统计量的方法和装置,例如不同值(NDV)的数量。 为了近似表的一部分的NDV,构建了不同值的概要。 该部分中的每个值都映射到值的域。 在一个实施例中,映射功能是用均匀散列函数实现的。 如果在概要中不存在结果域值,则将域值添加到概要中。 如果概要达到其容量,则域值的一部分将从摘要中被丢弃。 统计量基于概要中的域值的数量(N)和在概要中相对于域的大小表示的域的部分近似。

    Performing recursive database operations
    8.
    发明授权
    Performing recursive database operations 有权
    执行递归数据库操作

    公开(公告)号:US07155446B2

    公开(公告)日:2006-12-26

    申请号:US10867923

    申请日:2004-06-14

    IPC分类号: G06F17/00

    摘要: A method and apparatus for performing recursive database operations is provided. According to one aspect, a plurality of first-stage slaves and a plurality of second-stage slaves are established in a database server. During one or more iterations of a recursive database operation, the first-stage slaves concurrently process data items stored in a data repository and send results to the second-stage slaves. The second-stage slaves receive the results and concurrently process those results. The second-stage slaves store the results of the second-stage slaves' processing in the data repository. Subsequent iterations of the recursive database operation proceed in this manner until the recursive database operation has been completed. In each iteration, the first-stage slaves consume the product of the second-stage slaves' previous iteration's processing, and the second-stage slaves consume the product of the first-stage slaves' current iteration's processing.

    摘要翻译: 提供了一种用于执行递归数据库操作的方法和装置。 根据一个方面,在数据库服务器中建立多个第一级从站和多个第二级从站。 在递归数据库操作的一个或多个迭代期间,第一级从站同时处理存储在数据存储库中的数据项,并将结果发送到第二级从站。 第二阶段的奴隶收到结果并同时处理这些结果。 第二级从站将数据存储库中的第二级从站处理的结果存储起来。 递归数据库操作的后续迭代以这种方式进行,直到递归数据库操作完成。 在每次迭代中,第一级从站消耗第二级从站先前迭代处理的乘积,第二级从站消耗第一级从站当前迭代处理的乘积。

    Parallel partition-wise aggregation
    9.
    发明申请
    Parallel partition-wise aggregation 有权
    并行分区聚合

    公开(公告)号:US20060182046A1

    公开(公告)日:2006-08-17

    申请号:US11060260

    申请日:2005-02-16

    IPC分类号: H04L12/16 H04Q11/00

    CPC分类号: G06F9/4494

    摘要: Techniques are provided for performing a parallel aggregation operation on data that resides in a container, such as a relational table. During generation of the execution plan for the operation, it is determined whether partition-wise aggregation should be performed, based on the grouping keys involved in the aggregation and the partition keys used to partition the container. If partition-wise aggregation is to be performed, then the assignments given to the slave processes that are assigned to scan a container are made on a partition-wise basis. The scan slaves themselves may perform full or partial aggregation (depending on whether they are the only scan slaves assigned to the partition). If the scan slaves perform no aggregation, or only partial aggregation, then the scan slaves redistribute the data items to aggregation slaves that are local to the scan slaves.

    摘要翻译: 提供了用于对驻留在诸如关系表的容器中的数据执行并行聚合操作的技术。 在生成操作的执行计划时,基于聚合中涉及的分组密钥和用于分区容器的分区密钥,确定是否应该执行分区聚合。 如果要执行分区聚合,则分配给分配给扫描容器的从属进程的分配是以分区方式进行的。 扫描从站本身可以执行完全或部分聚合(取决于它们是否是分配给分区的唯一扫描从站)。 如果扫描从站不执行聚合或仅执行部分聚合,则扫描从站将数据项重新分配到扫描从站本地的聚合从站。

    Dynamic optimization for processing a restartable sub-tree of a query execution plan
    10.
    发明授权
    Dynamic optimization for processing a restartable sub-tree of a query execution plan 有权
    用于处理查询执行计划的可重新启动的子树的动态优化

    公开(公告)号:US07051034B1

    公开(公告)日:2006-05-23

    申请号:US10325782

    申请日:2002-12-18

    IPC分类号: G06F17/30

    摘要: Execution of a restartable sub-tree of a query execution plan comprises determining whether use of parallel processes is a preferred or optimal mode of executing the sub-tree. The determination is based, at least in part, on how long it takes to restart the sub-tree using two or more parallel processes and/or how long it takes to probe the sub-tree, i.e., to fetch a row that meets one or more conditions or correlations associated with the sub-query, using the two or more parallel processes. Thus, a dynamic computational cost-based operation is described, which determines at query runtime whether to execute the restartable sub-tree using a single server process or multiple parallel server processes.

    摘要翻译: 执行查询执行计划的可重新开始的子树包括确定并行进程的使用是否是执行子树的优选或最佳模式。 该决定至少部分地基于使用两个或更多个并行进程重新启动子树所需的时间和/或探测子树需要多长时间,即,获取满足一个 或更多与子查询相关联的条件或相关性,使用两个或多个并行进程。 因此,描述了基于动态计算成本的操作,其在查询运行时确定是否使用单个服务器进程或多个并行服务器进程来执行可重新启动的子树。