System, method, and computer program product for progressive query processing
    12.
    发明授权
    System, method, and computer program product for progressive query processing 有权
    系统,方法和计算机程序产品,用于逐行查询处理

    公开(公告)号:US07383246B2

    公开(公告)日:2008-06-03

    申请号:US10698828

    申请日:2003-10-31

    IPC分类号: G06F17/30

    摘要: A method, system, and computer program product to make query processing more robust in the face of optimization errors. The invention validates the statistics and assumptions used for compiling a query as the query is executed and, when necessary, progressively re-optimizes the query in mid-execution based on the knowledge learned during its partial execution. The invention selectively places a number of CHECK operators in a query execution plan to validate the optimizer's cardinality estimates against actual cardinalities. Errors beyond a threshold trigger re-optimization, and the optimizer decides whether the old plan is still optimal and whether to re-use previously computed results. The invention addresses arbitrary SQL queries whose plans can contain sub-queries, updates, trigger checking, and view maintenance operations. The invention can handle concurrent update transactions or updates of common sub-expressions in a query execution plan without compromising consistency and isolation as locking information is tied to the record ID.

    摘要翻译: 一种方法,系统和计算机程序产品,使查询处理在面对优化错误时更加鲁棒。 本发明验证了在执行查询时用于编译查询的统计和假设,并且在必要时,根据在其部分执行期间学到的知识,在中间执行中逐渐重新优化查询。 本发明选择性地将多个CHECK运算符放置在查询执行计划中以根据实际基数验证优化器的基数估计。 超出阈值的错误触发重新优化,优化器决定旧计划是否仍然最优,以及是否重新使用先前计算的结果。 本发明解决任意SQL查询,其计划可以包含子查询,更新,触发器检查和查看维护操作。 本发明可以处理查询执行计划中的并发更新事务或公用子表达式的更新,而不会在锁定信息与记录ID相关联的情况下不影响一致性和隔离。

    Information retrieval system and method using index ANDing for improving performance
    13.
    发明授权
    Information retrieval system and method using index ANDing for improving performance 失效
    信息检索系统和方法使用索引ANDing来提高性能

    公开(公告)号:US07111025B2

    公开(公告)日:2006-09-19

    申请号:US10427345

    申请日:2003-04-30

    IPC分类号: G06F17/30

    摘要: An information retrieval system is adapted to process a query having an associated plan that includes applying predicates comprising local predicates and a join predicate, to at least two tables. Index ANDing is used to construct a filter for filtering a first of the tables in accordance with the local predicates. The join predicate is applied to the tables for determining row identifiers of rows satisfying the join predicate. The row identifiers are used to probe the filter for determining whether the rows satisfying the join predicate also satisfy the local predicates. The rows that satisfy both the join and local predicates are fetched and the join predicate is completed. Results of the join predicate are then returned.

    摘要翻译: 信息检索系统适于处理具有相关联的计划的查询,其包括将包括本地谓词和连接谓词的谓词应用于至少两个表。 索引ANDing用于构造一个过滤器,用于根据本地谓词过滤第一个表。 连接谓词适用于确定满足连接谓词的行的行标识符的表。 行标识符用于探测过滤器,以确定满足连接谓词的行是否也满足本地谓词。 满足连接和本地谓词的行被取出,并且连接谓词已完成。 然后返回连接谓词的结果。

    Client-based index advisor
    14.
    发明授权
    Client-based index advisor 失效
    基于客户的指数顾问

    公开(公告)号:US08458167B2

    公开(公告)日:2013-06-04

    申请号:US12416208

    申请日:2009-04-01

    IPC分类号: G06F17/30

    摘要: A common interface to manage heterogeneous databases and develop enterprise class applications is provided. In particular, it is shown that a client-based system and method can: (a) provide a uniform interface for the DBA or the application developer to use across all the database deployments, (b) provide flexibility in the number and kinds of scenarios it can be used, and finally (c) reduce the total cost of ownership for the enterprise.

    摘要翻译: 提供了管理异构数据库和开发企业级应用程序的通用界面。 特别地,显示基于客户端的系统和方法可以:(a)为DBA或应用程序开发人员提供统一的接口,以便在所有数据库部署中使用,(b)在场景数量和种类方面提供灵活性 可以使用,最后(c)降低企业的总体拥有成本。

    System and method for automating data partitioning in a parallel database
    15.
    发明授权
    System and method for automating data partitioning in a parallel database 有权
    用于在并行数据库中自动化数据分区的系统和方法

    公开(公告)号:US07562090B2

    公开(公告)日:2009-07-14

    申请号:US10324362

    申请日:2002-12-19

    IPC分类号: G06F7/00 G06F12/00

    摘要: A system for automating data partitioning in a parallel database includes plural nodes connected in parallel. Each node includes a database server and two databases connected thereto. Each database server includes a query optimizer. Moreover, a partitioning advisor communicates with the database server and the query optimizer. The query optimizer and the partitioning advisor include a program for recommending and evaluating data table partitions that are useful for processing a workload of query statements. The data table partitions are recommended and evaluated without requiring the data tables to be physically repartitioned.

    摘要翻译: 用于在并行数据库中自动化数据划分的系统包括并行连接的多个节点。 每个节点包括一个数据库服务器和两个连接到其上的数据库。 每个数据库服务器都包含一个查询优化器。 此外,分区顾问与数据库服务器和查询优化器进行通信。 查询优化器和分区顾问程序包括一个用于推荐和评估对处理查询语句的工作负载有用的数据表分区的程序。 建议和评估数据表分区,而不需要物理重新分区数据表。

    AUTOMATICALLY AND ADAPTIVELY DETERMINING EXECUTION PLANS FOR QUERIES WITH PARAMETER MARKERS
    17.
    发明申请
    AUTOMATICALLY AND ADAPTIVELY DETERMINING EXECUTION PLANS FOR QUERIES WITH PARAMETER MARKERS 失效
    自动和自适应地确定具有参数标记的查询的执行计划

    公开(公告)号:US20080222093A1

    公开(公告)日:2008-09-11

    申请号:US12125221

    申请日:2008-05-22

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30469

    摘要: A method and system for automatically and adaptively determining query execution plans for parametric queries. A first classifier trained by an initial set of training points is generated. A query workload and/or database statistics are dynamically updated. A new set of training points is collected off-line. Using the new set of training points, the first classifier is modified into a second classifier. A database query is received at a runtime subsequent to the off-line phase. The query includes predicates having parameter markers bound to actual values. The predicates are associated with selectivities. A mapping of the selectivities into a plan determines the query execution plan. The determined query execution plan is included in an augmented set of training points, where the augmented set includes the initial set and the new set.

    摘要翻译: 一种用于自动和自适应地确定参数查询的查询执行计划的方法和系统。 产生由初始训练点训练的第一分类器。 动态更新查询工作负载和/或数据库统计信息。 离线收集了一套新的培训点。 使用新的一组训练点,第一个分类器被修改为第二个分类器。 在离线阶段之后的运行时间接收数据库查询。 该查询包括具有绑定到实际值的参数标记的谓词。 谓词与选择性相关联。 将选择性映射到计划中确定查询执行计划。 确定的查询执行计划被包括在增强的训练点集合中,其中增强集合包括初始集合和新集合。

    Fast and robust optimization of complex database queries
    18.
    发明授权
    Fast and robust optimization of complex database queries 有权
    复杂数据库查询的快速和可靠的优化

    公开(公告)号:US07076477B2

    公开(公告)日:2006-07-11

    申请号:US10248132

    申请日:2002-12-19

    IPC分类号: B06F17/30

    摘要: A robust way is described for optimizing complex data base queries while retaining the optimization speed of heuristic methods. The heuristic join-sequencing algorithm is modified to permit any of, or a combination of: (1) multiple passes of the heuristic algorithm, each with a different metric, producing multiple plans; (2) complex combinations of the criteria by which such heuristics make their choices; and/or (3) backtracking to consider alternatives to any particular decision in the sequence.

    摘要翻译: 描述了一种稳健的方法来优化复杂数据库查询,同时保留启发式方法的优化速度。 启发式连接排序算法被修改为允许以下任何或组合:(1)启发式算法的多遍,每个具有不同的度量,产生多个计划; (2)这种启发式做出选择的标准的复杂组合; 和/或(3)回溯以考虑序列中任何特定决定的替代方案。

    Efficient sampling of a relational database
    19.
    发明授权
    Efficient sampling of a relational database 有权
    关系数据库的有效抽样

    公开(公告)号:US06993516B2

    公开(公告)日:2006-01-31

    申请号:US10329892

    申请日:2002-12-26

    IPC分类号: G06F17/30

    摘要: A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.

    摘要翻译: 公开了一种用于从关系数据库采样数据的系统,方法和计算机可读介质,其中信息处理系统从用于采样的关系数据库中的表中选择行,其中数据值被排列成行,行被排列成页,并且 页面排列成表。 根据概率P选择页面进行采样,并根据概率R选择所选页面中的行进行采样,以便选择采样行的总体概率为Q = PR。 概率P和R基于从样本计算的估计精度以及处理速度。 概率P和R进一步基于关系数据库的目录统计数据或来自关系数据库的行的导频样本。

    Method for determining optimal database materializations using a query optimizer
    20.
    发明授权
    Method for determining optimal database materializations using a query optimizer 有权
    使用查询优化器确定最佳数据库实现的方法

    公开(公告)号:US06356889B1

    公开(公告)日:2002-03-12

    申请号:US09409323

    申请日:1999-09-30

    IPC分类号: G06F1730

    摘要: A method for determining optimal database materializations utilizing a query optimizer in a database management system. The method takes one or more queries as inputs and using the query optimizer in the database management system generates a series of virtual materializations by materializing some subsets of the database. The virtual materializations are used to consider the relative performance benefits, i.e. cost-benefits, for the queries based on the various virtual materializations. If the query optimizer decides to use any of the materializations in its plan, then those materializations are recommended to the user, or created automatically for the user.

    摘要翻译: 一种利用数据库管理系统中的查询优化器确定最佳数据库实现的方法。 该方法将一个或多个查询作为输入,并且使用数据库管理系统中的查询优化器通过实现数据库的某些子集来生成一系列虚拟实现。 虚拟实体被用于考虑基于各种虚拟实现的查询的相对性能优点,即成本效益。 如果查询优化程序决定在其计划中使用任何实现,那么这些实现被推荐给用户,或者为用户自动创建。