Constructing database object workload summaries
    11.
    发明授权
    Constructing database object workload summaries 有权
    构建数据库对象工作量摘要

    公开(公告)号:US07299220B2

    公开(公告)日:2007-11-20

    申请号:US10815061

    申请日:2004-03-31

    IPC分类号: G06F17/30

    摘要: A database object summarization tool is provided that selects a subset of database objects subject to filtering constraints such as a partial order or optimization of some attribute. A dominance primitive filters out tuples that are dominated according to a partial order constraint by another tuple. A representation primitive selects a representative subset of tuples such than an optimization criteria is met.

    摘要翻译: 提供了一种数据库对象摘要工具,该工具选择受过滤约束(如某些属性的部分顺序或优化)的数据库对象的子集。 优势原语过滤掉由另一个元组根据部分顺序约束所主导的元组。 表示基元选择满足优化标准的元组的代表性子集。

    Sampling for queries
    12.
    发明申请
    Sampling for queries 有权
    查询抽样

    公开(公告)号:US20060085410A1

    公开(公告)日:2006-04-20

    申请号:US11296036

    申请日:2005-12-07

    IPC分类号: G06F17/30

    摘要: A method of estimating the Results of a database query are estimated by performing a sampling of weighted tuples in a database based on a probability of usage of tuples required in executing a workload. A probability is associated with each tuple sampled. And, can aggregate is computed over values in each sampled tuple while multiplying by the inverses of the probabilities associated with each tuple sampled.

    摘要翻译: 通过基于执行工作负载所需的元组的使用概率,对数据库中的加权元组进行抽样来估计估计数据库查询结果的方法。 每个元组采样的概率相关。 并且,可以在每个采样的元组中的值上计算可以聚合,同时乘以与每个元组采样相关联的概率的逆。

    Detecting estimation errors in dictinct page counts
    15.
    发明授权
    Detecting estimation errors in dictinct page counts 有权
    检测特定页数中的估计误差

    公开(公告)号:US07958114B2

    公开(公告)日:2011-06-07

    申请号:US12098178

    申请日:2008-04-04

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30306 G06Q30/0202

    摘要: A database server may be configured to compute distinct page counts of pages accessed to execute operands of respective queries. The queries may be executed against a table comprised of the pages and having an index managed by the database server. The distinct page counts may be obtained by counting, as a part of the executing of the queries, distinct pages accessed during the execution of the queries.

    摘要翻译: 数据库服务器可以被配置为计算被访问的页面的不同页面计数以执行各个查询的操作数。 可以针对由该页组成的表并且具有由数据库服务器管理的索引来执行查询。 独立页面计数可以通过在执行查询期间计数访问的不同页面作为执行查询的一部分来获得。

    Sampling for database systems
    16.
    发明授权
    Sampling for database systems 失效
    数据库系统的抽样

    公开(公告)号:US07567949B2

    公开(公告)日:2009-07-28

    申请号:US10238175

    申请日:2002-09-10

    IPC分类号: G06F17/30 G06F7/00

    摘要: A database server supports weighted and unweighted sampling of records or tuples in accordance with desired sampling semantics such as with replacement (WR), without replacement (WoR), or independent coin flips (CF) semantics, for example. The database server may perform such sampling sequentially not only to sample non-materialized records, such as those produced as a stream by a pipeline in a query tree for example, but also to sample records, whether materialized or not, in a single pass. The database server also supports sampling over a join of two relations of records or tuples without requiring the computation of the full join and without requiring the materialization of both relations and/or indexes on the join attribute values of both relations.

    摘要翻译: 数据库服务器根据期望的抽样语义(例如替换(WR),无替换(WoR)或独立硬币翻转(CF))语义支持对记录或元组进行加权和未加权采样。 数据库服务器可以顺序地执行这样的采样,以便例如非查询记录例如在查询树中由流水线生成的非实体记录,但是也可以在一次通过中对采样记录(无论是否实现)进行采样。 数据库服务器还支持对两个记录或元组关系的连接进行抽样,而不需要计算完整连接,而不需要在关系的连接属性值上实现关系和/或索引。

    Query progress estimation
    19.
    发明申请
    Query progress estimation 有权
    查询进度估计

    公开(公告)号:US20050222965A1

    公开(公告)日:2005-10-06

    申请号:US10813963

    申请日:2004-03-31

    IPC分类号: G06F7/00 G06F17/30

    摘要: A query progress indicator that provides an indication to a user of the progress of a query being executed on a database. The indication of the progress of the query allows the user to decide whether the query should be allowed to complete or should be aborted. One method that may be used to estimate the progress of a query that is being executed on a database defines a model of work performed during execution of a query. The total amount of work that will be performed during execution of the query is estimated according to the model. The amount of work performed at a given point during execution of the query is estimated according to the model. The progress of the query is estimated using the estimated amount of work at the given point in time and the estimated total amount of work. This estimated progress of query execution may be provided to the user.

    摘要翻译: 查询进度指示符,向用户提供在数据库上执行的查询的进度的指示。 查询进度的指示允许用户决定是否允许查询完成或应该被中止。 可用于估计在数据库上执行的查询的进度的一种方法定义了在查询执行过程中执行的工作模型。 根据模型估计执行查询期间执行的总工作量。 在执行查询期间在给定点执行的工作量根据模型进行估计。 查询的进度使用在给定时间点的估计工作量和估计的总工作量来估计。 可以向用户提供该估计的查询执行进度。

    Optimization based method for estimating the results of aggregate queries
    20.
    发明申请
    Optimization based method for estimating the results of aggregate queries 失效
    用于估计聚合查询结果的基于优化的方法

    公开(公告)号:US20050033759A1

    公开(公告)日:2005-02-10

    申请号:US10935803

    申请日:2004-09-08

    IPC分类号: G06F17/30 G06F17/00

    摘要: A method for estimating the result of a query on a database having data records arranged in tables. The database has an expected workload that includes a set of queries that can be executed on the database. An expected workload is derived comprising a set of queries that can be executed on the database. A sample is constructed by selecting data records for inclusion in the sample in a manner that minimizes an estimation error when the data records are acted upon by a query in the expected workload to provide an expected workload to provide an expected result. The query accesses the sample and is executed on the sample, returning an estimated query result. The expected workload can be constructed by specifying a degree of overlap between records selected by queries in the given workload and records selected by queries in the expected workload.

    摘要翻译: 一种用于估计具有以表格排列的数据记录的数据库的查询结果的方法。 数据库具有预期的工作负载,其中包括可在数据库上执行的一组查询。 导出预期的工作负载,包括可在数据库上执行的一组查询。 通过在以下方式选择数据记录来构建样本,以便在预期工作负载中通过查询对数据记录进行操作以最小化估计误差的方式,以提供预期的工作负载以提供预期结果的方式来选择包含在样本中的数据记录。 查询访问样本并在样本上执行,返回估计的查询结果。 可以通过指定给定工作负载中的查询选择的记录与预期工作负载中的查询所选择的记录之间的重叠程度来构建预期的工作负载。