Cube indices for relational database management systems
    3.
    Granted invention patent
    Cube indices for relational database management systems (in force)

    Publication No.: US06560594B2

    Publication date: 2003-05-06

    Application No.: US09401474

    Filing date: 1999-09-22

    IPC classification: G06F700

    Abstract: A method, apparatus, and article of manufacture for optimizing database queries using subsumption tests between the query and at least one summary table that comprises a cube index to determine whether an expression in the query can be subsumed in the summary table. The summary table stores at least one materialized view involving at least one GROUP BY operation that computes at least one of the following: (1) a cube, (2) a rollup, (3) a grouping set, and (4) a concatenation of cubes, rollups, grouping sets, and one or more grouping items. When the expression in the query can be subsumed in the summary table, the query is rewritten to use the summary table.
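The grouping-set idea in the abstract can be illustrated with a small sketch. This is not the patented subsumption test: it only enumerates the grouping sets that a CUBE or ROLLUP produces and checks whether a query's grouping columns match one of them, or could be re-aggregated from a finer one. The function names (`cube`, `rollup`, `find_grouping_set`) and the re-aggregation fallback are assumptions made for illustration.

```python
from itertools import combinations


def cube(columns):
    """Grouping sets produced by GROUP BY CUBE(columns): every subset."""
    cols = tuple(columns)
    return {
        frozenset(combo)
        for size in range(len(cols) + 1)
        for combo in combinations(cols, size)
    }


def rollup(columns):
    """Grouping sets produced by GROUP BY ROLLUP(columns): every prefix."""
    cols = tuple(columns)
    return {frozenset(cols[:i]) for i in range(len(cols) + 1)}


def find_grouping_set(query_columns, summary_sets):
    """Hypothetical, simplified matching step.

    Returns the stored grouping set that could answer the query:
    an exact match if one exists, otherwise the smallest stored set that
    strictly contains the query columns (the query would then have to
    re-aggregate, which only works for aggregates such as SUM or COUNT).
    Returns None when no stored grouping set covers the query.
    """
    wanted = frozenset(query_columns)
    if wanted in summary_sets:
        return wanted
    finer = [s for s in summary_sets if wanted < s]
    return min(finer, key=len) if finer else None
```

For example, a summary table built with `rollup(["year", "region"])` stores the grouping sets (), (year), and (year, region). A query grouped by `year` matches directly, a query grouped by `region` would need re-aggregation from (year, region), and a query grouped by `product` cannot use the summary table at all.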


    Efficient sampling of a relational database
    5.
    Granted invention patent
    Efficient sampling of a relational database (in force)

    Publication No.: US06993516B2

    Publication date: 2006-01-31

    Application No.: US10329892

    Filing date: 2002-12-26

    IPC classification: G06F17/30

    Abstract: A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.
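The two-level selection described in the abstract can be sketched as follows. This is a minimal illustration, assuming a table is modelled as a list of pages and each page as a list of rows; the names `bilevel_sample` and `scale_up_sum` are hypothetical, and the sketch does not show how the patent derives P and R from catalog statistics or a pilot sample.

```python
import random


def bilevel_sample(pages, p, r, rng=None):
    """Keep each page with probability p, then each row of a kept page
    with probability r, so any given row is kept with probability q = p * r."""
    rng = rng or random.Random()
    sample = []
    for page in pages:
        if rng.random() < p:          # page-level selection
            for row in page:
                if rng.random() < r:  # row-level selection within the page
                    sample.append(row)
    return sample


def scale_up_sum(sample_values, p, r):
    """Estimate a table-wide SUM from sampled values by dividing by q = p * r.
    Assumes p > 0 and r > 0."""
    return sum(sample_values) / (p * r)
```

Setting `p = 1` reduces this to row-level sampling, and setting `r = 1` reduces it to page-level sampling; intermediate values trade the number of pages read against the statistical precision of the estimate.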
