Dynamic physical database design
    11.
    发明授权
    Dynamic physical database design 有权
    动态物理数据库设计

    公开(公告)号:US07483918B2

    公开(公告)日:2009-01-27

    申请号:US10914901

    申请日:2004-08-10

    IPC分类号: G06F7/00

    摘要: A monitoring component of a database server collects a subset of a query workload along with related statistics. A remote index tuning component uses the workload subset and related statistics to determine a physical design that minimizes the cost of executing queries in the workload subset while ensuring that queries omitted from the subset do not degrade in performance.

    摘要翻译: 数据库服务器的监视组件收集查询工作负载的一部分以及相关统计信息。 远程索引调整组件使用工作负载子集和相关统计信息来确定最小化在工作负载子集中执行查询的成本的物理设计,同时确保从子集中省略的查询不会降低性能。

    Identifying indexes on materialized views for database workload
    12.
    发明授权
    Identifying indexes on materialized views for database workload 有权
    识别数据库工作负载的物化视图的索引

    公开(公告)号:US06356891B1

    公开(公告)日:2002-03-12

    申请号:US09629412

    申请日:2000-08-01

    IPC分类号: G06F1730

    摘要: An index and materialized view selection wizard produces a fast and reasonable recommendation for a configuration of indexes, materialized views, and indexes on materialized views which are beneficial given a specified workload for a given database and database server. Candidate materialized views and indexes are obtained, and a joint enumeration of the combined materialized views and indexes is performed to obtain a recommended configuration. The configuration includes indexes, materialized views and indexes on materialized views. Candidate materialized views are obtained by first determining subsets of tables are referenced in queries in the workload and then finding interesting table subsets. Next, interesting subsets are considered on a per query basis to determine which are syntactically relevant for a query. Materialized views which are likely to be used for the workload are then generated along with a set of merged materialized views. Clustered indexes and non-clustered indexes on materialized views are then generated. The indexes, materialized views and indexes on materialized views are then enumerated together to form the recommended configuration.

    摘要翻译: 索引和物化视图选择向导可以为物理视图的索引,物化视图和索引配置提供快速合理的建议,这对给定数据库和数据库服务器的指定工作负载是有益的。 获取候选物化视图和索引,并执行组合实例化视图和索引的联合枚举,以获得推荐的配置。 配置包括物化视图的索引,物化视图和索引。 通过首先确定表中的子集在工作负载中的查询中引用并且然后找到有趣的表子集来获得候选物化视图。 接下来,在每个查询的基础上考虑有趣的子集,以确定哪个在查询语法上相关。 可能用于工作负载的物化视图随同一组合并物化视图一起生成。 然后生成物化视图上的聚簇索引和非聚集索引。 然后将物化视图的索引,物化视图和索引列在一起以形成推荐的配置。

    Index tuner for given workload
    13.
    发明授权
    Index tuner for given workload 有权
    索引调谐器用于给定的工作负载

    公开(公告)号:US06266658B1

    公开(公告)日:2001-07-24

    申请号:US09553070

    申请日:2000-04-20

    IPC分类号: G06F1730

    摘要: An index tuning wizard produces a fast and reasonable recommendation identifying database indexes to use given a specified workload. A query optimizer is used to determine the expected usefulness of potential indexes for the specified workload by taking cost of queries in the workload into account. A cost based pruning of indexes is then performed to provide an intermediate set of proposed indexes. Indexes having most benefit based on storage constraints are then selected. The optimizer is then used again, and further pruning is done on a benefits basis. An index is not recommended unless it has a significant impact on the workload.

    摘要翻译: 索引调整向导会产生一个快速合理的建议,用于标识在指定工作负载下使用的数据库索引。 查询优化器用于通过考虑工作负载中的查询成本来确定指定工作负载的潜在索引的预期有用性。 然后执行索引的基于成本的修剪以提供提出的索引的中间集合。 然后选择基于存储约束最有利的索引。 然后,优化器再次被使用,并且进一步修剪是在有益的基础上进行的。 不建议使用索引,除非它对工作负载有重大影响。

    APPLICATION AND DATABASE CONTEXT FOR DATABASE APPLICATION DEVELOPERS
    14.
    发明申请
    APPLICATION AND DATABASE CONTEXT FOR DATABASE APPLICATION DEVELOPERS 有权
    数据库应用开发者的应用和数据库背景

    公开(公告)号:US20090106746A1

    公开(公告)日:2009-04-23

    申请号:US11875616

    申请日:2007-10-19

    IPC分类号: G06F9/45

    摘要: Infrastructure for capturing and correlating application context and database context for tuning, profiling and debugging tasks. The infrastructure extends the DBMS and application profiling infrastructure making it easy for a developer to invoke and interact with a tool from inside the development environment. Three sources of information are employed when an application is executed: server tracing, data access layer tracing, and application tracing. The events obtained from each of these sources are written into a log file. An event log is generated on each machine that involves either an application process or the DBMS server process and the log file receives log traces from different processes on a machine to the same trace session. A post-processing step over the event log(s) correlates the application and database contexts. The output is a single view where both the application and database profile of each statement issued by the application are exposed.

    摘要翻译: 用于捕获和关联应用程序上下文和数据库上下文以进行调整,分析和调试任务的基础结构。 该基础架构扩展了DBMS和应用程序分析基础架构,使开发人员可以轻松地从开发环境内调用和交互工具。 执行应用程序时,将采用三种信息来源:服务器跟踪,数据访问层跟踪和应用程序跟踪。 从这些源中获取的事件将被写入日志文件。 在涉及应用程序进程或DBMS服务器进程的每台计算机上生成事件日志,并且日志文件从机器上的不同进程接收到相同跟踪会话的日志跟踪。 在事件日志之后的后处理步骤将应用程序和数据库上下文相关联。 输出是单个视图,其中应用程序和应用程序发出的每个语句的数据库配置文件都将公开。

    EXPLOITING EXECUTION FEEDBACK FOR OPTIMIZING CHOICE OF ACCESS METHODS
    15.
    发明申请
    EXPLOITING EXECUTION FEEDBACK FOR OPTIMIZING CHOICE OF ACCESS METHODS 审中-公开
    用于优化访问方法选择的执行反馈

    公开(公告)号:US20090094191A1

    公开(公告)日:2009-04-09

    申请号:US11868957

    申请日:2007-10-08

    IPC分类号: G06F17/30

    CPC分类号: G06F16/24542

    摘要: A proactive monitoring mechanism for correcting the choice of access methods (available query plans) for a given query, based on execution feedback from the same query. The mechanism exploits bypassing predicate short-circuiting inside the database server's predicate evaluation module to obtain expression cardinalities. The mechanism can also modify a plan to obtain expression cardinalities. These techniques are used judiciously by the query optimizer and/or a database administrator (DBA) so that the execution overheads are within acceptable limits.

    摘要翻译: 一种主动监视机制,用于根据同一查询的执行反馈来校正给定查询的访问方法(可用查询计划)的选择。 该机制利用绕过数据库服务器谓词评估模块内的谓词短路来获得表达式的基数。 该机制还可以修改计划以获得表达式的基数。 这些技术由查询优化器和/或数据库管理员(DBA)明智地使用,以便执行开销在可接受的限度内。

    Index and materialized view selection for a given workload
    16.
    发明授权
    Index and materialized view selection for a given workload 有权
    指定工作量的索引和物化视图选择

    公开(公告)号:US06366903B1

    公开(公告)日:2002-04-02

    申请号:US09553033

    申请日:2000-04-20

    IPC分类号: G06F1730

    摘要: An index and materialized view selection wizard produces a fast and reasonable recommendation of indexes and materialized views which are beneficial given a specified workload for a given database and database server. Candidate materialized views and indexes are obtained, and a joint enumeration of the combined materialized views and indexes is performed to obtain a recommended configuration. The configuration includes both indexes and materialized views. Candidate materialized views are obtained by first determining subsets of tables are referenced in queries in the workload and then finding interesting table subsets. Next, interesting subsets are considered on a per query basis to determine which are syntactically relevant for a query. Materialized views which are likely to be used for the workload are then generated along with a set of merged materialized views.

    摘要翻译: 索引和物化视图选择向导可以快速合理地推荐索引和物化视图,这对给定数据库和数据库服务器的指定工作负载是有益的。 获取候选物化视图和索引,并执行组合实例化视图和索引的联合枚举,以获得推荐的配置。 配置包括索引和物化视图。 通过首先确定表中的子集在工作负载中的查询中引用并且然后找到有趣的表子集来获得候选物化视图。 接下来,在每个查询的基础上考虑有趣的子集,以确定哪个在查询语法上相关。 可能用于工作负载的物化视图随同一组合并物化视图一起生成。

    Database aggregation query result estimator
    18.
    发明授权
    Database aggregation query result estimator 有权
    数据库聚合查询结果估计器

    公开(公告)号:US07293037B2

    公开(公告)日:2007-11-06

    申请号:US11246354

    申请日:2005-10-07

    IPC分类号: G06F17/30

    摘要: Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data is extrapolated and added to the aggregated outlier values to provide an estimate for each aggregation query. Outlier values are identified by selecting values outside of a selected sliding window of data having the lowest variance. An index is created for the outlier values. The outlier data is removed from the window of data, and separately aggregated. The remaining data without the outliers is then sampled to provide a statistically relevant sample that is then aggregated and extrapolated to provide an estimate for the remaining data. This sampled estimate is combined with the outlier aggregate to form an estimate for the entire set of data.

    摘要翻译: 通过首先识别异常值,聚合异常值和在修剪异常值之后对剩余数据进行采样来执行聚合查询。 采样数据被外推并加到聚合异常值中,以提供每个聚合查询的估计。 异常值通过选择具有最小方差的数据的所选滑动窗口之外的值来识别。 为异常值创建索引。 离群数据从数据窗口中移除,并单独汇总。 然后对没有异常值的剩余数据进行采样,以提供统计学上相关的样本,然后对其进行聚合和外插,以提供剩余数据的估计。 该采样估计与异常值聚合组合以形成整套数据的估计。

    Sampling for queries
    19.
    发明授权
    Sampling for queries 有权
    查询抽样

    公开(公告)号:US07287020B2

    公开(公告)日:2007-10-23

    申请号:US09759804

    申请日:2001-01-12

    IPC分类号: G06F17/30

    摘要: This disclosure describes leveraging workload information associated with executed database queries for estimating the result of a current database query. The workload information is analyzed to determine the usage of tuples in a database during query execution, such as how often a tuple is accessed and the number of different queries that accessed the tuple. A tuple is assigned a weight value that is based on the analyzed workload information. The particular tuples sampled for estimating a result for the current query is based on each tuple's weight value. The workload information may also be leveraged to generate an outlier index that identifies outlier tuples associated with the executed queries or that identifies outlier tuples associated with particular queries that are executed more frequently than other queries. The result for the current query can also be estimated using the sampled values along with the outlier tuples from the outlier index.

    摘要翻译: 本公开描述了利用与执行的数据库查询相关联的工作负载信息来估计当前数据库查询的结果。 分析工作负载信息以确定查询执行期间数据库中元组的使用情况,例如访问元组的频率以及访问元组的不同查询的数量。 一个元组被分配一个基于分析的工作量信息的权重值。 为当前查询估计结果而采样的特定元组基于每个元组的权重值。 还可以利用工作负载信息来生成异常值索引,该索引识别与执行的查询相关联的异常值元组,或者识别与其他查询更频繁执行的特定查询相关联的异常值元组。 当前查询的结果也可以使用采样值以及来自离群值索引的异常值元组来估计。

    Sampling for aggregation queries
    20.
    发明授权
    Sampling for aggregation queries 有权
    聚合查询的抽样

    公开(公告)号:US06842753B2

    公开(公告)日:2005-01-11

    申请号:US09759799

    申请日:2001-01-12

    IPC分类号: G06F17/30

    摘要: Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data is extrapolated and added to the aggregated outlier values to provide an estimate for each aggregation query. Outlier values are identified by selecting values outside of a selected sliding window of data having the lowest variance. An index is created for the outlier values. The outlier data is removed from the window of data, and separately aggregated. The remaining data without the outliers is then sampled in one of many known ways to provide a statistically relevant sample that is then aggregated and extrapolated to provide an estimate for the remaining data. This sampled estimate is combined with the outlier aggregate to form an estimate for the entire set of data. Further methods involve the use of weighted sampling and weighted selection of outlier values for low selectivity queries, or queries having group by.

    摘要翻译: 通过首先识别异常值,聚合异常值和在修剪异常值之后对剩余数据进行采样来执行聚合查询。 采样数据被外推并加到聚合异常值中,以提供每个聚合查询的估计。 异常值通过选择具有最小方差的数据的所选滑动窗口之外的值来识别。 为异常值创建索引。 离群数据从数据窗口中移除,并单独汇总。 然后以许多已知方式之一对剩余的没有异常值的数据进行采样,以提供统计学相关的样本,然后进行聚合和外推,以提供剩余数据的估计。 该采样估计与异常值聚合组合以形成整套数据的估计。 进一步的方法涉及对低选择性查询或具有分组查询的异常值的加权采样和加权选择。