Dynamic query and step routing between systems tuned for different objectives
    1.
    发明授权
    Dynamic query and step routing between systems tuned for different objectives 有权
    针对不同目标的系统之间的动态查询和步骤路由

    公开(公告)号:US08392404B2

    公开(公告)日:2013-03-05

    申请号:US11985910

    申请日:2007-11-19

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30545 G06F17/30306

    摘要: A virtual regulator monitors and manages a plurality of database systems in a domain. Each of the database systems is tuned for a particular type of workload, and the virtual regulator or multiple virtual regulators running in parallel routes a set of one or more queries to a particular database system within the domain based on a cost function for each database system.

    摘要翻译: 虚拟调节器监视和管理域中的多个数据库系统。 每个数据库系统针对特定类型的工作负载进行调整,并且运行并行的虚拟调节器或多个虚拟调节器基于每个数据库系统的成本函数将一组一个或多个查询路由到域内的特定数据库系统 。

    Virtual data maintenance
    2.
    发明授权

    公开(公告)号:US08392461B2

    公开(公告)日:2013-03-05

    申请号:US11985909

    申请日:2007-11-19

    IPC分类号: G06F17/30 G06F17/00

    CPC分类号: G06F17/30306

    摘要: A computer-implemented apparatus, method, and article of manufacture manage a plurality of database systems and perform data maintenance tasks in a data warehouse system. A domain includes a plurality of database systems. A virtual regulator manages the domain, detects a request to invoke a data maintenance task on a first system in the domain, routes the data maintenance task, for execution, to a second system in the domain, and applies results from the data maintenance task (executed by the second system) to the first system.

    OPTIMIZING THE EXECUTION OF A QUERY IN A MULTI-DATABASE SYSTEM
    3.
    发明申请
    OPTIMIZING THE EXECUTION OF A QUERY IN A MULTI-DATABASE SYSTEM 有权
    优化多数据库系统中查询的执行

    公开(公告)号:US20120265751A1

    公开(公告)日:2012-10-18

    申请号:US13271867

    申请日:2011-10-12

    IPC分类号: G06F17/30

    摘要: Optimizing the execution of a query in a multi-database system includes identifying a region within a table, the table being referenced in the query. The region is stored on data-storage devices on first and second system databases in the multi-database system. A first access plan for the query is developed, the first access plan comprising accessing the version of the region stored on the first system database. A second access plan for the query is developed, the second access plan comprising accessing the version of the region stored on the second system database. A selection is made between the first access plan and the second access plan to execute the query. The query is executed using the selected access plan to produce a result.

    摘要翻译: 在多数据库系统中优化查询的执行包括识别表中的区域,该表在查询中引用。 该区域存储在多数据库系统中的第一和第二系统数据库上的数据存储设备上。 开发了用于查询的第一访问计划,第一访问计划包括访问存储在第一系统数据库上的区域的版本。 开发了用于查询的第二访问计划,第二访问计划包括访问存储在第二系统数据库上的区域的版本。 在第一访问计划和第二访问计划之间进行选择以执行查询。 使用所选访问计划执行查询以产生结果。

    Optimizing the execution of a query in a multi-database system
    4.
    发明授权
    Optimizing the execution of a query in a multi-database system 有权
    优化多数据库系统中查询的执行

    公开(公告)号:US08108382B1

    公开(公告)日:2012-01-31

    申请号:US12344794

    申请日:2008-12-29

    IPC分类号: G06F7/00 G06F17/30

    摘要: Optimizing the execution of a query in a multi-database system includes identifying a region within a table, the table being referenced in the query. The region is stored on a data-storage device on a first of the system databases in the multi-database system. The region is stored on a data-storage device on a second of the system databases in the multi-database system, the second system database being a different system database than the first system database. A first access plan for the query is developed, the first access plan comprising accessing the version of the region stored on the first system database. A second access plan for the query is developed, the second access plan comprising accessing the version of the region stored on the second system database. A selection is made between the first access plan and the second access plan to execute the query. The query is executed using the selected access plan to produce a result. The result is stored.

    摘要翻译: 在多数据库系统中优化查询的执行包括识别表中的区域,该表在查询中引用。 该区域存储在多数据库系统中的第一个系统数据库上的数据存储设备上。 该区域存储在多数据库系统中的第二系统数据库上的数据存储设备上,第二系统数据库是与第一系统数据库不同的系统数据库。 开发了用于查询的第一访问计划,第一访问计划包括访问存储在第一系统数据库上的区域的版本。 开发了用于查询的第二访问计划,第二访问计划包括访问存储在第二系统数据库上的区域的版本。 在第一访问计划和第二访问计划之间进行选择以执行查询。 使用所选访问计划执行查询以产生结果。 结果存储。

    SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR SPOOL CACHE MANAGEMENT
    5.
    发明申请
    SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR SPOOL CACHE MANAGEMENT 有权
    系统,方法和计算机可读介质,用于SPOOL CACHE管理

    公开(公告)号:US20100115205A1

    公开(公告)日:2010-05-06

    申请号:US12263516

    申请日:2008-11-03

    IPC分类号: G06F12/08

    摘要: A system, method, and computer-readable medium that facilitate efficient use of cache memory in a massively parallel processing system are provided. A residency time of a data block to be stored in cache memory or a disk drive is estimated. A metric is calculated for the data block as a function of the residency time. The metric may further be calculated as a function of the data block size. One or more data blocks stored in cache memory are evaluated by comparing a respective metric of the one or more data blocks with the metric of the data block to be stored. A determination is then made to either store the data block on the disk drive or flush the one or more data blocks from the cache memory and store the data block in the cache memory. In this manner, the cache memory may be more efficiently utilized by storing smaller data blocks with lesser residency times by flushing larger data blocks with significant residency times from the cache memory. The disclosed cache management mechanisms are effective for many workloads and are adaptable to various database usage scenarios without requiring detailed studies of the particular data demographics and workload.

    摘要翻译: 提供了一种有助于在大规模并行处理系统中有效使用高速缓冲存储器的系统,方法和计算机可读介质。 估计要存储在高速缓冲存储器或磁盘驱动器中的数据块的驻留时间。 根据居住时间的函数计算数据块的度量。 该度量还可以作为数据块大小的函数来计算。 通过将一个或多个数据块的相应度量与要存储的数据块的度量进行比较来评估存储在高速缓冲存储器中的一个或多个数据块。 然后确定将数据块存储在磁盘驱动器上或从高速缓存存储器刷新一个或多个数据块,并将数据块存储在高速缓冲存储器中。 以这种方式,通过用来自高速缓冲存储器的大量驻留时间冲洗较大的数据块,可以通过存储具有较小驻留时间的较小数据块来更有效地利用高速缓冲存储器。 公开的缓存管理机制对于许多工作负载是有效的,并且适用于各种数据库使用场景,而不需要详细研究特定数据人口统计特征和工作负载。

    System, method, and computer-readable medium for spool cache management
    6.
    发明授权
    System, method, and computer-readable medium for spool cache management 有权
    用于假脱机缓存管理的系统,方法和计算机可读介质

    公开(公告)号:US08601213B2

    公开(公告)日:2013-12-03

    申请号:US12263516

    申请日:2008-11-03

    IPC分类号: G06F12/12

    摘要: A system, method, and computer-readable medium that facilitate efficient use of cache memory in a massively parallel processing system are provided. A residency time of a data block to be stored in cache memory or a disk drive is estimated. A metric is calculated for the data block as a function of the residency time. The metric may further be calculated as a function of the data block size. One or more data blocks stored in cache memory are evaluated by comparing a respective metric of the one or more data blocks with the metric of the data block to be stored. A determination is then made to either store the data block on the disk drive or flush the one or more data blocks from the cache memory and store the data block in the cache memory. In this manner, the cache memory may be more efficiently utilized by storing smaller data blocks with lesser residency times by flushing larger data blocks with significant residency times from the cache memory. The disclosed cache management mechanisms are effective for many workloads and are adaptable to various database usage scenarios without requiring detailed studies of the particular data demographics and workload.

    摘要翻译: 提供了一种有助于在大规模并行处理系统中有效使用高速缓冲存储器的系统,方法和计算机可读介质。 估计要存储在高速缓冲存储器或磁盘驱动器中的数据块的驻留时间。 根据居住时间的函数计算数据块的度量。 该度量还可以作为数据块大小的函数来计算。 通过将一个或多个数据块的相应度量与要存储的数据块的度量进行比较来评估存储在高速缓冲存储器中的一个或多个数据块。 然后确定将数据块存储在磁盘驱动器上或从高速缓存存储器刷新一个或多个数据块,并将数据块存储在高速缓冲存储器中。 以这种方式,通过用来自高速缓冲存储器的大量驻留时间冲洗较大的数据块,可以通过存储具有较小驻留时间的较小数据块来更有效地利用高速缓冲存储器。 公开的缓存管理机制对于许多工作负载是有效的,并且适用于各种数据库使用场景,而不需要详细研究特定数据人口统计特征和工作负载。

    Virtual data maintenance
    7.
    发明申请
    Virtual data maintenance 有权
    虚拟数据维护

    公开(公告)号:US20090132602A1

    公开(公告)日:2009-05-21

    申请号:US11985909

    申请日:2007-11-19

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30306

    摘要: A computer-implemented apparatus, method, and article of manufacture manage a plurality of database systems and perform data maintenance tasks in a data warehouse system. A domain includes a plurality of database systems. A virtual regulator manages the domain, detects a request to invoke a data maintenance task on a first system in the domain, routes the data maintenance task, for execution, to a second system in the domain, and applies results from the data maintenance task (executed by the second system) to the first system.

    摘要翻译: 计算机实现的装置,方法和制品在数据仓库系统中管理多个数据库系统并执行数据维护任务。 域包括多个数据库系统。 虚拟调节器管理域,检测在域中的第一系统上调用数据维护任务的请求,将用于执行的数据维护任务路由到域中的第二系统,并且从数据维护任务( 由第二系统执行)到第一系统。

    Optimizing the execution of a query in a multi-database system
    8.
    发明授权
    Optimizing the execution of a query in a multi-database system 有权
    优化多数据库系统中查询的执行

    公开(公告)号:US08849795B2

    公开(公告)日:2014-09-30

    申请号:US13271867

    申请日:2011-10-12

    IPC分类号: G06F17/30

    摘要: Optimizing the execution of a query in a multi-database system includes identifying a region within a table, the table being referenced in the query. The region is stored on data-storage devices on first and second system databases in the multi-database system. A first access plan for the query is developed, the first access plan comprising accessing the version of the region stored on the first system database. A second access plan for the query is developed, the second access plan comprising accessing the version of the region stored on the second system database. A selection is made between the first access plan and the second access plan to execute the query. The query is executed using the selected access plan to produce a result.

    摘要翻译: 在多数据库系统中优化查询的执行包括识别表中的区域,该表在查询中引用。 该区域存储在多数据库系统中的第一和第二系统数据库上的数据存储设备上。 开发了用于查询的第一访问计划,第一访问计划包括访问存储在第一系统数据库上的区域的版本。 开发了用于查询的第二访问计划,第二访问计划包括访问存储在第二系统数据库上的区域的版本。 在第一访问计划和第二访问计划之间做出选择以执行查询。 使用所选访问计划执行查询以产生结果。

    Index selection in a multi-system database management system
    9.
    发明授权
    Index selection in a multi-system database management system 有权
    多系统数据库管理系统中的索引选择

    公开(公告)号:US08510290B1

    公开(公告)日:2013-08-13

    申请号:US12317808

    申请日:2008-12-30

    IPC分类号: G06F17/30

    摘要: A system includes a multi-system database management system having a plurality of database systems. An index selection subsystem combines sets of query information from respective ones of the plurality of database systems into a workload. The index selection subsystem then generates candidate indexes from the workload, and selects recommended indexes from the candidate indexes based on one or more criteria.

    摘要翻译: 系统包括具有多个数据库系统的多系统数据库管理系统。 索引选择子系统将来自多个数据库系统中的相应数据库系统的查询信息集合组合成工作负载。 索引选择子系统然后从工作负载生成候选索引,并根据一个或多个标准从候选索引中选择推荐的索引。

    Accurate and timely enforcement of system resource allocation rules
    10.
    发明授权
    Accurate and timely enforcement of system resource allocation rules 有权
    准确及时地执行系统资源分配规则

    公开(公告)号:US08762367B2

    公开(公告)日:2014-06-24

    申请号:US12329850

    申请日:2008-12-08

    IPC分类号: G06F17/30

    摘要: A computer-implemented method, apparatus and article of manufacture for optimizing a database query. A query execution plan for the database query is generated using estimated cost information; one or more steps of the query execution plan are executed to retrieve data from a database stored on the computer system. Actual cost information is generated for each of the executed steps, and the estimated cost information is re-calculated using the actual cost information. One or more resource allocation rules defined on one or more steps of the query execution plan are executed, based on the estimated cost information, wherein the resource allocation rules include one or more defined actions. The estimated cost information may be re-calculated using the actual cost information when confidence in the estimated cost information is low, but the estimated cost information may not be re-calculated when confidence in the estimated cost information is high. In addition, the estimated cost information may be re-calculated using the actual cost information, only when the step has one or more resource allocation rules defined thereon.

    摘要翻译: 用于优化数据库查询的计算机实现的方法,装置和制品。 使用估计的成本信息生成数据库查询的查询执行计划; 执行查询执行计划的一个或多个步骤以从存储在计算机系统上的数据库检索数据。 为每个执行步骤生成实际成本信息,并且使用实际成本信息重新计算估计成本信息。 基于估计成本信息,执行在查询执行计划的一个或多个步骤上定义的一个或多个资源分配规则,其中所述资源分配规则包括一个或多个定义的动作。 当对估计成本信息的置信度低时,可以使用实际成本信息重新计算估计成本信息,但是当估计成本信息的置信度高时,估计成本信息可能不被重新计算。 此外,仅当步骤具有在其上定义了一个或多个资源分配规则时,可以使用实际成本信息重新计算估计成本信息。