Dynamic query optimization between systems based on system conditions
    1.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20090049024A1

    Publication date: 2009-02-19

    Application number: US11891919

    Filing date: 2007-08-14

    IPC classification: G06F17/30

    CPC classification: G06F16/2453

    Abstract: A computer-implemented system, method and article of manufacture for managing a plurality of database systems in a domain, wherein each of the database systems is managed by one or more closed-loop system management (CLSM) regulators. A virtual regulator manages the domain: it communicates with the CLSM regulators to monitor the database systems' system conditions and operating environment events within the domain, and it provides dynamic query optimization between the database systems within the domain based on those system conditions and operating environment events.
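
As a rough illustration of routing work by system conditions, the following sketch picks a target database system from condition reports. The `SystemCondition` fields, the `choose_system` helper, and the "lowest CPU utilization wins" rule are all assumptions made for this example; the abstract does not specify how the virtual regulator decides.

```python
from dataclasses import dataclass


@dataclass
class SystemCondition:
    """Hypothetical snapshot that a CLSM regulator might report for one system."""
    name: str
    cpu_utilization: float  # fraction between 0.0 and 1.0
    healthy: bool           # False after a disruptive operating environment event


def choose_system(conditions):
    """Return the name of the healthy system with the lowest CPU utilization."""
    candidates = [c for c in conditions if c.healthy]
    if not candidates:
        raise RuntimeError("no healthy database system in the domain")
    return min(candidates, key=lambda c: c.cpu_utilization).name


reports = [
    SystemCondition("system_a", 0.92, True),
    SystemCondition("system_b", 0.35, True),
    SystemCondition("system_c", 0.10, False),  # idle, but currently unavailable
]
chosen = choose_system(reports)
print(chosen)
```

Here `system_c` is skipped despite being idle, because the assumed health flag marks it as unavailable.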

    CONVERTING TEMPORAL DATA INTO TIME SERIES DATA
    2.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20080147603A1

    Publication date: 2008-06-19

    Application number: US11610534

    Filing date: 2006-12-14

    IPC classification: G06F7/06

    CPC classification: G06F7/10, G06F16/245

    Abstract: A method and system for extracting time series data from temporal data in a database table are disclosed. The method comprises receiving a request for time series data, the request indicating the period of time to be covered, the values sought, the time instants at which the values are sought, and the precision between consecutive time instants. A temporary table is created and populated with a plurality of time stamps covering the requested period; consecutive time stamps are separated by the requested precision, and the time stamps specify the requested time instants. The method also comprises querying the database table to identify, for each time stamp in the temporary table, a row in the database table whose temporal data covers the time stamp, extracting the values specified in the request from the identified rows, and associating the extracted data with the appropriate time stamps.
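
The steps in this abstract can be sketched in a few lines. This is a minimal in-memory illustration, not the patented implementation: the row layout (`valid_from`, `valid_to` as a half-open validity period) and the function name `to_time_series` are assumptions, and a Python list stands in for the temporary table.

```python
from datetime import datetime, timedelta


def to_time_series(rows, start, end, precision, column):
    """Sample `column` at regular time stamps from rows with validity periods.

    Each row is a dict with 'valid_from' (inclusive) and 'valid_to' (exclusive).
    """
    # Step 1: build the "temporary table" of time stamps covering the period.
    stamps = []
    t = start
    while t <= end:
        stamps.append(t)
        t += precision

    # Step 2: for each time stamp, find a row whose period covers it
    # and attach the requested value to the stamp.
    series = []
    for ts in stamps:
        match = next(
            (r for r in rows if r["valid_from"] <= ts < r["valid_to"]), None
        )
        series.append((ts, match[column] if match else None))
    return series


rows = [
    {"valid_from": datetime(2024, 1, 1), "valid_to": datetime(2024, 1, 3), "price": 10},
    {"valid_from": datetime(2024, 1, 3), "valid_to": datetime(2024, 1, 6), "price": 12},
]
series = to_time_series(
    rows, datetime(2024, 1, 1), datetime(2024, 1, 4), timedelta(days=1), "price"
)
for stamp, value in series:
    print(stamp.date(), value)
```

Time stamps that no row covers are reported with `None` in this sketch; how the patented method treats such gaps is not stated in the abstract.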

    RUNTIME RESOURCE SENSITIVE AND DATA DRIVEN OPTIMIZATION
    3.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20080147593A1

    Publication date: 2008-06-19

    Application number: US11610544

    Filing date: 2006-12-14

    IPC classification: G06F7/06

    CPC classification: G06F16/24524

    Abstract: A technique for generating two or more execution plans for an SQL query within a database system that has two or more resources. A first resource utilization profile is defined by associating a first set of numerical utilization values respectively with two or more of the resources. The utilization values represent utilization of the resources. A first execution plan is generated that is optimal assuming the resource utilization specified in the first resource utilization profile. The technique defines at least one further resource utilization profile by associating at least one further set of numerical utilization values respectively with two or more of the resources, the further utilization values representing utilization of the resources. At least one further execution plan is generated that is optimal assuming the resource utilization specified in the further resource utilization profile(s).
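
To make the idea of "one plan per utilization profile" concrete, here is a small sketch. The candidate plans, their resource demands, and the cost formula (demand divided by remaining capacity) are invented for illustration; the abstract does not describe the cost model actually used.

```python
# Hypothetical candidate plans: units of each resource the plan consumes.
CANDIDATE_PLANS = {
    "hash_join": {"cpu": 8.0, "io": 2.0},
    "index_nested_loop": {"cpu": 2.0, "io": 6.0},
}


def plan_cost(demand, profile):
    """Assumed cost model: demand on a busy resource is penalized more heavily."""
    return sum(units / (1.0 - profile[res]) for res, units in demand.items())


def best_plan(profile):
    """Return the cheapest candidate plan under one resource utilization profile."""
    return min(
        CANDIDATE_PLANS, key=lambda name: plan_cost(CANDIDATE_PLANS[name], profile)
    )


# Two utilization profiles: numerical utilization values per resource.
profiles = {
    "cpu_busy": {"cpu": 0.9, "io": 0.2},
    "io_busy": {"cpu": 0.2, "io": 0.9},
}
plans = {name: best_plan(profile) for name, profile in profiles.items()}
print(plans)
```

With plans prepared in advance for each profile, a runtime component could select whichever plan matches the utilization it currently observes.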

    Statistical representation of skewed data
    4.
    Type: Granted invention patent
    Status: Granted (in force)

    Publication number: US07386536B1

    Publication date: 2008-06-10

    Application number: US10751016

    Filing date: 2003-12-31

    IPC classification: G06F7/00, G06F17/30

    Abstract: A method, database system, and computer program for collecting statistics about a table are disclosed. The table includes one or more rows and each row includes a respective value. The method includes creating zero or more histogram buckets. Each histogram bucket includes a width representing a respective range of values and a height representing a count of rows having values in that range. The method further includes creating one or more high-bias buckets, each of which represents one or more values that appear in a minimum percentage of rows.
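
A small sketch of the two bucket kinds follows. The threshold `min_fraction`, the equal-height bucketing of the remaining values, and the function name `collect_statistics` are assumptions for this example; the abstract only states that high-bias buckets hold values meeting a minimum percentage of rows.

```python
from collections import Counter


def collect_statistics(values, min_fraction=0.2, num_buckets=2):
    """Split column values into high-bias buckets and ordinary histogram buckets."""
    counts = Counter(values)
    total = len(values)

    # High-bias buckets: frequent values are recorded exactly, with their counts.
    high_bias = {v: c for v, c in counts.items() if c / total >= min_fraction}

    # Ordinary buckets: remaining values, summarized as (range, row count).
    rest = sorted(v for v in values if v not in high_bias)
    buckets = []
    if rest:
        size = -(-len(rest) // num_buckets)  # ceiling division
        for i in range(0, len(rest), size):
            chunk = rest[i:i + size]
            buckets.append({"low": chunk[0], "high": chunk[-1], "rows": len(chunk)})
    return high_bias, buckets


values = [7] * 6 + [1, 2, 3, 4]  # the value 7 is heavily skewed
high_bias, buckets = collect_statistics(values)
print(high_bias)
print(buckets)
```

If every value is frequent enough to be high-bias, `buckets` is empty, which matches the "zero or more histogram buckets" wording.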

    System, method and computer-readable medium for dynamic skew avoidance for generic queries
    5.
    Type: Granted invention patent
    Status: Granted (in force)

    Publication number: US08832074B2

    Publication date: 2014-09-09

    Application number: US12582734

    Filing date: 2009-10-21

    IPC classification: G06F17/30

    CPC classification: G06F17/30445, G06F17/30545

    Abstract: A system, method, and computer-readable medium that facilitate dynamic skew avoidance are provided. The disclosed mechanisms advantageously do not require any statistical information regarding which values are skewed in a column on which a query is applied. Query selectivity is evaluated at a checkpoint, which facilitates accurate detection of an overloaded processing module. The successful detection of an overloaded processing module causes other processing modules to stop sending more skewed rows to the overloaded processing module. A processing module is detected as overloaded when it has received more rows than a target number of rows. Further, skewed rows that are maintained locally rather than redistributed to a detected processing module may result in more processing modules becoming overloaded. Advantageously, the disclosed mechanisms provide for a final redistribution adjustment to provide for even distribution of rows among all processing modules.
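
The detection rule in this abstract (a module is overloaded once it has received more than a target number of rows) can be simulated as below. This is a simplified, sequential sketch: `key % num_modules` stands in for the hash function, the function name `redistribute` is hypothetical, and the final redistribution adjustment mentioned in the abstract is deliberately left out.

```python
def redistribute(rows_by_sender, num_modules, target_rows):
    """Simulate hash redistribution with overload detection.

    rows_by_sender[i] lists the join-key values held by processing module i.
    """
    received = [0] * num_modules      # rows each module has accepted
    kept_locally = [0] * num_modules  # skewed rows each sender held back
    overloaded = set()

    for sender, keys in enumerate(rows_by_sender):
        for key in keys:
            dest = key % num_modules  # stand-in for the real hash function
            if dest in overloaded and dest != sender:
                # Stop sending skewed rows to the overloaded module.
                kept_locally[sender] += 1
                continue
            received[dest] += 1
            if received[dest] > target_rows:
                overloaded.add(dest)
    return received, kept_locally, overloaded


# Key 0 is skewed: almost every row hashes to module 0.
rows_by_sender = [[0, 0, 0], [0, 0, 0], [0, 0, 0, 1]]
received, kept_locally, overloaded = redistribute(
    rows_by_sender, num_modules=3, target_rows=4
)
print(received, kept_locally, overloaded)
```

After module 0 exceeds the target, the remaining skewed rows stay with their senders, which is why a later rebalancing step would still be needed to even out the load.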

    Techniques for data assignment from an external distributed file system to a database management system
    6.
    Type: Granted invention patent
    Status: Granted (in force)

    Publication number: US08713057B2

    Publication date: 2014-04-29

    Application number: US13340335

    Filing date: 2011-12-29

    IPC classification: G06F17/30

    Abstract: Techniques for data assignment from an external distributed file system (DFS) to a database management system (DBMS) are provided. Data blocks from the DFS are represented as first nodes and access module processors (AMPs) of the DBMS are represented as second nodes. A graph is produced with the first and second nodes. Assignments are made for the first nodes to the second nodes based on evaluation of the graph to integrate the DFS with the DBMS.
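
One way to picture the block-to-AMP graph is as a bipartite graph whose edges mark which AMPs can read a block cheaply. The sketch below assigns blocks with a simple greedy, load-balancing heuristic. The edge semantics, the heuristic, and the name `assign_blocks` are assumptions; the abstract does not say how the graph is evaluated.

```python
def assign_blocks(edges, amps):
    """Greedily assign each DFS block to one AMP.

    edges maps a block (first node) to the AMPs (second nodes) it is
    connected to, e.g. AMPs assumed to have cheap access to that block.
    """
    load = {amp: 0 for amp in amps}
    assignment = {}
    # Handle the most constrained blocks (fewest candidate AMPs) first.
    for block in sorted(edges, key=lambda b: (len(edges[b]), b)):
        candidates = edges[block] or amps  # unconnected blocks may go anywhere
        amp = min(candidates, key=lambda a: (load[a], a))
        assignment[block] = amp
        load[amp] += 1
    return assignment


edges = {
    "b1": ["amp0"],
    "b2": ["amp0", "amp1"],
    "b3": ["amp0", "amp1"],
    "b4": ["amp1"],
}
assignment = assign_blocks(edges, ["amp0", "amp1"])
print(assignment)
```

A maximum-flow or matching formulation over the same graph would be a natural alternative when an optimal rather than greedy assignment is wanted.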

    TECHNIQUES FOR ACCESSING A PARALLEL DATABASE SYSTEM VIA EXTERNAL PROGRAMS USING VERTICAL AND/OR HORIZONTAL PARTITIONING
    7.
    Type: Invention application
    Status: Granted (in force)

    Publication number: US20130173594A1

    Publication date: 2013-07-04

    Application number: US13340324

    Filing date: 2011-12-29

    IPC classification: G06F17/30

    CPC classification: G06F17/30424, G06F17/30584

    Abstract: Techniques for accessing a parallel database system via an external program using vertical and/or horizontal partitioning are provided. A program external to a database management system (DBMS) configures external mappers, each of which is to process a specific portion of query results on specific access module processors (AMPs) of the DBMS that are to house the query results. The query is submitted by the external program to the DBMS and the DBMS is directed to organize the query results in a vertical or horizontal manner. Each external mapper accesses its portion of the query results on its designated AMP or set of AMPs, so that the query results are processed in parallel.
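
The difference between the two result layouts can be shown on a tiny result set. In this sketch, round-robin row slicing and explicit column groups are assumptions chosen for simplicity; the abstract does not describe how the DBMS actually divides results among AMPs.

```python
def horizontal_partition(rows, num_mappers):
    """Each mapper receives a subset of the rows, with all columns."""
    return [rows[i::num_mappers] for i in range(num_mappers)]


def vertical_partition(rows, column_groups):
    """Each mapper receives every row, restricted to its own columns."""
    return [
        [{col: row[col] for col in cols} for row in rows]
        for cols in column_groups
    ]


query_results = [
    {"id": 1, "a": 10, "b": 100},
    {"id": 2, "a": 20, "b": 200},
    {"id": 3, "a": 30, "b": 300},
]
by_rows = horizontal_partition(query_results, num_mappers=2)
by_cols = vertical_partition(query_results, [["id", "a"], ["id", "b"]])
print(by_rows)
print(by_cols)
```

In the vertical case the `id` column is repeated in both groups so that each mapper can still identify the row it is working on; that repetition is a choice made for this example.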

    SYSTEM, METHOD AND COMPUTER-READABLE MEDIUM FOR DYNAMIC SKEW AVOIDANCE FOR GENERIC QUERIES
    8.
    Type: Invention application
    Status: Granted (in force)

    Publication number: US20110093499A1

    Publication date: 2011-04-21

    Application number: US12582734

    Filing date: 2009-10-21

    IPC classification: G06F17/30

    CPC classification: G06F17/30445, G06F17/30545

    Abstract: A system, method, and computer-readable medium that facilitate dynamic skew avoidance are provided. The disclosed mechanisms advantageously do not require any statistical information regarding which values are skewed in a column on which a query is applied. Query selectivity is evaluated at a checkpoint, which facilitates accurate detection of an overloaded processing module. The successful detection of an overloaded processing module causes other processing modules to stop sending more skewed rows to the overloaded processing module. A processing module is detected as overloaded when it has received more rows than a target number of rows. Further, skewed rows that are maintained locally rather than redistributed to a detected processing module may result in more processing modules becoming overloaded. Advantageously, the disclosed mechanisms provide for a final redistribution adjustment to provide for even distribution of rows among all processing modules.

    SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR ELIMINATING UNNECESSARY SELF-JOINS IN A DATABASE SYSTEM
    9.
    Type: Invention application
    Status: Under examination (published)

    Publication number: US20100121836A1

    Publication date: 2010-05-13

    Application number: US12268491

    Filing date: 2008-11-11

    IPC classification: G06F17/30, G06F7/00

    CPC classification: G06F16/24544

    Abstract: A system, method, and computer-readable medium for optimizing query performance in a database system are provided. In one embodiment, join predicates of a self outer join are evaluated. If each join predicate is respectively based on a common join attribute, and each join attribute has a not-null constraint applied thereto, the self outer join may be rewritten as a self inner join. In another embodiment, if not-null and unique constraints are applied to each join attribute of an inner join featuring join predicates each respectively based on a common join attribute, the inner join may advantageously be removed, thereby resulting in a select operation.
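
The equivalence behind both rewrites can be checked on a toy table. This sketch uses Python's built-in `sqlite3` module with an invented `emp` table; it only demonstrates that the three query forms return the same rows when the join column is NOT NULL and UNIQUE, and does not reproduce the optimizer logic of the patent.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The join attribute `id` carries both NOT NULL and UNIQUE constraints.
conn.execute("CREATE TABLE emp (id INTEGER NOT NULL UNIQUE, name TEXT)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?)", [(1, "ann"), (2, "bob"), (3, "cy")]
)

# Original form: self outer join on a common attribute.
outer_join = (
    "SELECT a.id, b.name FROM emp a "
    "LEFT OUTER JOIN emp b ON a.id = b.id ORDER BY a.id"
)
# First rewrite: NOT NULL on the attribute lets the outer join become inner.
inner_join = (
    "SELECT a.id, b.name FROM emp a "
    "INNER JOIN emp b ON a.id = b.id ORDER BY a.id"
)
# Second rewrite: with UNIQUE as well, the join collapses to a plain select.
plain_select = "SELECT id, name FROM emp ORDER BY id"

results = [conn.execute(q).fetchall() for q in (outer_join, inner_join, plain_select)]
print(results[0] == results[1] == results[2])
conn.close()
```

Without the UNIQUE constraint a self inner join could multiply rows, and without NOT NULL the outer join could preserve rows that the inner join drops, so both constraints matter for the final step.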

    SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR REDUCING ROW REDISTRIBUTION COSTS FOR PARALLEL JOIN OPERATIONS
    10.
    Type: Invention application
    Status: Granted (in force)

    Publication number: US20100049722A1

    Publication date: 2010-02-25

    Application number: US12193814

    Filing date: 2008-08-19

    IPC classification: G06F17/30

    CPC classification: G06F17/30466, G06F17/30445

    Abstract: A system, method, and computer-readable medium for optimizing execution of a join operation in a parallel processing system are provided. A plurality of processing nodes that have at least one row of one or more tables involved in a join operation are identified. For each of the processing nodes, respective counts of rows that would be redistributed to each of the processing nodes based on join attributes of the rows are determined. A redistribution matrix is calculated from the row counts of each of the processing nodes. An optimized redistribution matrix is generated from the redistribution matrix, wherein the optimized redistribution matrix minimizes the number of rows to be redistributed among the nodes to execute the join operation.
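
One possible reading of the optimization is sketched below: entry `matrix[i][j]` counts the rows on node `i` that hash to partition `j`, and the optimizer chooses which node should process each partition so that as few rows as possible leave their current node. The brute-force search over permutations and the names `rows_moved` and `optimize_placement` are assumptions for illustration, practical only for a handful of nodes.

```python
from itertools import permutations


def rows_moved(matrix, placement):
    """Count rows that must travel if partition j is processed on placement[j]."""
    return sum(
        count
        for node, row in enumerate(matrix)
        for part, count in enumerate(row)
        if placement[part] != node
    )


def optimize_placement(matrix):
    """Try every partition-to-node placement and keep the cheapest one."""
    n = len(matrix)
    return min(permutations(range(n)), key=lambda p: rows_moved(matrix, p))


# matrix[i][j]: rows currently on node i whose join attribute hashes to partition j.
matrix = [
    [1, 9, 0],
    [0, 1, 9],
    [9, 0, 1],
]
default_placement = (0, 1, 2)  # partition j processed on node j
best_placement = optimize_placement(matrix)
print(rows_moved(matrix, default_placement), rows_moved(matrix, best_placement))
```

For larger systems the same objective could be posed as an assignment problem instead of enumerating permutations.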
