Techniques for evaluating query predicates during in-memory table scans

    公开(公告)号:US10216794B2

    公开(公告)日:2019-02-26

    申请号:US15945733

    申请日:2018-04-04

    Abstract: Techniques are described herein for filtering data from a table during an in-memory scan. Predicates are pushed to in-memory scan to avoid scanning unnecessary columnar units and reduce the overhead of decompressing, row stitching and distributing data during evaluation. Techniques are described herein for generating implied predicates that have conditions on single columns from complex predicates that have multiple conditions on the same column, which can be evaluated during an in-memory scan. Techniques are also described herein to reduce the overhead of a table scan for processing a join query. When redistributing a first table for performing a hash-join, the nodes performing an in-memory scan of the first table may create a filter that tracks unique values from the join key. Data from the second table is only processed and transferred to other nodes in the cluster if the values from the join key pass through the filter.

    OPTIMIZE WORKLOAD PERFORMANCE BY AUTOMATICALLY DISCOVERING AND IMPLEMENTING IN-MEMORY PERFORMANCE FEATURES

    公开(公告)号:US20240111772A1

    公开(公告)日:2024-04-04

    申请号:US18374852

    申请日:2023-09-29

    CPC classification number: G06F16/24561 G06F11/3414 G06F11/3419

    Abstract: Techniques are provided for optimizing workload performance by automatically discovering and implementing performance optimizations for in-memory units (IMUs). A system maintains a set of IMUs for processing database operations in a database. The system obtains a database workload information for the database system and filters the database workload information to identify database operations in the database workload information that may benefit from performance optimizations. The system analyzes the database operations to identify a set of performance optimizations and ranks the performance optimizations based on their potential benefit. The system selects a subset of the performance optimizations, based on their ranking, and generates new versions of IMUs that reflect the performance optimizations. The system performs verification tests on the new versions of IMUs and analyzes the tests to determine whether the new versions of IMUs yield expected performance benefits. The system then categorizes the new set of IMUs into a first set of IMUs to be retained and a second set of IMUs to be discarded. The system then makes the first set of IMUs available to the current workload and discards the second set of IMUs.

    Optimize workload performance by automatically discovering and implementing in-memory performance features

    公开(公告)号:US12229160B2

    公开(公告)日:2025-02-18

    申请号:US18374852

    申请日:2023-09-29

    Abstract: Techniques are provided for optimizing workload performance by automatically discovering and implementing performance optimizations for in-memory units (IMUs). A system maintains a set of IMUs for processing database operations in a database. The system obtains a database workload information for the database system and filters the database workload information to identify database operations in the database workload information that may benefit from performance optimizations. The system analyzes the database operations to identify a set of performance optimizations and ranks the performance optimizations based on their potential benefit. The system selects a subset of the performance optimizations, based on their ranking, and generates new versions of IMUs that reflect the performance optimizations. The system performs verification tests on the new versions of IMUs and analyzes the tests to determine whether the new versions of IMUs yield expected performance benefits. The system then categorizes the new set of IMUs into a first set of IMUs to be retained and a second set of IMUs to be discarded. The system then makes the first set of IMUs available to the current workload and discards the second set of IMUs.

    Selecting from OR-expansion states of a query

    公开(公告)号:US09870390B2

    公开(公告)日:2018-01-16

    申请号:US14183409

    申请日:2014-02-18

    CPC classification number: G06F17/30395 G06F17/30451

    Abstract: Methods, stored instructions, and computer systems are described for selecting an OR-expansion state of a query. A query processor accesses a certain query and a control parameter. The certain query comprises disjunctive predicates, and the control parameter indicates a certain procedure of a plurality of alternative stored procedures for expanding queries comprising disjunctive predicates into semantically equivalent candidate queries comprising combinations of subqueries. Based at least in part on the control parameter, the query processor selects the certain procedure to expand the certain query into one or more certain semantically equivalent candidate queries, each comprising a different combination of two or more subqueries. The query processor causes cost-based evaluation of subject queries, including the certain query and the one or more certain semantically equivalent candidate queries. The query processor then selects a query of the subject queries for execution or for further evaluation or transformation.

    Selecting From OR-Expansion States Of A Query
    6.
    发明申请
    Selecting From OR-Expansion States Of A Query 有权
    从查询的OR扩展状态中选择

    公开(公告)号:US20150234888A1

    公开(公告)日:2015-08-20

    申请号:US14183409

    申请日:2014-02-18

    CPC classification number: G06F17/30395 G06F17/30451

    Abstract: Methods, stored instructions, and computer systems are described for selecting an OR-expansion state of a query. A query processor accesses a certain query and a control parameter. The certain query comprises disjunctive predicates, and the control parameter indicates a certain procedure of a plurality of alternative stored procedures for expanding queries comprising disjunctive predicates into semantically equivalent candidate queries comprising combinations of subqueries. Based at least in part on the control parameter, the query processor selects the certain procedure to expand the certain query into one or more certain semantically equivalent candidate queries, each comprising a different combination of two or more subqueries. The query processor causes cost-based evaluation of subject queries, including the certain query and the one or more certain semantically equivalent candidate queries. The query processor then selects a query of the subject queries for execution or for further evaluation or transformation.

    Abstract translation: 描述了用于选择查询的扩展状态的方法,存储的指令和计算机系统。 查询处理器访问某个查询和控制参数。 该特定查询包括分离谓词,并且控制参数指示用于将包括分离谓词的查询扩展到包括子查询的组合的语义等价候选查询的多个备选存储过程的某个过程。 至少部分地基于控制参数,查询处理器选择某个过程以将特定查询扩展到一个或多个特定语义上等同的候选查询中,每个候选查询包括两个或多个子查询的不同组合。 查询处理器对主题查询进行基于成本的评估,包括特定查询和一个或多个特定语义上等效的候选查询。 然后,查询处理器选择主题查询的查询以供执行或用于进一步的评估或变换。

Patent Agency Ranking