RESOURCE OPTIMIZATION FOR SERVERLESS QUERY PROCESSING

    公开(公告)号:US20210096915A1

    公开(公告)日:2021-04-01

    申请号:US16697960

    申请日:2019-11-27

    摘要: A serverless query processing system receives a query and determines whether the query is a recurring query or a non-recurring query. The system may predict, in response to determining that the query is the recurring query, a peak resource requirement during an execution of the query. The system may compute, in response to determining that the query is the non-recurring query, a tight resource requirement corresponding to an amount of resources that satisfy a performance requirement over the execution of the query, where the tight resource requirement is less than the peak resource requirement. The system allocates resources to the query based on an applicable one of the peak resource requirement or the tight resource requirement. The system then starts the execution of the query using the resources.

    RESOURCE PROVISIONING
    3.
    发明公开

    公开(公告)号:US20240126604A1

    公开(公告)日:2024-04-18

    申请号:US18161679

    申请日:2023-01-30

    IPC分类号: G06F9/50

    CPC分类号: G06F9/5016 G06F9/505

    摘要: A system provisioning resources of a processing unit. The system predicts a performance impact on a workload attributable to a performance constraint of the processing unit for the workload according to a resource model, wherein the workload includes a query and the resource model characterizes attainable compute bandwidth, attainable memory bandwidth, and arithmetic intensity based on peak compute bandwidth and peak memory bandwidth of the processing unit. The system determines a resource allocation of the processing unit, based on the predicted performance impact and instructs the processing unit to allocate the resources for processing the workload based on the determined resource allocation.

    CLOUD BASED QUERY WORKLOAD OPTIMIZATION

    公开(公告)号:US20210089532A1

    公开(公告)日:2021-03-25

    申请号:US16581905

    申请日:2019-09-25

    IPC分类号: G06F16/2453

    摘要: The cloud-based query workload optimization system disclosed herein the cloud-based query workloads optimization system receives query logs from various query engines to a cloud data service, extracts various query entities from the query logs, parses query entities to generate a set of common workload features, generates intermediate representations of the query workloads, wherein the intermediate representations are agnostic to the language of the plurality of the queries, identifies a plurality of workload patterns based on the intermediate representations of the query workloads, categorizes the workloads in one or more workload type categories based on the workload patterns and the workload features, and selects an optimization scheme based on the category of workload pattern.

    PREDICTING RUNTIME VARIATION IN BIG DATA ANALYTICS

    公开(公告)号:US20230376800A1

    公开(公告)日:2023-11-23

    申请号:US17746245

    申请日:2022-05-17

    IPC分类号: G06N5/04 G06F9/48 G06N5/02

    CPC分类号: G06N5/04 G06F9/4843 G06N5/022

    摘要: Methods, systems and computer program products are provided for predicting runtime variation in big data analytics. Runtime probability distributions may be predicted for proposed computing jobs. A predictor may classify proposed computing jobs based on multiple runtime probability distributions that represent multiple clusters of runtime probability distributions for multiple executed recurring computing job groups. Proposed computing jobs may be classified as delta-normalized runtime probability distributions and/or a ratio-normalized runtime probability distributions. Sources of runtime variation may be identified with a quantitative contribution to predicted runtime variation. A runtime probability distribution editor may indicate modifications to sources of runtime variation in a proposed computing job and/or predict reductions in predicted runtime variation provided by modifications to a proposed computing job.

    RESOURCE OPTIMIZATION FOR SERVERLESS QUERY PROCESSING

    公开(公告)号:US20220413914A1

    公开(公告)日:2022-12-29

    申请号:US17894628

    申请日:2022-08-24

    摘要: A serverless query processing system receives a query and determines whether the query is a recurring query or a non-recurring query. The system may predict, in response to determining that the query is the recurring query, a peak resource requirement during an execution of the query. The system may compute, in response to determining that the query is the non-recurring query, a tight resource requirement corresponding to an amount of resources that satisfy a performance requirement over the execution of the query, where the tight resource requirement is less than the peak resource requirement. The system allocates resources to the query based on an applicable one of the peak resource requirement or the tight resource requirement. The system then starts the execution of the query using the resources.

    QUERY PROCESSING ON ACCELERATED PROCESSING UNITS

    公开(公告)号:US20240311380A1

    公开(公告)日:2024-09-19

    申请号:US18465764

    申请日:2023-09-12

    IPC分类号: G06F16/245 G06F16/2453

    CPC分类号: G06F16/24569 G06F16/24542

    摘要: Query processing systems and methods are disclosed herein. In an example system, query information is received over a network for processing a query. A first processing architecture loads a set of data associated with the query into a shared memory. A second processing architecture accesses the set of data from the shared memory. In one example, the first and second processing architectures and the shared memory are integrated in a hardware chip (e.g., a chiplet containing several processor architectures, such as CPU and a graphics processing unit (GPU)). The query is processed based on the set of data accessed from the shared memory using the second processing architecture to generate a query result. The query result is provided over the network. In this manner, a computing device may execute a query based on different processing systems contained therein.