DATA LOADING BASED ON WORKLOAD PREDICTIONS FOR IMPROVED PERFORMANCE OF CLOUD-BASED SYSTEMS

    公开(公告)号:US20240302990A1

    公开(公告)日:2024-09-12

    申请号:US18178613

    申请日:2023-03-06

    申请人: SAP SE

    IPC分类号: G06F3/06 G06N3/08

    摘要: Methods, systems, and computer-readable storage media for receiving a workload period, during which a workload is applied to a database system, providing a set of ML models based on historical data representative of historical executions of the workload over the workload period, each ML model configured to predict a cluster arrival rate curve (cARC), and during execution of the workload period and, for each timeslice of a plurality of timeslice of the workload period: providing a predicted cARC from each ML model, the predicted cARC representative of a predicted workload, determining column visiting times for each of a plurality of columns of each of a plurality of tables stored in the database system, generating a column list based on the column visiting times, and loading column data representative of columns included in the column list into low-latency memory prior to execution of a workload during the respective timeslice.

    AUTOMATIC INDEX CREATION FOR RELATIONAL DATABASE SYSTEMS

    公开(公告)号:US20230376482A1

    公开(公告)日:2023-11-23

    申请号:US17748140

    申请日:2022-05-19

    申请人: SAP SE

    发明人: Xiaotao Wang Jing He

    摘要: Methods, systems, and computer-readable storage media for automatic index creation for relational database systems. Query statements from a relational database are processed to generate query patterns from the query statements. Vectorization of the query patterns is performed to transform each query pattern into a numerical vector. A clustering algorithm is executed to cluster the numerical vectors into multiple clusters. Each cluster has a respective cluster center. A frequent query pattern is determined, for at least some of the multiple clusters, that corresponds to a respective cluster center. Active columns in the frequent query patterns are determined and a database index is automatically created for each active column that does not currently have a database index.

    MACHINE LEARNING DATABASE MEMORY USE PREDICTION AND ADAPTATION

    公开(公告)号:US20230376202A1

    公开(公告)日:2023-11-23

    申请号:US17748145

    申请日:2022-05-19

    申请人: SAP SE

    发明人: Xiaotao Wang Jing He

    IPC分类号: G06F3/06 G06N3/04

    摘要: Methods, systems, and computer-readable storage media for machine learning database memory use prediction and adaptation. An example method includes determining a sampling interval for an application for sampling memory use by a database for the application. A plurality of historical memory use samples of amounts of memory used by the database are determined for the application based on the sampling interval. The plurality of historical memory use samples are provided for training of a machine learning model to predict memory use for the application by the database for a future time period. A set of current memory use samples are provided to the machine learning model and a memory use prediction for the application for an upcoming time period is received from the machine learning model. A determination is made as to whether to extend memory of the database for the application based on the memory use prediction.