TERM VECTOR MODELING OF DATABASE WORKLOADS

    公开(公告)号:US20220019585A1

    公开(公告)日:2022-01-20

    申请号:US16930157

    申请日:2020-07-15

    Abstract: Techniques for managing database workloads using similarity measures based on queries executed are described. Classical techniques from information retrieval are applied to the domain of database workload management. Specifically, the technique of using document term vectors to compute similarity measures are applied using the conceptual mapping of SQL workloads as “documents” composed of SQL queries as “terms.” The techniques include generating two or more sets of workloads with each workload representing a set of queries executed on at least one database. Based on the sets of workloads, workload term vectors are calculated that represent the set of queries executed on the database. Then, based on the calculated workload vectors, a similarity score is generated between the two or more sets of workloads.

    Term vector modeling of database workloads

    公开(公告)号:US11327969B2

    公开(公告)日:2022-05-10

    申请号:US16930157

    申请日:2020-07-15

    Abstract: Techniques for managing database workloads using similarity measures based on queries executed are described. Classical techniques from information retrieval are applied to the domain of database workload management. Specifically, the technique of using document term vectors to compute similarity measures are applied using the conceptual mapping of SQL workloads as “documents” composed of SQL queries as “terms.” The techniques include generating two or more sets of workloads with each workload representing a set of queries executed on at least one database. Based on the sets of workloads, workload term vectors are calculated that represent the set of queries executed on the database. Then, based on the calculated workload vectors, a similarity score is generated between the two or more sets of workloads.

    Estimating query execution performance using a sampled counter

    公开(公告)号:US12292886B2

    公开(公告)日:2025-05-06

    申请号:US17580071

    申请日:2022-01-20

    Abstract: Techniques are described herein for probabilistic monitoring of high-frequency, low-latency database queries. In some embodiments, a probabilistic query monitoring system periodically samples active database sessions. For example, the system may generate sample data every one second or at some other sampling rate for each database session that is currently active. The sample data may include a mapping between query identifiers to sample counter values that are extracted at different sample intervals. The system may then estimate performance metrics for the set of active database based on the counter values sampled across consecutive sample intervals.

Patent Agency Ranking