Autonomous Column Selection for Columnar Cache

    公开(公告)号:US20230141891A1

    公开(公告)日:2023-05-11

    申请号:US17523520

    申请日:2021-11-10

    申请人: Google LLC

    摘要: Aspects of the disclosure are directed to generating cache configurations for caching data for a database. A database management system (DBMS) can search for column data to cache in a database cache to improve performance of the DBMS in resolving queries. Column data selection can be performed automatically and in the background of a deployed DBMS. Periodically, the DBMS can assess the performance benefit of having certain data cached in the database cache and select data for caching based on the assessed performance benefit. The DBMS can also determine the performance benefit of cached data when not cached, as well as select some portions of data to cache over others. The DBMS can also select data for caching based on different degrees of compression, to further improve query resolution performance.

    GROUP DETERMINATION BASED ON MULTI-TABLE DICTIONARY CODES

    公开(公告)号:US20190095486A1

    公开(公告)日:2019-03-28

    申请号:US15918132

    申请日:2018-03-12

    IPC分类号: G06F17/30

    摘要: Techniques related to group determination based on multi-table dictionary codes are disclosed. In some embodiments, one or more non-transitory storage media store a sequence of instructions which, when executed by one or more computing devices, cause performance of a method. The method comprises storing a fact table and a dimension table that share a domain dictionary. The fact table and the dimension table each have a column of encoded join keys that is decodable using the shared domain dictionary. A query may specify one or more row groups for the dimension table. To efficiently process the query, one or more group identifiers are assigned to the one or more row groups. Each row group corresponds to a different group identifier. This enables a code-to-group-identifier mapping to be generated. The code-to-group-identifier mapping correlates the encoded join keys to the one or more group identifiers.

    VISUALIZING DEVICE, VISUALIZING METHOD AND VISUALIZING PROGRAM

    公开(公告)号:US20170337259A1

    公开(公告)日:2017-11-23

    申请号:US15528197

    申请日:2015-08-26

    申请人: NEC Corporation

    IPC分类号: G06F17/30

    摘要: A classification axis allocating means 81 is a visualizing device that visualizes a classification model in which classification conditions for classifying classification target data are represented in a hierarchical structure, and allocates classification axes to respective dimensional axes of a multidimensional space in accordance with priority levels of the classification axes to be used for the classification conditions. A region splitting means 82 splits the dimensional axes, based on types of the allocated classification axes, and allocates classification target data to respective regions on the multidimensional space to be split in accordance with the split dimensional axes. A display means 83 displays the classification target data allocated to the respective regions of the multidimensional space.

    SYSTEM AND METHOD FOR BUILDING A DWARF DATA STRUCTURE

    公开(公告)号:US20170300516A1

    公开(公告)日:2017-10-19

    申请号:US15488856

    申请日:2017-04-17

    IPC分类号: G06F17/30

    摘要: Systems and methods for building a dwarf data structure with reduced size and improved query performance is disclosed. The system is configured to perform three major steps for reducing size of Dwarf data structure and improving query performance. In the first step, the system is configured to reducing the Dwarf data structure size by physical compression of the clustered node blocks of the Dwarf data structure when writing the nodes on a disk. In the second step, the system is configured to improve query performance by look-ahead reading, wherein an entire block of nodes is loaded into random access memory, as there is a very high probability of occurrence of the nodes required to be accessed from same block. In the third step, the system is configured to reduce the number of nodes/blocks read while serving range queries thereby improving query performance while retrieving data from Dwarf data structure.

    SYSTEM AND METHOD FOR SUPPORTING LARGE QUERIES IN A MULTIDIMENSIONAL DATABASE ENVIRONMENT

    公开(公告)号:US20170116313A1

    公开(公告)日:2017-04-27

    申请号:US15333051

    申请日:2016-10-24

    发明人: Alexey Roytman

    IPC分类号: G06F17/30

    摘要: In accordance with an embodiment, the system provides support for large queries in a multidimensional database computing environment. A kernel-based data structure, referred to herein as an odometer retriever, or odometer, that manages pointers to data blocks, contains control information, or otherwise operates as an array of arrays of pointers to stored members. When used with a dynamic flow, the approach enables the system to be used, for example to handle grid queries, Multidimensional Expressions (MDX) queries, or other types of queries in which the potential size of the query can be up to 264 bits.