REDISTRIBUTING TABLE DATA IN A DATABASE CLUSTER

    Publication No.: US20190163773A1

    Publication date: 2019-05-30

    Application No.: US15827660

    Filing date: 2017-11-30

    Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.
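
    The steps in this abstract can be illustrated with a minimal in-memory sketch. It is not the patented implementation: the `Table` class, the dictionary used as a catalog, the `("upsert" | "delete", key, row)` change format, and the rule of one concurrent change per copied tuple are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    columns: tuple
    rows: dict = field(default_factory=dict)  # primary key -> row tuple


def apply_change(table, change):
    """Apply one (op, key, row) modification to a table."""
    op, key, row = change
    if op == "delete":
        table.rows.pop(key, None)
    else:  # "upsert"
        table.rows[key] = row


def redistribute(catalog, name, concurrent_changes):
    """Copy catalog[name] into a new table while changes arrive, then swap."""
    first = catalog[name]
    second = Table(columns=first.columns)   # second table has all columns of the first
    snapshot = list(first.rows.items())     # the "first set of tuples" to copy
    delta_log = []                          # modifications seen during the copy

    changes = iter(concurrent_changes)
    for key, row in snapshot:
        second.rows[key] = row              # copy one tuple
        change = next(changes, None)        # simulate a writer running concurrently
        if change is not None:
            apply_change(first, change)     # the first table is modified right away
            delta_log.append(change)        # and the change is remembered

    for change in delta_log:                # after the copy, replay on the second table
        apply_change(second, change)

    catalog[name] = second                  # switch the tables in the catalog
    return second


catalog = {"t": Table(("id", "v"), {1: (1, "a"), 2: (2, "b"), 3: (3, "c")})}
old = catalog["t"]
new = redistribute(catalog, "t", [("upsert", 1, (1, "z")), ("delete", 3, None)])
```

    After the call, `catalog["t"]` refers to the new table, and both tables hold the same rows, because every change applied to the first table during the copy was replayed on the second before the switch.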

    Replicated database distribution for workload balancing after cluster reconfiguration

    Publication No.: US10102086B2

    Publication date: 2018-10-16

    Application No.: US14577629

    Filing date: 2014-12-19

    Inventor: Le Cai

    Abstract: A computer-implemented method for distributing a plurality of data partitions among a cluster of computer nodes includes dividing a database into a plurality of primary partitions, storing the primary partitions on a plurality of nodes associated with a computer network, duplicating the primary partitions to create a plurality of secondary partitions and storing the secondary partitions on the nodes, wherein the primary and secondary partitions are substantially evenly distributed among the nodes, each secondary partition is stored on a different node than its corresponding primary partition, diversity exists with respect to each secondary partition and the node where it is stored, the primary partitions are available to serve a system workload, and one or more secondary partitions corresponding to one or more primary partitions of a failed node are available to be upgraded to primary status to maintain workload balancing and data availability after one of the computer nodes fails.
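
    One way to picture the placement constraints in this abstract is the sketch below. The round-robin rule with a rotating offset is an assumption chosen for illustration, not the scheme claimed in the patent, and `place_partitions` and `promote_after_failure` are hypothetical names. "Diversity" is read here as: the secondaries of the primaries on one node are spread over several other nodes.

```python
from collections import Counter


def place_partitions(num_partitions, num_nodes):
    """Return {partition: (primary_node, secondary_node)}."""
    if num_nodes < 2:
        raise ValueError("need at least two nodes to separate primary and secondary")
    placement = {}
    for p in range(num_partitions):
        primary = p % num_nodes
        # The offset cycles through 1..num_nodes-1, so the secondary never
        # lands on the primary's node and successive rounds use different nodes.
        offset = 1 + (p // num_nodes) % (num_nodes - 1)
        secondary = (primary + offset) % num_nodes
        placement[p] = (primary, secondary)
    return placement


def promote_after_failure(placement, failed_node):
    """Return {partition: serving_node} once secondaries of the failed node are promoted."""
    return {
        p: (secondary if primary == failed_node else primary)
        for p, (primary, secondary) in placement.items()
    }


placement = place_partitions(12, 4)
before = Counter(primary for primary, _ in placement.values())
after = Counter(promote_after_failure(placement, failed_node=0).values())
```

    With 12 partitions on 4 nodes, each node starts with three primaries. If node 0 fails, its three partitions are taken over by three different nodes, so each surviving node serves four.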

    System and Method for Massively Parallel Processing Database

    Type: Patent application (status: in force)

    Publication No.: US20150293966A1

    Publication date: 2015-10-15

    Application No.: US14249558

    Filing date: 2014-04-10

    Inventors: Le Cai, Guogen Zhang

    Abstract: In one embodiment, a method of performing point-in-time recovery (PITR) in a massively parallel processing (MPP) database includes receiving, by a data node from a coordinator, a PITR recovery request and reading a log record of the MPP database. The method also includes determining a type of the log record and updating a transaction table when the type of the log record is an abort transaction or a commit transaction.
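
    The log-scanning step described in this abstract might look roughly like the sketch below. The record layout, the `RecordType` values, and the use of a log sequence number (`recovery_lsn`) as the recovery target are assumptions made for illustration; the abstract does not specify them.

```python
from dataclasses import dataclass
from enum import Enum


class RecordType(Enum):
    UPDATE = "update"
    COMMIT = "commit"
    ABORT = "abort"


@dataclass(frozen=True)
class LogRecord:
    lsn: int       # position of the record in the log (assumed)
    txn_id: int
    type: RecordType


def scan_log(records, recovery_lsn):
    """Build a transaction table from log records up to the recovery point."""
    txn_table = {}
    for rec in records:
        if rec.lsn > recovery_lsn:
            break  # records past the recovery point are ignored
        # Determine the record type; only commit and abort records
        # update the transaction table.
        if rec.type in (RecordType.COMMIT, RecordType.ABORT):
            txn_table[rec.txn_id] = rec.type
    return txn_table


log = [
    LogRecord(1, 10, RecordType.UPDATE),
    LogRecord(2, 11, RecordType.UPDATE),
    LogRecord(3, 10, RecordType.COMMIT),
    LogRecord(4, 11, RecordType.ABORT),
    LogRecord(5, 12, RecordType.UPDATE),
    LogRecord(6, 12, RecordType.COMMIT),
]
table_at_4 = scan_log(log, recovery_lsn=4)
```

    Recovering to position 4 leaves transaction 12 out of the table, since its commit record lies after the recovery point.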

    Redistributing table data in a database cluster

    Publication No.: US11151111B2

    Publication date: 2021-10-19

    Application No.: US15827660

    Filing date: 2017-11-30

    Abstract: A computer-implemented method of relocating data in a distributed database comprises: creating, by one or more processors, a second table in the distributed database, the second table including all columns from a first table; copying, by the one or more processors, a first set of tuples from the first table to the second table; modifying, by the one or more processors, during the copying of the first set of tuples, data of the first table according to a modification; after the copying of the first set of tuples, modifying, by the one or more processors, data of the second table according to the modification; and switching, by the one or more processors, the second table for the first table in a catalog of the distributed database.

    ASYNCHRONOUS CACHE COHERENCY FOR MVCC BASED DATABASE SYSTEMS

    Publication No.: US20200034472A1

    Publication date: 2020-01-30

    Application No.: US16047458

    Filing date: 2018-07-27

    Abstract: A database system comprises a persistent storage device, a log node including a memory and a processor, and a plurality of database nodes. A database node includes a cache memory configured to store a database instance, and a processor configured to initiate a database transaction by sending a snapshot request to the log node, the snapshot request including a list of pages that were either replaced or newly loaded in the cache memory. The log node processor is configured to send a snapshot response to the database node, wherein the snapshot response includes a snapshot of the database and a list of changed pages of the database instances. The database node processor is configured to update the status of the pages in cached memory according to the snapshot response and perform the database transaction.
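
    The snapshot request and response exchange in this abstract can be sketched as follows. This is a simplified, single-process model under several assumptions: versions are a global counter, the request carries separate `loaded` and `evicted` page lists, and page status is a plain `"valid"` or `"stale"` string. None of these details come from the patent text.

```python
class LogNode:
    def __init__(self):
        self.version = 0        # global change counter, used as the snapshot
        self.page_version = {}  # page id -> version of its last change
        self.nodes = {}         # node id -> (cached page ids, last snapshot sent)

    def record_change(self, page_id):
        self.version += 1
        self.page_version[page_id] = self.version

    def snapshot(self, node_id, loaded, evicted):
        cached, last_seen = self.nodes.get(node_id, (set(), 0))
        cached = (cached - set(evicted)) | set(loaded)
        # Conservative: any cached page changed since the node's previous
        # snapshot is reported, including pages it has only just loaded.
        changed = sorted(
            p for p in cached if self.page_version.get(p, 0) > last_seen
        )
        self.nodes[node_id] = (cached, self.version)
        return self.version, changed


class DatabaseNode:
    def __init__(self, node_id, log_node):
        self.node_id = node_id
        self.log_node = log_node
        self.cache = {}  # page id -> "valid" or "stale"
        self.loaded, self.evicted = [], []

    def load(self, page_id):
        self.cache[page_id] = "valid"
        self.loaded.append(page_id)

    def evict(self, page_id):
        self.cache.pop(page_id, None)
        self.evicted.append(page_id)

    def begin_transaction(self):
        # The snapshot request reports cache changes since the last request.
        snap, changed = self.log_node.snapshot(self.node_id, self.loaded, self.evicted)
        self.loaded, self.evicted = [], []
        for page_id in changed:
            if page_id in self.cache:
                self.cache[page_id] = "stale"  # must be refreshed before use
        return snap


log_node = LogNode()
node_a = DatabaseNode("a", log_node)
node_a.load(1)
node_a.load(2)
first_snapshot = node_a.begin_transaction()
log_node.record_change(2)  # another node modifies a page that node_a caches
log_node.record_change(7)  # and a page that node_a does not cache
second_snapshot = node_a.begin_transaction()
```

    The coherency work is asynchronous in the sense that node `a` learns about the change to page 2 only when it next starts a transaction, not when the change happens.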

    Balancing CPU Utilization In A Massively Parallel Processing Heterogeneous Cluster

    Publication No.: US20180352024A1

    Publication date: 2018-12-06

    Application No.: US15612747

    Filing date: 2017-06-02

    Abstract: A system (and associated method) includes a processor which determines a performance metric ratio of a performance metric of a second type of server relative to a performance metric of a first server type for each of multiple sizes of multiple benchmark datasets to thereby determine a plurality of performance metric ratio values. The processor also determines an interpolation function for the plurality of performance metric ratio values usable to compute interpolation performance metric ratios of the second type of server relative to the first type of server for dataset sizes other than the first plurality of sizes. Given a second dataset, the processor determines an amount of the second dataset to provide to each of the respective server types using the interpolation function. The processor configures a load balancer based on the amount of the second dataset determined for each type of server.
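
    A rough sketch of the interpolation and split described in this abstract follows. It assumes that the performance metric is a throughput (higher is better), that the interpolation is piecewise linear and clamped at the ends, and that the ratio is evaluated at the average per-server share of the dataset. The abstract fixes none of these choices, and the function names are hypothetical.

```python
from bisect import bisect_right


def interpolate(sizes, ratios, x):
    """Piecewise-linear interpolation of the ratio; sizes must be sorted ascending."""
    if x <= sizes[0]:
        return ratios[0]
    if x >= sizes[-1]:
        return ratios[-1]
    i = bisect_right(sizes, x)  # sizes[i - 1] <= x < sizes[i]
    x0, x1 = sizes[i - 1], sizes[i]
    y0, y1 = ratios[i - 1], ratios[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def split_dataset(total, n_type1, n_type2, sizes, ratios):
    """Return (amount for all type-1 servers, amount for all type-2 servers)."""
    per_server = total / (n_type1 + n_type2)  # assumed size at which to evaluate the ratio
    r = interpolate(sizes, ratios, per_server)  # type-2 throughput relative to type-1
    weight = n_type1 + r * n_type2              # capacity in type-1 server equivalents
    return total * n_type1 / weight, total * r * n_type2 / weight


# Benchmark results: ratio of type-2 to type-1 throughput at three dataset sizes.
bench_sizes = [100, 200, 400]
bench_ratios = [1.0, 2.0, 3.0]
amounts = split_dataset(600, n_type1=2, n_type2=1, sizes=bench_sizes, ratios=bench_ratios)
```

    In this example one type-2 server is estimated to be twice as fast as a type-1 server at the relevant size, so it receives as much data as the two type-1 servers combined. The resulting amounts could then be used as load balancer weights.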
