METHODS AND APPARATUSES FOR ADJUSTING THE DISTRIBUTION OF PARTITIONED DATA

    公开(公告)号:US20180357727A1

    公开(公告)日:2018-12-13

    申请号:US15780380

    申请日:2016-12-16

    IPC分类号: G06Q40/00 G06F17/30

    摘要: The disclosure discloses a method for adjusting distribution of partitioned data. Access frequency information of to-be-processed partitioned data is acquired when an adjustment time corresponding to the to-be-processed partitioned data has arrived; revenue data of each of data centers corresponding to the partitioned data is then determined according to the access frequency information and a preset revenue function; and finally, optimal distribution information is determined according to the revenue data of each of the data centers and a quantity of copies, and a position of each of the copies in each of the data centers is adjusted according to the optimal distribution information. Therefore, without even having to provide additional memory or hard disk for storage, this disclosure dynamically optimizes the distribution status of the partitioned data according to access frequencies and characteristics of the partitioned data, thus reducing transmission bandwidth requirements among data centers.

    Method and system for designing a database system for high event rate, while maintaining predictable query performance

    公开(公告)号:US09875272B1

    公开(公告)日:2018-01-23

    申请号:US14748225

    申请日:2015-06-23

    申请人: Google Inc.

    IPC分类号: G06F17/00 G06F17/30

    摘要: A computer-implemented database system, method, and computer program product are provided. the database system comprises a plurality of nodes, each node including a data storage device having memory and a server; and a database system controller coupled to each of the plurality of nodes and having a processor and software program code for directing the database system to perform the following function of defining, for a user, a node group including at least one of the plurality of nodes, upon which the user's data is to be stored in a user database; wherein the user database is defined in terms of time-partitioned tables residing on the nodes of the node group, and wherein each time-partitioned table is further defined in terms of shards of the user's data, the shards corresponding with respective ones of the time partitions. The user data is managed in terms of the shards on respective nodes of the node group, and the time-partitioned tables include a chronologically oldest time-partitioned table and a current time-partitioned table into which a new user data entry will be written. The system further performs the functions of determining whether a predetermined criterion has been met; and, responsive to the detecting that the predetermined criterion has been met, purging all shards of the chronologically oldest time-partitioned table.

    Composite partition functions
    7.
    发明授权

    公开(公告)号:US09875263B2

    公开(公告)日:2018-01-23

    申请号:US14519952

    申请日:2014-10-21

    IPC分类号: G06F17/30

    摘要: Composite partition function techniques are described herein that may be used to repartition a database without moving existing data. In one or more implementations, a database is partitioned in a first arrangement associated with a first partition function. The database is repartitioned to form a second arrangement associated with a second partition function. A record of key membership for data entries in the partitions existing at the time of repartitioning is created, which can be used to recognize data corresponding to the first arrangement. A composite partition function is defined for subsequent database operations that is configured to apply the first partition function for data having keys found in the record of key membership and apply the second partition function for other data having keys that are not found in the record of key membership.