Abstract:
A method for partitioning an association table in a distributed database, where a manager determines a first data table in data tables requiring partition and generates a colocation partition (CP) table set of the first data table, the CP table set of the first data table includes the first data table and at least one CP table of the first data table, and a CP table of the first data table includes a data table whose partition key includes a subset of a partition key of the first data table. The manager partitions the first data table according to the partition key and partitions each CP table in the CP table set, a partition range of a partition key of each CP table is the same as a partition range of a corresponding partition key in the first data table.
Abstract:
Disclosed are a distributed storage system, a cluster node, and a range management method thereof, including: partitioning, by the cluster node according to a configuration parameter, a range corresponding to a first routing table entry in a local routing table into at least two subranges, where the first routing table entry refers to a routing table entry in which information that indicates the cluster node is recorded in replica information; separately establishing, by the cluster node, a log queue for each of the subranges; determining, by the cluster node, a corresponding subrange according to a key field of a data operation request from a client; executing a corresponding data read/write operation according to the determined subrange; and updating, according to the data read/write operation, a log queue corresponding to the determined subrange.
Abstract:
Disclosed are a distributed storage system, a cluster node, and a range management method thereof, including: partitioning, by the cluster node according to a configuration parameter, a range corresponding to a first routing table entry in a local routing table into at least two subranges, where the first routing table entry refers to a routing table entry in which information that indicates the cluster node is recorded in replica information; separately establishing, by the cluster node, a log queue for each of the subranges; determining, by the cluster node, a corresponding subrange according to a key field of a data operation request from a client; executing a corresponding data read/write operation according to the determined subrange; and updating, according to the data read/write operation, a log queue corresponding to the determined subrange.
Abstract:
A data table partitioning management method and apparatus are disclosed. The method includes: determining a type and a join key of each data table in a table group, where the type of the data table includes a one-dimensional table, a multidimensional table, or a fact table; and performing one-dimensional partitioning on row replica space of each data table in the table group, and performing one-dimensional or multidimensional partitioning on column replica space of the data table according to the type of the data table and based on the join key of the data table. Different partitioning management methods are applied to data tables of different types and different dimensions, so that data processing mechanisms of OLTP and OLAP are efficiently implemented in a system, and resource consumption is reduced.
Abstract:
A method for partitioning an association table in a distributed database, where a manager determines a first data table in data tables requiring partition and generates a colocation partition (CP) table set of the first data table, the CP table set of the first data table includes the first data table and at least one CP table of the first data table, and a CP table of the first data table includes a data table whose partition key includes a subset of a partition key of the first data table. The manager partitions the first data table according to the partition key and partitions each CP table in the CP table set, a partition range of a partition key of each CP table is the same as a partition range of a corresponding partition key in the first data table.
Abstract:
A data synchronization method, a data synchronization apparatus, and a distributed system are disclosed. A management node acquires a route update message that instructs to update routing information of the first data center and the second data center, where the routing information includes at least identification information of the first data center and the second data center, and backup routing information of nodes in the first data center and the second data center; the management node adjusts the routing information of the first data center and the second data center according to the route update message; and the management node synchronizes adjusted routing information of the first data center and the second data center to the first data center and the second data center, so that the first data center and the second data center perform, based on the adjusted routing information, synchronous transmission on data of managed nodes.