Abstract:
The solution discloses a data storage method in a distributed storage system: A client generates data of a stripe and concurrently sends data of strips in the stripe to storage nodes corresponding to the strips, so that data interaction between the storage nodes is reduced, and write concurrency is improved; and the storage node receives the data of the strips sent by the client and records a sequence of data of strips corresponding to a same logical address, so that write performance of a distributed block storage system is improved and data consistency is ensured.
Abstract:
Embodiments of the present invention provide a cluster system with calculation and storage converged, including a resource server group comprising at least two resource servers. Each resource server in the resource server group has at least one of a calculation resource and a storage resource. At least one resource server in the resource server group has a calculation resource and a storage resource, and the storage resource includes a persistent disk and a cache that corresponds to the persistent disk. A storage resource control module is configured to construct a shared storage resource pool by using storage resources in the resource server group, so as to provide a storage service, where storage resources included in each storage resource pool are from at least two resource servers in the resource server group.
Abstract:
A data storage method is used to improve storage consistency of a distributed storage system. The method includes: a primary storage node performs EC coding on a to-be-stored data segment to obtain a target EC stripe; determines in a storage node group to which the primary storage node belongs, m+k target storage nodes used to store m+k target EC blocks of the target EC stripe; sends a preparation message to the target storage nodes; receives a response message sent by a target storage node; and sends an execution message to the target storage nodes to instruct the target storage nodes to write target EC blocks that are in preparation logs.
Abstract:
A method for data reconstruction includes: obtaining a data migration list, where the data migration list indicates a partition with data to be migrated on a storage node with data to be migrated; generating a data migration task according to the data migration list and routing table information, where the routing table information indicates correspondence between a storage node in a cluster and a partition; and concurrently executing data migration tasks of a storage node in the cluster to reconstruct data. In addition, a device for data reconstruction and a storage system are further provided accordingly.
Abstract:
A data recovery method to improve storage consistency of a distributed storage system includes: receiving a recovery message from a storage node after a fault recovery of the faulty storage node; obtaining N lost erasure coding (EC) blocks of the faulty storage node through computation according to EC blocks stored by non-faulty storage nodes; sequentially sending N lost EC blocks to the faulty storage node for storage. In the process of sequentially sending the N lost EC blocks to the faulty storage node, when it is determined that for a first EC block that is to be stored by the faulty storage node, a second EC block having a same key value as the first EC block exists in the lost EC blocks, after the faulty storage node stores the second EC block, the first EC block is sent to the faulty storage node to store.
Abstract:
A data distribution method for improving performance of a distributed storage system includes: receiving, by a data distribution apparatus, a storage instruction of a user, dividing to-be-stored data that the storage instruction instructs to store, into P data segments, determining a storage node group corresponding to each data segment, and finally distributing the data segment to a primary node in the corresponding storage node group.
Abstract:
The present disclosure provides a storage system for data read and write. One embodiment of the storage system includes an origination device that is configured to receive a request for a logical block addressing-based operation on a volume, convert the logical block addressing-based operation request into a key addressing-based operation request carrying a key corresponding to data to be operated, and send the key addressing-based operation request to a routing library; the routing library is configured to receive the key addressing-based operation request, hash the key corresponding to the data to be operated, determine that a storage node taking charge of a hash region in which the hashed key is located is the master storage node, send the key addressing-based operation request to the master storage node of the data to be operated.
Abstract:
A data processing method in a storage system and the storage system, and relates to the field of data storage technologies. A client first sends a data block in a stripe and metadata of the data block to a data storage node. The client then sends the metadata of the data block and a parity block to a parity storage node, to back up the metadata of the data block on the parity storage node. When the metadata of the data block is successfully backed up, the data block is written into a corresponding storage device based on a storage location indicated by the metadata. This reduces write operation time.
Abstract:
Embodiments of the present invention provide a cluster system with calculation and storage converged, including: a resource server group, where the resource server group includes at least two resource servers, each resource server in the resource server group has at least one of: a calculation resource and a storage resource, at least one resource server in the resource server group has a calculation resource and a storage resource, and the storage resource includes a persistent disk and a cache that corresponds to the persistent disk; and a storage resource control module, configured to construct a shared storage resource pool by using storage resources in the resource server group, so as to provide a storage service, where storage resources included in each storage resource pool are from at least two resource servers in the resource server group.
Abstract:
A distributed storage and replication system includes a MDC module, multiple IO routing modules, and multiple OSD nodes. The MDC module is adapted to configure at least two partition, the IO routing module is adapted to route an IO request to an OSD node, and the OSD node is adapted to execute storage of data corresponding to the IO request. The MDC is configured to determine a faulty OSD node, update a partition view of a partition group that includes a partition on the faulty OSD node, and send an updating notification to a primary OSD node in the updated partition view. The primary OSD node is adapted to process replication of the data corresponding to the IO request. According to embodiments of the present disclosure, processing performance, fault tolerance, and availability of consistency replication are improved.