Synchronous replication in a distributed storage environment
    3.
    发明授权
    Synchronous replication in a distributed storage environment 有权
    在分布式存储环境中进行同步复制

    公开(公告)号:US09519555B2

    公开(公告)日:2016-12-13

    申请号:US13113702

    申请日:2011-05-23

    IPC分类号: G06F17/30 G06F11/20

    摘要: Embodiments of the present invention relate to synchronously replicating data in a distributed computing environment. To achieve synchronous replication both an eventual consistency approach and a strong consistency approach are contemplated. Received data may be written to a log of a primary data store for eventual committal. The data may then be annotated with a record, such as a unique identifier, which facilitates the replay of the data at a secondary data store. Upon receiving an acknowledgment that the secondary data store has written the data to a log, the primary data store may commit the data and communicate an acknowledgment of success back to the client. In a strong consistency approach, the primary data store may wait to send an acknowledgement of success to the client until it receives an acknowledgment that the secondary has not only written, but also committed, the data.

    摘要翻译: 本发明的实施例涉及在分布式计算环境中同步复制数据。 为了实现同步复制,可以考虑最终一致性方法和强一致性方法。 收到的数据可能会写入主数据存储的日志,以便最终提交。 然后可以用诸如唯一标识符的记录来注释数据,这有助于辅助数据存储器处的数据的重放。 在收到辅助数据存储已经将数据写入日志的确认之后,主数据存储可以提交数据并将成功确认传回到客户端。 在强一致性方法中,主数据存储可以等待向客户端发送成功确认,直到其接收到确认次要不仅写入但已经提交的数据。

    LOAD BALANCING WHEN REPLICATING ACCOUNT DATA
    5.
    发明申请
    LOAD BALANCING WHEN REPLICATING ACCOUNT DATA 有权
    更换帐户数据时的负载平衡

    公开(公告)号:US20120303791A1

    公开(公告)日:2012-11-29

    申请号:US13113776

    申请日:2011-05-23

    IPC分类号: G06F15/173

    摘要: Embodiments of the present invention relate to invoking and managing load-balancing operation(s) applied to partitions within a distributed computing environment, where each partition represents a key range of data for a storage account. The partitions affected by the load-balancing operation(s) are source partitions hosted on a primary storage stamp and/or destination partitions hosted on a secondary storage stamp, where the primary and secondary storage stamps are located in geographically distinct areas and are equipped to replicate the storage account's data therebetween. The load-balancing operation(s) include splitting partitions into child partitions upon detecting an increased workload as a result of active replication, merging partitions to form parent partitions upon detecting a reduction in workload as a result of decreased processing-related resource consumption, or offloading partitions based on resource consumption. A service within a partition layer of the storage stamps is responsible for determining when to invoke these load-balancing operation(s).

    摘要翻译: 本发明的实施例涉及调用和管理应用于分布式计算环境中的分区的负载平衡操作,其中每个分区表示存储帐户的数据的关键范围。 受负载平衡操作影响的分区是托管在辅助存储邮票上托管的主存储戳记和/或目标分区上的源分区,其中主存储邮票和辅助存储邮票位于地理位置不同的区域,并且被配置为 复制存储帐户之间的数据。 负载平衡操作包括在检测到作为主动复制的结果的增加的工作负载时将分区划分成子分区,在检测到与处理相关的资源消耗减少的结果中检测到工作负载的减少时,将分区合并以形成父分区;或 基于资源消耗卸载分区。 存储标记的分区层内的服务负责确定何时调用这些负载平衡操作。

    Geo-verification and repair
    7.
    发明授权
    Geo-verification and repair 有权
    地理验证和修复

    公开(公告)号:US08762336B2

    公开(公告)日:2014-06-24

    申请号:US13113634

    申请日:2011-05-23

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F11/1004 G06F17/30575

    摘要: Presented herein are methods of continuously verifying data and repairing errors introduced during replication. In a particular embodiment, a primary data store sends out information sufficient to create a checkpoint together with a checksum for the data being verified at that checkpoint. At the secondary data store, a checkpoint is created in accordance with the checkpointing information, and a checksum is calculated over the indicated data at the created checkpoint. If the calculated checksum disagrees with the received checksum, additional checksums are calculated over subranges of the indicated data and compared with corresponding checksums over the data at the primary data store. The checksums at the primary data store may be requested from the primary data store or calculated locally based on the received overall checksum. Once an erroneous entry is identified, it can then be re-replicated from the primary data store to restore data consistency.

    摘要翻译: 这里提出的是连续验证数据和修复复制期间引入的错误的方法。 在特定实施例中,主数据存储器发出足以创建检查点的信息以及在该检查点处正在验证的数据的校验和。 在次级数据存储中,根据检查点信息创建检查点,并且在创建的检查点处对指示的数据计算校验和。 如果计算的校验和与接收到的校验和不一致,则在指定数据的子范围内计算额外的校验和,并与主数据存储器上的数据相对应的校验和进行比较。 可以从主数据存储器请求主数据存储器上的校验和,或者基于接收到的总体校验和在本地计算。 一旦识别出错误的条目,它就可以从主数据存储重新复制,以恢复数据的一致性。

    Replication processes in a distributed storage environment
    8.
    发明授权
    Replication processes in a distributed storage environment 有权
    分布式存储环境中的复制过程

    公开(公告)号:US08918362B2

    公开(公告)日:2014-12-23

    申请号:US13113718

    申请日:2011-05-23

    IPC分类号: G06F17/00 G06F7/00 G06F17/30

    CPC分类号: G06F17/30575

    摘要: Embodiments of the present invention relate to systems, methods, and computer storage media for replicating data in a distributed computing environment utilizing a combination of replication methodologies. A full-object replication may be utilized to replicate a full state of an object from a primary data store to a secondary data store. A checkpoint created after initiating the full-object replication may be parsed to identify changes to the object that have been entered since initiating the full-object replication. This replication process is referred to as a delta-checkpoint replication methodology. Additionally, in an embodiment, a log-based replication methodology may be utilized. The log-based replication may communicate data from a log of the primary data store to the secondary data store. It is also contemplated in an exemplary embodiment that when the log-based replication fails to maintain a throughput threshold, one of the other replication methodologies may be initiated, at least temporarily.

    摘要翻译: 本发明的实施例涉及利用复制方法的组合在分布式计算环境中复制数据的系统,方法和计算机存储介质。 可以使用全对象复制来将对象的完整状态从主数据存储复制到辅助数据存储。 启动全对象复制后创建的检查点可能被解析,以识别自启动全对象复制以来已输入的对象的更改。 该复制过程被称为delta检查点复制方法。 另外,在一个实施例中,可以使用基于日志的复制方法。 基于日志的复制可以将数据从主数据存储的日志传送到辅助数据存储。 在示例性实施例中也可以想到,当基于日志的复制不能维持吞吐量阈值时,可以至少暂时启动其他复制方法之一。

    Load balancing when replicating account data
    9.
    发明授权
    Load balancing when replicating account data 有权
    复制帐户数据时的负载平衡

    公开(公告)号:US08886796B2

    公开(公告)日:2014-11-11

    申请号:US13113776

    申请日:2011-05-23

    IPC分类号: G06F15/173 G06F9/50 H04L29/08

    摘要: Embodiments of the present invention relate to invoking and managing load-balancing operation(s) applied to partitions within a distributed computing environment, where each partition represents a key range of data for a storage account. The partitions affected by the load-balancing operation(s) are source partitions hosted on a primary storage stamp and/or destination partitions hosted on a secondary storage stamp, where the primary and secondary storage stamps are located in geographically distinct areas and are equipped to replicate the storage account's data therebetween. The load-balancing operation(s) include splitting partitions into child partitions upon detecting an increased workload as a result of active replication, merging partitions to form parent partitions upon detecting a reduction in workload as a result of decreased processing-related resource consumption, or offloading partitions based on resource consumption. A service within a partition layer of the storage stamps is responsible for determining when to invoke these load-balancing operation(s).

    摘要翻译: 本发明的实施例涉及调用和管理应用于分布式计算环境中的分区的负载平衡操作,其中每个分区表示存储帐户的数据的关键范围。 受负载平衡操作影响的分区是托管在辅助存储邮票上托管的主存储戳记和/或目标分区上的源分区,其中主存储邮票和辅助存储邮票位于地理位置不同的区域,并且被配置为 复制存储帐户之间的数据。 负载平衡操作包括在检测到作为主动复制的结果的增加的工作负载时将分区划分成子分区,在检测到与处理相关的资源消耗减少的结果中检测到工作负载的减少时,将分区合并以形成父分区;或 基于资源消耗卸载分区。 存储标记的分区层内的服务负责确定何时调用这些负载平衡操作。