System and method for performing live partitioning in a data store
    2.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US09052831B1

    Publication (announcement) date: 2015-06-09

    Application number: US13174295

    Filing date: 2011-06-30

    IPC classification: G06F12/06 G06F3/06

    Abstract: A system that implements a scalable data storage service may maintain tables in a data store on behalf of storage service clients. The service may maintain table data in multiple replicas of partitions that are stored on respective computing nodes in the system. In response to detecting an anomaly in the system, detecting a change in data volume on a partition or service request traffic directed to a partition, or receiving a service request from a client to split a partition, the data storage service may create additional copies of a partition replica using a physical copy mechanism. The data storage service may issue a split command defined in an API for the data store to divide the original and additional replicas into multiple replica groups, and to configure each replica group to maintain a respective portion of the table data that was stored in the partition before the split.
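The copy-then-split flow in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Replica` class, `physical_copy`, `split_partition`, the single `split_key`, and the even division of replicas into two groups are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical partition replica: a node name plus its key/value items."""
    node: str
    items: dict = field(default_factory=dict)


def physical_copy(source: Replica, node: str) -> Replica:
    # Stand-in for the physical copy mechanism: duplicate the stored items as-is.
    return Replica(node, dict(source.items))


def split_partition(replicas, new_nodes, split_key):
    """Add copies of a replica, then divide all replicas into two groups.

    Assumption: the lower group keeps keys < split_key and the upper group
    keeps keys >= split_key.
    """
    expanded = replicas + [physical_copy(replicas[0], n) for n in new_nodes]
    half = len(expanded) // 2
    low_group, high_group = expanded[:half], expanded[half:]
    for replica in low_group:
        replica.items = {k: v for k, v in replica.items.items() if k < split_key}
    for replica in high_group:
        replica.items = {k: v for k, v in replica.items.items() if k >= split_key}
    return low_group, high_group
```

Starting from three replicas and three new nodes, this yields two groups of three replicas, each holding one portion of the original table data.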

    System and method for performing replica copying using a physical copy mechanism
    3.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US08732517B1

    Publication (announcement) date: 2014-05-20

    Application number: US13174275

    Filing date: 2011-06-30

    IPC classification: G06F11/00

    Abstract: A system that implements a data storage service may maintain tables in a data store on behalf of clients. The service may maintain table data in multiple replicas of partitions of the data that are stored on respective computing nodes in the system. In response to detecting a failure or fault condition, or receiving a service request from a client to move or copy a partition replica, the data store may copy a partition replica to another computing node using a physical copy mechanism. The physical copy mechanism may copy table data from physical storage locations in which it is stored to physical storage locations allocated to a destination replica on the other computing node. During copying, service requests to modify table data may be logged and applied to the replica being copied. A catch-up operation may be performed to apply modification requests received during copying to the destination replica.
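A minimal Python sketch of the copy-plus-catch-up idea described above, under simplifying assumptions: a replica is modeled as a plain dictionary, the "physical copy" is a snapshot of that dictionary, and `copy_with_catch_up` is a hypothetical name rather than anything defined in the patent.

```python
def copy_with_catch_up(source: dict, writes_during_copy):
    """Copy a replica while logging concurrent writes, then replay the log.

    `writes_during_copy` is an iterable of (key, value) modifications that
    arrive while the copy is in progress (an illustrative simplification).
    """
    # Stand-in for the physical copy: snapshot the source as it is right now.
    destination = dict(source)

    # During copying, each modification is applied to the replica being
    # copied (the source) and recorded in a log.
    log = []
    for key, value in writes_during_copy:
        source[key] = value
        log.append((key, value))

    # Catch-up: replay the logged modifications on the destination replica.
    for key, value in log:
        destination[key] = value
    return destination
```

After the catch-up step the destination holds the same data as the source, including the writes that arrived during the copy.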

    System and method for adjusting membership of a data replication group
    7.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US09069827B1

    Publication (announcement) date: 2015-06-30

    Application number: US13352060

    Filing date: 2012-01-17

    IPC classification: G06F17/30

    Abstract: A system that implements a data storage service may store data on behalf of storage service clients. The system may maintain data in multiple replicas of partitions that are stored on respective computing nodes in the system. A master replica for a replica group may increment a membership version indicator for the group, and may propagate metadata (including the membership version indicator) indicating a membership change for the group to other members of the group. Propagating the metadata may include sending a log record containing the metadata to the other replicas to be appended to their respective logs. Once the membership change becomes durable, it may be committed. A replica attempting to become the master of a replica group may determine that another replica in the group has observed a more recent membership version, in which case logs may be synchronized or snipped, or the attempt may be abandoned.
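The membership-version bookkeeping can be sketched in Python as follows. The `Replica` class, the log-record layout, and both function names are hypothetical, and the durability and commit steps are omitted; the sketch only shows the version increment, the log propagation, and the check a would-be master might perform.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical replica with a membership version and an append-only log."""
    name: str
    membership_version: int = 0
    log: list = field(default_factory=list)


def propagate_membership_change(master, others, new_members):
    """Master bumps the membership version and sends a log record to the others."""
    master.membership_version += 1
    record = {
        "membership_version": master.membership_version,
        "members": sorted(new_members),
    }
    master.log.append(record)
    for replica in others:
        # Each receiving replica appends the record to its own log.
        replica.log.append(record)
        replica.membership_version = record["membership_version"]
    return record


def may_become_master(candidate, others):
    """False if any other replica has observed a more recent membership version."""
    return all(r.membership_version <= candidate.membership_version for r in others)
```

A replica that missed the change (and so still holds the older version) fails the check, which corresponds to the case where logs would need to be synchronized or the attempt abandoned.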

    System and method for data replication using a single master failover protocol
    8.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US09116862B1

    Publication (announcement) date: 2015-08-25

    Application number: US13352326

    Filing date: 2012-01-17

    IPC classification: G06F11/00 G06F11/20

    Abstract: A system that implements a data storage service may store data on behalf of storage service clients. The system may maintain data in multiple replicas of various partitions that are stored on respective computing nodes in the system. The system may employ a single master failover protocol, usable when a replica attempts to become the master replica for a replica group of which it is a member. Attempting to become the master replica may include acquiring a lock associated with the replica group, and gathering state information from the other replicas in the group. The state information may indicate whether another replica supports the attempt (in which case it is included in a failover quorum) or stores more recent data or metadata than the replica attempting to become the master (in which case synchronization may be required). If the failover quorum includes enough replicas, the replica may become the master.
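The decision logic in this abstract can be sketched as a single Python function. The abstract says only that the quorum must include "enough" replicas; the strict-majority rule below is an assumption, as are the function name, the `supports` and `latest_seq` fields, and the returned status strings.

```python
def try_become_master(candidate_seq, peer_states, group_size, lock_held):
    """Decide the outcome of one attempt to become master.

    peer_states: list of dicts with hypothetical fields
      "supports"   -> bool, whether the peer supports the attempt
      "latest_seq" -> int, the most recent data/metadata sequence the peer stores
    """
    if not lock_held:
        # The lock associated with the replica group was not acquired.
        return "no-lock"
    if any(p["latest_seq"] > candidate_seq for p in peer_states):
        # Some peer stores more recent data or metadata than the candidate.
        return "sync-required"
    # The candidate counts itself plus every supporting peer.
    supporters = 1 + sum(1 for p in peer_states if p["supports"])
    # Assumption: "enough replicas" means a strict majority of the group.
    return "master" if supporters > group_size // 2 else "no-quorum"
```

With a group of three, one supporting peer is enough under the majority assumption; with none, the attempt ends without a quorum.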

    System and method for splitting a replicated data partition
    9.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US08930312B1

    Publication (announcement) date: 2015-01-06

    Application number: US13352075

    Filing date: 2012-01-17

    IPC classification: G06F17/30

    Abstract: A system that implements a data storage service may store data on behalf of storage service clients. The system may maintain data in multiple replicas of partitions that are stored on respective computing nodes in the system. The system may split a data partition into two new partitions, and may split the replica group that stored the original partitions into two new replica groups, each storing one of the new partitions. To split the replica group, the master replica may propagate membership changes to the other members of the replica group for adding members to the original replica group and for splitting the expanded replica group into two new replica groups. Subsequent to the split, replicas may attempt to become the master for the original replica group or for a new replica group. If an attempt to become master replica for the original replica group succeeds, the split may fail.

    System and method for throttling service requests having non-uniform workloads
    10.
    Granted invention patent
    Legal status: in force

    Publication (announcement) number: US08914497B1

    Publication (announcement) date: 2014-12-16

    Application number: US13327620

    Filing date: 2011-12-15

    IPC classification: G06F15/173 G06F11/00

    Abstract: A system that provides services to clients may receive and service requests, various ones of which may require different amounts of work. The system may determine whether it is operating in an overloaded or underloaded state based on a current work throughput rate, a target work throughput rate, a maximum request rate, or an actual request rate, and may dynamically adjust the maximum request rate in response. For example, if the maximum request rate is being exceeded, the maximum request rate may be raised or lowered, dependent on the current work throughput rate. If the target or committed work throughput rate is being exceeded, but the maximum request rate is not being exceeded, a lower maximum request rate may be proposed. Adjustments to the maximum request rate may be made using multiple incremental adjustments. Service request tokens may be added to a leaky token bucket at the maximum request rate.
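The adjustment rules and the token bucket can be sketched in Python as below. This uses a plain token bucket rather than whatever "leaky token bucket" variant the patent defines, and the class, the function name, the 10% `step`, and the direction chosen for each raise/lower case are illustrative assumptions.

```python
class TokenBucket:
    """Simple token bucket refilled at the current maximum request rate."""

    def __init__(self, rate, capacity):
        self.rate = rate          # tokens added per second (max request rate)
        self.capacity = capacity  # upper bound on stored tokens
        self.tokens = capacity

    def refill(self, elapsed_seconds):
        self.tokens = min(self.capacity, self.tokens + self.rate * elapsed_seconds)

    def try_accept(self):
        # Accept a request only if a token is available; otherwise throttle it.
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False


def adjust_max_rate(max_rate, request_rate, work_rate, target_work_rate, step=0.1):
    """Return the max request rate after one incremental adjustment."""
    if request_rate > max_rate:
        # Max request rate exceeded: raise it if work throughput is below
        # target (assumed direction), otherwise lower it.
        if work_rate < target_work_rate:
            return max_rate * (1 + step)
        return max_rate * (1 - step)
    if work_rate > target_work_rate:
        # Work target exceeded without exceeding the request limit:
        # propose a lower maximum request rate.
        return max_rate * (1 - step)
    return max_rate
```

Calling `adjust_max_rate` repeatedly and assigning the result to `bucket.rate` would give the multiple incremental adjustments the abstract mentions.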
