-
Publication number: US20200341657A1
Publication date: 2020-10-29
Application number: US16926519
Application date: 2020-07-10
Applicant: Amazon Technologies, Inc.
Inventor: Stefano Stefani , Timothy Andrew Rath , Chiranjeeb Buragohain , Yan Valerie Leshinsky , David Alan Lutz , Jakub Kulesza , Wei Xiao , Jai Vasanth
Abstract: A system that implements a scalable data storage service may maintain tables in a data store on behalf of storage service clients. The service may maintain table data in multiple replicas of partitions that are stored on respective computing nodes in the system. In response to detecting an anomaly in the system, detecting a change in data volume on a partition or service request traffic directed to a partition, or receiving a service request from a client to split a partition, the data storage service may create additional copies of a partition replica using a physical copy mechanism. The data storage service may issue a split command defined in an API for the data store to divide the original and additional replicas into multiple replica groups, and to configure each replica group to maintain a respective portion of the table data that was stored in the partition before the split.
-
Publication number: US10776212B2
Publication date: 2020-09-15
Application number: US15495827
Application date: 2017-04-24
Applicant: Amazon Technologies, Inc.
Inventor: Maximiliano Maccanti , Timothy Andrew Rath , Rama Krishna Sandeep Pokkunuri , Akshat Vig , Clarence Wing Yin Ng , Srivaths Badrinath Copparam , Rajaprabhu Thiruchi Loganathan , Wei Xiao , William Alexander Stevenson
Abstract: A system that implements a data storage service may store data for a database table in multiple replicated partitions on respective storage nodes. In response to a request to back up a table, the service may back up individual partitions of the table to a remote storage system independently and (in some cases) in parallel, and may update (or create) and store metadata about the table and its partitions on storage nodes of the data storage service and/or in the remote storage system. Backing up each partition may include exporting it from the database in which the table is stored, packaging and compressing the exported partition for upload, and uploading the exported, packaged, and compressed partition to the remote storage system. The remote storage system may be a key-value durable storage system in which each backed-up partition is accessible using its partition identifier as the key.
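The per-partition backup flow described in this abstract (export, package and compress, upload under the partition identifier) can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `backup_partition`, `backup_table`, and `restore_partition` are hypothetical, JSON and gzip stand in for the unspecified export and packaging formats, and a plain Python dict stands in for the remote key-value durable storage system.

```python
import gzip
import json
from concurrent.futures import ThreadPoolExecutor


def backup_partition(partition_id, rows, remote_store):
    """Export, compress, and upload one partition (all formats are assumptions)."""
    exported = json.dumps(rows, sort_keys=True).encode("utf-8")  # export step
    packaged = gzip.compress(exported)                           # package + compress step
    remote_store[partition_id] = packaged                        # upload, keyed by partition ID
    return partition_id, len(packaged)


def backup_table(table_name, partitions, remote_store, max_workers=4):
    """Back up each partition independently and in parallel, then store table metadata."""
    def task(item):
        return backup_partition(item[0], item[1], remote_store)

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(task, partitions.items()))
    metadata = {"table": table_name, "partitions": sorted(pid for pid, _ in results)}
    # The metadata key layout is an assumption made for this sketch.
    remote_store[f"{table_name}/metadata"] = json.dumps(metadata).encode("utf-8")
    return metadata


def restore_partition(partition_id, remote_store):
    """Hypothetical helper: fetch a backed-up partition by its identifier and unpack it."""
    return json.loads(gzip.decompress(remote_store[partition_id]).decode("utf-8"))
```

Because each partition is handled by its own task and stored under its own key, one partition can be uploaded or fetched without touching the others.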
-
Publication number: US20200175035A1
Publication date: 2020-06-04
Application number: US16698481
Application date: 2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Andrew Rath , David Alan Lutz
IPC: G06F16/27 , G06F16/182
Abstract: A system that implements a data storage service may store data on behalf of clients in multiple replicas on respective computing nodes. The system may employ an external service to select a master replica for a replica group. The master replica may service consistent read operations and/or write operations that are directed to the replica group (or to a data partition stored by the replica group). The master replica may employ a quorum based mechanism for performing replicated write operations, and a local lease mechanism for determining the replica authorized to perform consistent reads, even when the external service is unavailable. The master replica may propagate local leases to replica group members as replicated writes. If another replica assumes mastership for the replica group, it may not begin servicing consistent read operations that are directed to the replica group until the lease period for a current local lease expires.
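The local lease idea in this abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration: the `Replica` class, the function names, the explicit `now` timestamps, and the simplification that a replica records a lease as soon as it receives it. The sketch only shows the two rules the abstract states: a lease is propagated as a quorum-acknowledged replicated write, and a new master does not serve consistent reads until the current lease has expired.

```python
class Replica:
    """Hypothetical replica that remembers the most recent lease it has seen."""

    def __init__(self, name, reachable=True):
        self.name = name
        self.reachable = reachable
        self.lease_holder = None
        self.lease_expiry = 0.0

    def apply_lease(self, holder, expiry):
        self.lease_holder = holder
        self.lease_expiry = expiry


def propagate_lease(master, group, now, lease_seconds):
    """Send the lease to every reachable replica; succeed only on a majority of acks."""
    expiry = now + lease_seconds
    acks = 0
    for replica in group:
        if replica.reachable:
            replica.apply_lease(master.name, expiry)
            acks += 1
    return acks >= len(group) // 2 + 1


def can_serve_consistent_read(replica, now):
    """A replica serves consistent reads only while it holds an unexpired lease."""
    return replica.lease_holder == replica.name and now < replica.lease_expiry


def may_start_serving_reads(new_master, now):
    """A newly elected master waits out the lease it last saw before serving reads."""
    return now >= new_master.lease_expiry
```

For example, if replica `a` obtains a 5-second lease at time 100, it may serve consistent reads until time 105, and a replica `b` that takes over mastership at time 102 must wait until 105 before doing the same.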
-
Publication number: US20200159745A1
Publication date: 2020-05-21
Application number: US16684901
Application date: 2019-11-15
Applicant: Amazon Technologies, Inc.
Inventor: Akshat Vig , Timothy Andrew Rath , Stuart Henry Seelye Marshall , Rande A. Blackman , David Alan Lutz , Jian Wang , Jiandan Zheng , Janani Narayanan
IPC: G06F16/27 , G06F16/2455 , G06F16/2458 , G06F16/90 , G06F16/28
Abstract: Methods and apparatus for conditional master election in a distributed database are described. A plurality of replicas of a database object are stored by a distributed database service. Some types of operations corresponding to client requests directed at the database object are to be coordinated by a master replica. Client access to the database object is enabled prior to election of a master replica. In response to a triggering condition, a particular replica is elected master. The master coordinates implementation of operations with one or more other replicas in response to client requests.
-
Publication number: US10461991B1
Publication date: 2019-10-29
Application number: US15269156
Application date: 2016-09-19
Applicant: Amazon Technologies, Inc.
Abstract: A distributed database system may comprise a master computing node and one or more replication peer nodes. Upon identifying a failure of one of the nodes, a control plane may obtain a computing node instance and configure the computing node instance to act as a log-only peer to the remaining nodes. The log-only peer may store the log tail of the master computing node and may participate in quorum decisions. Upon restoration of the failed node, the log-only peer may be returned to the pool of computing node instances.
-
Publication number: US10303795B2
Publication date: 2019-05-28
Application number: US14482668
Application date: 2014-09-10
Applicant: Amazon Technologies, Inc.
Inventor: Allan Henry Vermeulen , Kiran-Kumar Muniswamy-Reddy , Timothy Daniel Cole , Timothy Andrew Rath
Abstract: In response to a read request directed to a first data store of a storage group, a state transition indicator is identified, corresponding to a modification that has been applied at the data store before a response to the read is prepared. A read descriptor that includes the state transition indicator and read repeatability verification metadata is prepared. The metadata can be used to check whether the read request is a repeatable read. The read descriptor is transmitted to a client-side component of the storage group.
-
Publication number: US10248704B2
Publication date: 2019-04-02
Application number: US14270323
Application date: 2014-05-05
Applicant: Amazon Technologies, Inc.
Inventor: Timothy Andrew Rath
IPC: G06F17/30
Abstract: A system that implements a data storage service may store data on behalf of storage service clients. The system may maintain data in multiple replicas that are stored on respective computing nodes in the system. Updates to the stored data and to the membership of replica groups are propagated as replicated log records. A replica receiving a log record may compare metadata in the received log record to corresponding metadata in a log record that was previously appended to its log to determine a response. The metadata may include a sequence number, a lock generation identifier, an epoch identifier, or an indication of an epoch change. The replica may append the received log record to its log, drop the received log record, or cache the received log record for future use. If a log conflict indicates an invalid log stream branch, one or more log records may be deleted.
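The append / drop / cache decision described in this abstract can be sketched as below. The abstract does not give the comparison rules, so the ones used here are assumptions chosen for illustration: a record from an older epoch or with an already-seen sequence number is dropped, a record that leaves a gap is cached, and the next-in-order record is appended. The class and field names are hypothetical, and lock generation identifiers and branch deletion are left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogRecord:
    sequence: int
    epoch: int
    payload: str


class ReplicaLog:
    """Hypothetical replica log that compares incoming metadata with its last record."""

    def __init__(self):
        self.records = []  # appended records, in sequence order
        self.cache = {}    # out-of-order records held for future use

    def receive(self, record):
        last_seq = self.records[-1].sequence if self.records else 0
        last_epoch = self.records[-1].epoch if self.records else 0
        # Assumed rule: stale epoch or already-seen sequence number is dropped.
        if record.epoch < last_epoch or record.sequence <= last_seq:
            return "drop"
        # Assumed rule: a gap in sequence numbers means the record arrived early.
        if record.sequence > last_seq + 1:
            self.cache[record.sequence] = record
            return "cache"
        self.records.append(record)
        # Drain cached records that are now next in order.
        next_seq = record.sequence + 1
        while next_seq in self.cache:
            cached = self.cache.pop(next_seq)
            if cached.epoch < self.records[-1].epoch:
                break  # cached record belongs to an older epoch; discard it
            self.records.append(cached)
            next_seq += 1
        return "append"
```

With these rules, receiving records 1, 3, 1, 2 in that order yields the responses append, cache, drop, append, and the log ends up holding records 1, 2, 3.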
-
Publication number: US20180322017A1
Publication date: 2018-11-08
Application number: US16035465
Application date: 2018-07-13
Applicant: Amazon Technologies, Inc.
Inventor: Maximiliano Maccanti , Timothy Andrew Rath , Rama Krishna Sandeep Pokkunuri , Akshat Vig , Clarence Wing Yin Ng , Srivaths Badrinath Copparam , Rajaprabhu Thiruchi Loganathan , Wei Xiao , William Alexander Stevenson
IPC: G06F11/14
CPC classification number: G06F11/1458
Abstract: A system that implements a data storage service may store data for database tables in multiple replicated partitions on respective storage nodes. In response to a request to restore a given table that was backed up in a remote storage system (e.g., key-value durable storage system), the service may create a new table, and may import a copy of each of the partitions of the given table from the remote storage system into the new table. The request to restore the table may specify a modified value for a configuration parameter for the table or for one of its partitions. The service may apply the new configuration parameter value to the table or its partitions during the restore operation. The new configuration parameter value may indicate an increase or decrease in storage capacity or throughput capacity, and its application may automatically trigger a partition split or move operation.
-
Publication number: US10120905B2
Publication date: 2018-11-06
Application number: US14578841
Application date: 2014-12-22
Applicant: Amazon Technologies, Inc.
Abstract: A probabilistic counting structure such as a hyperloglog may be formed during a table scan for each of a selected set of columns. The columns may be selected based on an initial estimate of relatedness, which may be based on data types of the respective columns. An estimated cardinality of an intersection or union of columns may be formed based on an intersection of the probabilistic data structures. A join path may be determined based on the estimated cardinality of an intersection or union of the columns.
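To make the cardinality estimates in this abstract concrete, here is a minimal HyperLogLog sketch. It is an illustration under stated assumptions rather than the patented method: the class and function names are hypothetical, SHA-1 is an arbitrary choice of hash, the union is estimated by taking the register-wise maximum of two sketches, and the intersection is estimated by inclusion-exclusion, which is a common substitute and not necessarily what the patent means by an intersection of the probabilistic structures.

```python
import hashlib
import math


class HyperLogLog:
    """Simplified HyperLogLog with 2**p registers (no large-range correction)."""

    def __init__(self, p=14):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m

    def add(self, value):
        digest = hashlib.sha1(str(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")        # 64-bit hash
        index = h >> (64 - self.p)                   # first p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)        # remaining 64 - p bits
        rank = (64 - self.p) - rest.bit_length() + 1  # leading zeros + 1
        self.registers[index] = max(self.registers[index], rank)

    def merge(self, other):
        """Sketch of the union: register-wise maximum."""
        merged = HyperLogLog(self.p)
        merged.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]
        return merged

    def estimate(self):
        alpha = 0.7213 / (1 + 1.079 / self.m)        # valid for m >= 128
        raw = alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            return self.m * math.log(self.m / zeros)  # small-range (linear counting)
        return raw


def estimate_intersection(a, b):
    """Inclusion-exclusion: |A and B| is roughly |A| + |B| - |A or B| (an assumed approach)."""
    return a.estimate() + b.estimate() - a.merge(b).estimate()
```

A query planner could build one such sketch per candidate column during a table scan and compare the estimated intersection sizes of column pairs when choosing a join path. The inclusion-exclusion estimate becomes unreliable when the true intersection is small relative to the two columns.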
-
Publication number: US09824133B1
Publication date: 2017-11-21
Application number: US14494473
Application date: 2014-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Santosh Kalki , Srinivasan Sundar Raghavan , Timothy Andrew Rath , Mukul Vijay Karnik , Amol Devgan , Swaminathan Sivasubramanian
CPC classification number: G06F17/30592
Abstract: A multi-tenant system for providing hosted analytic services may be dynamically configured in response to a request from a user. A request for analytic services may comprise an indication of at least one data source to be incorporated into an n-dimensional cube. A data source connector and transformation pipeline may transform data received from the data source to a format compatible with a dimension and hierarchy model of the n-dimensional cube.