-
公开(公告)号:US20230022729A1
公开(公告)日:2023-01-26
申请号:US17937389
申请日:2022-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Fan Ping , Andrew Boyer , Oleksandr Chychykalo , James Pinkerton , Danny Wei , Norbert Paul Kusters , Divya Ashok Kumar Jain , Jianhua Fan , Thomas Tarak Mathew Veppumthara , Sebastiano Peluso
Abstract: A block-based storage system hosts logical volumes that are implemented via multiple replicas of volume data stored on multiple resource hosts in different failure domains. Also, the block-based storage service allows multiple client computing devices to attach to a same given logical volume at the same time. In order to prevent unnecessary failovers, a primary node storing a primary replica is configured with a health check application programmatic interface (API) and a secondary node storing a secondary replica determines whether or not to initiate a failover based on the health of the primary replica.
-
公开(公告)号:US20210089212A1
公开(公告)日:2021-03-25
申请号:US16579687
申请日:2019-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Kiran-Kumar Muniswamy-Reddy , Danny Wei , Norbert Paul Kusters , Romain Benoit Seguy , Rahul Upadhyaya , Divya Ashok Kumar Jain , William Zaharchuk , Wells Lin
Abstract: The present disclosure generally relates to creating virtualized block storage devices whose data is replicated across isolated computing systems to lower risk of data loss even in wide-scale events, such as natural disasters. The virtualized device can include at least two volumes, each of which is implemented in a distinct computing system. In the case of a failed volume, a new volume can be created and populated with data from the surviving volume. During population, new writes can continue to be replicated to the new volume. The population process can write data from the surviving volume to the new volume “under” new writes, such that the population process does not overwrite data included in the new writes.
-
公开(公告)号:US10521135B2
公开(公告)日:2019-12-31
申请号:US15433951
申请日:2017-02-15
Applicant: Amazon Technologies, Inc.
Inventor: Norbert Paul Kusters , Nachiappan Arumugam , Andre Podnozov , Shobha Agrawal , Shreyas Ramalingam , Danny Wei , David R. Richardson , Marc John Brooker , Christopher Nathan Watson , John Luther Guthrie, II , Ravi Nankani
Abstract: A data storage system includes a head node and mass storage devices. The head node is configured to flush data stored in a storage of the head node, based at least in part on one or more triggers being met, from the storage of the head node to a set of the mass storage devices of the data storage system. The flushed data is written to a segment of free storage space across the set of the mass storage devices allocated for the given data flush operation. In some embodiments, a head node may flush both current version data and point-in-time version data to the set of mass storage devices. Also, the data storage system maintains an index that indicates storage locations of data for particular portions of a volume before and after the data is flushed to the set of mass storage devices.
-
公开(公告)号:US10452680B1
公开(公告)日:2019-10-22
申请号:US14866659
申请日:2015-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Jianhua Fan , Benjamin Arthur Hawks , Norbert Paul Kusters , Nachiappan Arumugam , Danny Wei , John Luther Guthrie, II
IPC: G06F16/27 , G06F16/955 , G06F16/22
Abstract: Master-slave pairs can be used to provide data redundancy in an electronic data environment. A master peer can include a B-tree with references to the corresponding data. When provisioning a slave, the master can send a point-in-time copy of the B-tree to the slave, which can allocate the necessary space on local storage and update the references of the B-tree to point to the local storage for the slave. If the master and slave become disconnected, one of the peers can function as a solo master until the peers are again connected, at which point the old peer can be brought current or a new slave provisioned. A log peer can also be provisioned by a solo master, which can store data for operations received during the disconnect for use in catching up a slave peer, which could be the old slave, the log peer, or a new peer.
-
公开(公告)号:US09864538B1
公开(公告)日:2018-01-09
申请号:US14750926
申请日:2015-06-25
Applicant: Amazon Technologies, Inc.
Inventor: Robert Michael Johnson , Mark Bradley Davis , Norbert Paul Kusters , Marc Stephen Olson , Marc John Brooker
CPC classification number: G06F3/0626 , G06F3/0605 , G06F3/0631 , G06F3/0665 , G06F3/0689 , G06F9/5077
Abstract: Server computers often include one or more input/output (I/O) devices for communicating with a network or directly attached storage device. Data transferred between these devices may include blocks of data with a common often repeated and identifiable data pattern. Transfer and storage of data of this nature may be optimized by transferring primarily blocks of data that are not of the pre-determined data pattern. An indicator may be transferred and stored with transferred data that has been reduced in size in this manner.
-
公开(公告)号:US20170364411A1
公开(公告)日:2017-12-21
申请号:US15694684
申请日:2017-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Jianhua Fan , Benjamin Arthur Hawks , Norbert Paul Kusters , Nachiappan Arumugam , Danny Wei , John Luther Guthrie, II
CPC classification number: G06F11/1448 , G06F3/0605 , G06F3/0619 , G06F3/065 , G06F3/067 , G06F3/0689 , G06F11/1464 , G06F11/1471 , G06F2201/84
Abstract: The present disclosure provides persistent storage for a master copy using operation numbers. A master copy can include a B-tree with references to corresponding data. When provisioning a slave copy, the master copy sends a point-in-time copy of the B-tree to the slave copy, which stores a copy of the B-tree, allocates the necessary space, and updates the references of the B-tree to point to a local storage before the data is transferred. When writing the data to persistent storage, a snapshot created on the master copy is an operation that is replicated to the slave copy. The snapshot is generated using a volume view that includes changes to chunks of data of the master copy since a previous snapshot, as determined using the operation number for the previous snapshot. Data (and metadata) for the snapshot is written to persistent storage while new I/O operations are processed.
-
公开(公告)号:US09817786B1
公开(公告)日:2017-11-14
申请号:US14752294
申请日:2015-06-26
Applicant: Amazon Technologies, Inc.
Inventor: Asif Khan , Thomas A. Volpe , Marc John Brooker , Marc Stephen Olson , Norbert Paul Kusters , Mark Bradley Davis , Robert Michael Johnson
CPC classification number: G06F13/4221 , G06F11/1064 , G11C15/00
Abstract: Server computers often include one or more input/output (I/O) adapter devices for communicating with a network or directly attached storage device. The data transfer latency for request can be reduced by utilizing ingress data placement logic to bypass the processor of the I/O adapter device. For example, host memory descriptors can be stored in a content addressable memory unit of the I/O adapter device to facilitate placement of requested data.
-
公开(公告)号:US20170308406A1
公开(公告)日:2017-10-26
申请号:US15645909
申请日:2017-07-10
Applicant: Amazon Technologies, Inc.
Inventor: Norbert Paul Kusters , John Robert Smiley , Marc John Brooker , Bei-Jing Guo , Marc Levy
CPC classification number: G06F9/5011 , G06F3/061 , G06F3/0656 , G06F3/067 , G06F9/4881 , G06F9/50 , G06F2209/503 , G06F2209/504 , Y02D10/22
Abstract: Burst throttling methods may be used to manage computing resources of a data storage service. Tokens may represent I/O operations executed by a customer of the data storage service. A first token bucket may contain a set of tokens representing the overall I/O operation capacity of the data storage service. Additionally, a second token bucket may contain a set of tokens for a given logical volume maintained by the data storage service. When I/O requests are received tokens may be charged the first token bucket and the second token bucket. Furthermore, if there is sufficient capacity, the data storage service may charge a reduced number of tokens to the third token bucket.
-
公开(公告)号:US11941278B2
公开(公告)日:2024-03-26
申请号:US17520537
申请日:2021-11-05
Applicant: Amazon Technologies, Inc.
Inventor: Norbert Paul Kusters , Jianhua Fan , Shuvabrata Ganguly , Danny Wei , Avram Israel Blaszka
CPC classification number: G06F3/0644 , G06F3/0617 , G06F3/0631 , G06F3/065 , G06F3/067 , G06F11/1612 , G06F11/3034
Abstract: A data storage system includes multiple head nodes and data storage sleds. Volume data is replicated between a primary and one or more secondary head nodes for a volume partition and is further flushed to a set of mass storage devices of the data storage sleds. Volume metadata is maintained in a primary and one or more secondary head nodes for a volume partition and is updated in response to volume data being flushed to the data storage sleds. Also, the primary and secondary head nodes store check-points of volume metadata to the data storage sleds, wherein in response to a failure of a primary or secondary head node for a volume partition, a replacement secondary head node for the volume partition recreates a secondary replica for the volume partition based, at least in part, on a stored volume metadata checkpoint.
-
公开(公告)号:US11681443B1
公开(公告)日:2023-06-20
申请号:US17006502
申请日:2020-08-28
Applicant: Amazon Technologies, Inc.
Inventor: Sriram Venugopal , Kun Tang , Norbert Paul Kusters , Jianhua Fan
CPC classification number: G06F3/0619 , G06F3/0604 , G06F3/0641 , G06F3/0644 , G06F3/0652 , G06F3/0683 , G06F11/10 , G06F2201/84
Abstract: A data storage system includes a head node and mass storage devices. The head node is configured to store volume data and flush volume data to the mass storage devices. Additionally, the head node is configured to determine a quantity of data partitions and/or parity partitions to store for a chunk of volume data being flushed to the mass storage devices in order to satisfy a durability guarantee. For chunks of data for which complete copies are also stored in an additional data storage system, the head node is configured to reduce the quantity of data partitions and/or parity partitions stored such that required storage space is reduced while still ensuring that the durability guarantee is satisfied.
-
-
-
-
-
-
-
-
-