-
公开(公告)号:US10853373B1
公开(公告)日:2020-12-01
申请号:US15980596
申请日:2018-05-15
Applicant: Amazon Technologies, Inc.
Inventor: Sandeep Bhatia , Timothy Andrew Rath , Mustafa Ozan Ozen , Atilim Cetin , Gaurav Gupta
IPC: G06F16/2455 , G06F16/2458 , G06F16/22 , G06F16/21
Abstract: A data storage and retrieval system receives data points for a time series. The data storage and retrieval system stores the data points for a first portion of the time series using a first data format. Based at least in part on an analysis of queries performed on the first portion of the time series, the data storage and retrieval system determines to store a second portion of the time series using a second data format. The data storage and retrieval system stores subsequently received data points of the time series in the second data format.
-
公开(公告)号:US11263184B1
公开(公告)日:2022-03-01
申请号:US16116791
申请日:2018-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Dumanshu Goyal , Lonnie J. Princehouse , Gaurav Saxena , Atilim Cetin , Gaurav Gupta , Sandeep Bhatia , Nilesh Shahdadpuri , Timothy A. Rath , Eric Coll , Nirmesh Khandelwal
IPC: G06F16/00 , G06F16/21 , G06F16/27 , G06F16/2458 , G06F16/248 , G06F16/901 , G06F16/2457
Abstract: Methods, systems, and computer-readable media for partition splitting in a distributed database are disclosed. A partition of data is split into a first sub-partition and a second sub-partition. A first portion of the data is assigned to the first sub-partition, and a second portion of the data is assigned to the second sub-partition. One or more elements of the first portion of the data from the partition and an additional one or more elements of the first portion of the data from the first sub-partition are stored into a first node. One or more elements of the second portion of the data from the partition and an additional one or more elements of the second portion of the data from the second sub-partition are stored into a second node. The partition is prevented from receiving new data after the partition is split into the first and second sub-partitions.
-
公开(公告)号:US11120052B1
公开(公告)日:2021-09-14
申请号:US16021707
申请日:2018-06-28
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Sandeep Bhatia , Atilim Cetin , Lonnie J. Princehouse , Timothy Andrew Rath , Gaurav Saxena
IPC: G06F16/28 , G06F16/22 , G06F16/2455
Abstract: Techniques are described for clustering data at the point of ingestion for storage using scalable storage resources. To cluster data at the point of ingestion, a data ingestion and query service uses a multilevel hash tree (MLHT)-based index to map a hierarchy of attribute values associated with each data element onto a point of a MLHT (which itself conceptually maps onto a continuous range of values). The total range of the MLHT is divided into one or more data partitions, each of which is mapped to one or more physical storage resources. A mapping algorithm uses the hierarchy of attribute fields to calculate the position of each data element ingested and, consequently, a physical storage resource at which to store the data element.
-
-