-
公开(公告)号:US11550787B1
公开(公告)日:2023-01-10
申请号:US17118307
申请日:2020-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Andre Hernich , Vuk Ercegovac , Gaurav Saxena , Panagiotis Parchas , Yannis Papakonstantinou , Balakrishnan Narayanaswamy , Enrico Siragusa
IPC: G06F16/2453 , G06F16/2455
Abstract: Match rules for rewriting queries to use materialized views may be dynamically generated by a database system. A database system may generate rules that indicate whether a given query can use a materialized view and how to rewrite the given query to use the materialized view. A query may be received and the rules may be applied to the query to determine that the query can use the materialized view and to rewrite the query to use the materialized view. The rewritten query can then be executed.
-
公开(公告)号:US20220237184A1
公开(公告)日:2022-07-28
申请号:US17722222
申请日:2022-04-15
Applicant: Amazon Technologies, Inc.
Inventor: Gaurav Saxena , Ippokratis Pandis , Mengchu Cai , Mingda Li , Mohammad Rezaur Rahman , Naresh Chainani
IPC: G06F16/2458 , G06F16/2455 , G06F16/28
Abstract: A primary query engine may receive a query and determine whether the query is eligible for performance at a secondary query engine. If eligible, the primary query engine may evaluate the availability of the first query engine to perform the query. The first query engine may determine whether to assign the query to the primary query engine or to the secondary query according to availability evaluation. For queries assigned to the secondary query engine, the primary query engine may send a request to the secondary query engine to being processing of the query.
-
公开(公告)号:US11308100B2
公开(公告)日:2022-04-19
申请号:US16452385
申请日:2019-06-25
Applicant: Amazon Technologies, Inc.
Inventor: Gaurav Saxena , Ippokratis Pandis , Mengchu Cai , Mingda Li , Mohammad Rezaur Rahman , Naresh Chainani
IPC: G06F16/245 , G06F16/2458 , G06F16/2455 , G06F16/28 , G06F9/38
Abstract: A primary query engine may receive a query and determine whether the query is eligible for performance at a secondary query engine. If eligible, the primary query engine may evaluate the availability of the first query engine to perform the query. The first query engine may determine whether to assign the query to the primary query engine or to the secondary query according to availability evaluation. For queries assigned to the secondary query engine, the primary query engine may send a request to the secondary query engine to being processing of the query.
-
公开(公告)号:US11263184B1
公开(公告)日:2022-03-01
申请号:US16116791
申请日:2018-08-29
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Dumanshu Goyal , Lonnie J. Princehouse , Gaurav Saxena , Atilim Cetin , Gaurav Gupta , Sandeep Bhatia , Nilesh Shahdadpuri , Timothy A. Rath , Eric Coll , Nirmesh Khandelwal
IPC: G06F16/00 , G06F16/21 , G06F16/27 , G06F16/2458 , G06F16/248 , G06F16/901 , G06F16/2457
Abstract: Methods, systems, and computer-readable media for partition splitting in a distributed database are disclosed. A partition of data is split into a first sub-partition and a second sub-partition. A first portion of the data is assigned to the first sub-partition, and a second portion of the data is assigned to the second sub-partition. One or more elements of the first portion of the data from the partition and an additional one or more elements of the first portion of the data from the first sub-partition are stored into a first node. One or more elements of the second portion of the data from the partition and an additional one or more elements of the second portion of the data from the second sub-partition are stored into a second node. The partition is prevented from receiving new data after the partition is split into the first and second sub-partitions.
-
公开(公告)号:US11120052B1
公开(公告)日:2021-09-14
申请号:US16021707
申请日:2018-06-28
Applicant: Amazon Technologies, Inc.
Inventor: Mustafa Ozan Ozen , Sandeep Bhatia , Atilim Cetin , Lonnie J. Princehouse , Timothy Andrew Rath , Gaurav Saxena
IPC: G06F16/28 , G06F16/22 , G06F16/2455
Abstract: Techniques are described for clustering data at the point of ingestion for storage using scalable storage resources. To cluster data at the point of ingestion, a data ingestion and query service uses a multilevel hash tree (MLHT)-based index to map a hierarchy of attribute values associated with each data element onto a point of a MLHT (which itself conceptually maps onto a continuous range of values). The total range of the MLHT is divided into one or more data partitions, each of which is mapped to one or more physical storage resources. A mapping algorithm uses the hierarchy of attribute fields to calculate the position of each data element ingested and, consequently, a physical storage resource at which to store the data element.
-
公开(公告)号:US20200167361A1
公开(公告)日:2020-05-28
申请号:US16199102
申请日:2018-11-23
Applicant: Amazon Technologies, Inc.
Inventor: Lonnie J. Princehouse , Timothy A. Rath , Gaurav Gupta , Mustafa Ozan Ozen , Omer Ahmed Zaki , Karthik Gurumoorthy Subramanya Bharathy , Gaurav Saxena
IPC: G06F16/2455 , G06F16/27 , G06F3/06 , G06F16/248
Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
-
公开(公告)号:US12248473B1
公开(公告)日:2025-03-11
申请号:US18540496
申请日:2023-12-14
Applicant: Amazon Technologies, Inc.
Inventor: Zhengchun Liu , Gaurav Saxena , Balakrishnan Narayanaswamy , Kaihui Zheng , Mohammad Rezaur Rahman , Tim Kraska
IPC: G06F16/2453
Abstract: A future workload may be predicted for a database system using analysis of queries submitted for execution. A feature vector for a query may be determined according to a query plan for the query. If the feature vector has not been previously seen, or has not been sufficiently seen, by the database system, a machine learning inference may be used to predict performance characteristics of the query, the machine learning system trained using previous feature vectors and performance characteristics of executed queries. If the feature vector has been sufficiently seen previously by the database system, a history of performance characteristics of previous queries with similar or the same feature vector may be used to predict the performance characteristics. The predictions may then be used to configure or reconfigure processing cluster(s) of the database system.
-
28.
公开(公告)号:US20240004897A1
公开(公告)日:2024-01-04
申请号:US17810314
申请日:2022-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Ippokratis Pandis , Gokul Soundararajan , Gopal Paliwal , Punit Rajgaria , Sanuj Basu , Todd Jeffrey Green , Gaurav Saxena , Vadim Skipin , Johannes Wust , Hemanth Satyanarayana , Matthew Perry Abrams , Murali Brahmadesam
CPC classification number: G06F16/27 , G06F16/25 , G06F16/283 , G06F16/2379
Abstract: Methods for replicating transactional tables of a transactional database to an analytical database and maintaining updates to those transactional table representations are disclosed. Snapshots of the transactional tables are provided to the analytical database via a transport mechanism, such as a data storage service or a data streaming service, and stored at the analytical database. Then, checkpoints comprising portions of a change-data-capture log that has recorded transactional changes to the transactional tables of the transactional database are provided to the analytical database via the same or different transport mechanism and used to commit those transactional changes to the snapshot representations. The snapshot representations may be used to respond to incoming analytical queries in order to provide real-time querying results.
-
公开(公告)号:US11841848B2
公开(公告)日:2023-12-12
申请号:US17805676
申请日:2022-06-06
Applicant: Amazon Technologies, Inc.
Inventor: Yannis Papakonstantinou , Vuk Ercegovac , Andre Hernich , Enrico Siragusa , Gaurav Saxena
IPC: G06F16/23 , G06F16/22 , G06F16/242
CPC classification number: G06F16/2393 , G06F16/2282 , G06F16/2445
Abstract: Stored procedures are generated to perform incremental updates to a materialized view for a database. When a request to create a materialized view is received, one or more internal tables are created from the database. A stored procedure is generated that when executed will update the materialized view of the database. The stored procedure may obtain changes to the database that were not included in the internal tables and update the internal tables to include the obtained changes. The stored procedure may be performed automatically and in response to requests.
-
公开(公告)号:US11762860B1
公开(公告)日:2023-09-19
申请号:US17118288
申请日:2020-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Mohammad Rezaur Rahman , George Constantin Caragea , Raj Narayan Sett , Gaurav Saxena , Naresh Chainani , Chunbin Lin
IPC: G06F16/2455 , G06F16/2453 , G06N20/00 , G06F16/23 , G06F18/214 , G06F11/34
CPC classification number: G06F16/24568 , G06F11/3409 , G06F16/2308 , G06F16/24539 , G06F16/24542 , G06F18/214 , G06N20/00
Abstract: Database systems may dynamically management concurrency levels for performing queries. A query may be received at a database system and a memory usage for the query may be predicted. A determination may be made as to whether available memory is enough to satisfy the predicted memory usage for the query. If the available memory is enough to satisfy the predicted memory usage for the query, then an increase in a concurrency level for performing queries at the database system may be made. The query may be allowed to execute concurrently with other queries according to the increased concurrency level.
-
-
-
-
-
-
-
-
-