-
Publication No.: US10769148B1
Publication Date: 2020-09-08
Application No.: US15714834
Application Date: 2017-09-25
Applicant: Amazon Technologies, Inc.
Inventor: Nathan Binkert, Mengchu Cai, Martin Grund, Maor Kleider, Michail Petropoulos, Ippokratis Pandis
IPC: G06F16/00, G06F16/2453, G06F16/2455
Abstract: Relocating data sharing operations for query processing may be implemented when generating plans to perform a query. A query operation that causes a node to share data obtained from a different set of nodes with other nodes in a same set of nodes may be identified. The identified query operation may be replaced with another operation that causes the different set of nodes to share the data directly with the other nodes in the same set of nodes.
-
Publication No.: US11567972B1
Publication Date: 2023-01-31
Application No.: US15199486
Application Date: 2016-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta, Andrew Edward Caldwell, Stavros Harizopoulos, Michail Petropoulos, Ramakrishna Kotla, John Benjamin Tobler
Abstract: A tree-based format may be implemented for data stored in a data store. A table may be maintained across one or multiple storage nodes in storage slabs. Storage slabs may be mapped to different nodes of a tree. Each node of the tree may be assigned a different range of distribution scheme values which identify what portions of the table are stored in the storage slab. Storage slabs mapped to child nodes in the tree may be assigned portions of the range of distribution scheme values assigned to a parent. Storage nodes may be added or removed for storing the table. Storage slabs may be moved from one storage node to another in order to accommodate the addition or removal of storage nodes.
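As a rough illustration of the range assignment described in this abstract, the sketch below maps storage slabs to tree nodes, gives each node a range of distribution scheme values, and hands each child a portion of its parent's range. The class name `SlabNode`, the half-open integer ranges, the even two-way split, and the storage node labels are all assumptions made for the example; the abstract does not specify them.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SlabNode:
    """Hypothetical tree node standing in for one storage slab."""
    lo: int                      # inclusive lower bound of distribution values
    hi: int                      # exclusive upper bound of distribution values
    storage_node: str            # assumed label of the storage node holding the slab
    children: List["SlabNode"] = field(default_factory=list)

    def split(self, left_storage: str, right_storage: str) -> List["SlabNode"]:
        """Assign each half of this node's range to a new child slab."""
        mid = (self.lo + self.hi) // 2
        self.children = [
            SlabNode(self.lo, mid, left_storage),
            SlabNode(mid, self.hi, right_storage),
        ]
        return self.children

    def locate(self, value: int) -> "SlabNode":
        """Return the deepest slab whose range covers the given value."""
        for child in self.children:
            if child.lo <= value < child.hi:
                return child.locate(value)
        return self


root = SlabNode(0, 16, "node-a")
# Adding a storage node: the second half of the range moves to "node-b".
root.split("node-a", "node-b")
```

Under these assumptions, moving a slab to another storage node only changes the `storage_node` label of one tree node; the ranges themselves stay fixed.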
-
Publication No.: US10095738B1
Publication Date: 2018-10-09
Application No.: US14978842
Application Date: 2015-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Edward Caldwell, Michail Petropoulos, Anurag Windlass Gupta
Abstract: A data store may dynamically assign storage locations to logical partitions according to query predicate indexes. Query predicate indexes may be maintained for individual storage locations in a data store that indicate whether storage locations store data values that do not satisfy a query predicate. The query predicate indexes may be evaluated to assign the storage locations to one or multiple logical partitions. The logical partitions may then be utilized to provide access to data stored in the data store. In some embodiments, a logical partition may be a remote storage partition that identifies data in storage locations to copy to a remote data store.
-
Publication No.: US20180285418A1
Publication Date: 2018-10-04
Application No.: US15476273
Application Date: 2017-03-31
Applicant: Amazon Technologies, Inc.
Inventor: Michail Petropoulos, Ippokratis Pandis
IPC: G06F17/30
Abstract: Queries received at a query engine may be executed for structured data and not-structured data. A query execution plan may be generated for the query that includes stateless operations to apply the query to the not-structured data at remote query processing engines. The remote query processing engines may perform the stateless operations and return results to the query engine. The query engine may generate a result for the query based on the results received from the remote query engine as well as results determined as part of applying the query to structured data. The result to the query may be returned.
-
Publication No.: US11860869B1
Publication Date: 2024-01-02
Application No.: US16457154
Application Date: 2019-06-28
Applicant: Amazon Technologies, Inc.
Inventor: Sangyong Hwang, Justin Levandoski, Yongsik Yoon, Kamal Kant Gupta, Saileshwar Krishnamurthy, Ippokratis Pandis, Michail Petropoulos
IPC: G06F16/2453, G06F16/23, G06F16/25
CPC classification number: G06F16/24542, G06F16/2379, G06F16/254
Abstract: Queries to a consistent view of a data set across different types of query engines are performed. Portions of a query received at a first type of query engine that can be performed at a different type of a query engine can be identified. A consistent view of a data set stored in a common data store can be provided by the first type of query engine to the second query engine. When the different type of query engine performs the portion of the query, the different type of query engine applies the consistent view of the data set. A result for the query can be returned by the first type of query engine based on results from the different type of query engine.
-
Publication No.: US11500931B1
Publication Date: 2022-11-15
Application No.: US15996224
Application Date: 2018-06-01
Applicant: Amazon Technologies, Inc.
Inventor: Panagiotis Parchas, Christos Faloutsos, Anurag Windlass Gupta, Stavros Harizopoulos, Michail Petropoulos
IPC: G06F16/90, G06F16/901, G06F16/2455, G06F16/2453
Abstract: Using a graph representation of join history may be performed to distribute database data. Join history may be collected, captured, or tracked which describes the history of join operations between columns of different tables in a database. A graph representation of the join history may be generated. The graph representation may indicate a likelihood of different joins that may be performed between the columns of the tables of a database. An evaluation of the join history may be performed to identify columns for tables in the database to distribute the data of the tables amongst multiple storage locations according to the identified columns.
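One way to picture the join-history graph in this abstract is as a weighted graph whose vertices are (table, column) pairs and whose edge weights count how often two columns were joined. The sketch below builds such a graph and then, for each table, picks the column with the highest total join count as its distribution column. The function names, the input format, and the greedy "most frequently joined column" rule are assumptions for illustration; the abstract does not state how the evaluation is performed.

```python
from collections import Counter, defaultdict


def build_join_graph(join_history):
    """Count how often each pair of (table, column) endpoints was joined."""
    graph = Counter()
    for left, right in join_history:
        # Sort the endpoints so (a, b) and (b, a) count as the same edge.
        edge = tuple(sorted((left, right)))
        graph[edge] += 1
    return graph


def pick_distribution_columns(graph):
    """Greedy heuristic (an assumption): per table, the most-joined column."""
    weight = defaultdict(Counter)
    for (a, b), count in graph.items():
        for table, column in (a, b):
            weight[table][column] += count
    return {table: cols.most_common(1)[0][0] for table, cols in weight.items()}


# Hypothetical join history: each entry is one observed join between two columns.
history = [
    (("orders", "customer_id"), ("customers", "id")),
    (("customers", "id"), ("orders", "customer_id")),
    (("orders", "customer_id"), ("customers", "id")),
    (("orders", "product_id"), ("products", "id")),
]
graph = build_join_graph(history)
columns = pick_distribution_columns(graph)
```

Distributing `orders` and `customers` on the columns chosen here would co-locate the rows of their most frequent join, which is the kind of outcome the abstract's evaluation step is aimed at.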
-
Publication No.: US20200050694A1
Publication Date: 2020-02-13
Application No.: US16102545
Application Date: 2018-08-13
Applicant: Amazon Technologies, Inc.
Inventor: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan
IPC: G06F17/30
Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.
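The routing rule in this abstract reduces to a single comparison against a size threshold. A minimal sketch follows, assuming the query size is available as an integer estimate; the `Query` class, the `estimated_size` field, and the engine labels are hypothetical names, and the abstract does not say how the size is measured.

```python
from dataclasses import dataclass


@dataclass
class Query:
    sql: str
    estimated_size: int  # hypothetical size measure, e.g. estimated work units


def route_query(query: Query, size_threshold: int) -> str:
    """Pick an engine: below the threshold stays on the first engine."""
    if query.estimated_size < size_threshold:
        return "first_engine"
    # Sizes greater than or equal to the threshold go to the second engine.
    return "second_engine"


small = route_query(Query("SELECT 1", estimated_size=10), size_threshold=100)
large = route_query(Query("SELECT * FROM big_table", estimated_size=100), size_threshold=100)
```

Note that a query whose size equals the threshold is sent to the second engine, matching the "greater than or equal to" wording of the abstract.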
-
Publication No.: US10318346B1
Publication Date: 2019-06-11
Application No.: US15274813
Application Date: 2016-09-23
Applicant: Amazon Technologies, Inc.
Inventor: Stavros Harizopoulos, Michail Petropoulos, Andrea Olgiati
Abstract: Data stores may implement prioritized scheduling of data store access requests. When new access requests are received, the new access requests may be scheduled for prioritized execution on processing resources. Access requests that are currently being executed with prioritized execution may be reprioritized to make additional capacity for prioritized execution of the new access requests. Prioritized execution may be automatically enabled or disabled for a data store based on monitoring of performance metrics for executing access requests.
-
Publication No.: US12013856B2
Publication Date: 2024-06-18
Application No.: US16102545
Application Date: 2018-08-13
Applicant: Amazon Technologies, Inc.
Inventor: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan
IPC: G06F16/24, G06F16/2453, G06F16/2455, G06F16/28
CPC classification number: G06F16/2455, G06F16/2453, G06F16/283, G06F16/285
Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.
-
Publication No.: US11321330B1
Publication Date: 2022-05-03
Application No.: US16140420
Application Date: 2018-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Ippokratis Pandis, Panagiotis Parchas, Michail Petropoulos, Fabian Oliver Nagel, Yannis Papakonstantinou
IPC: G06F16/00, G06F16/2458, G06F16/28, G06F16/22, G06F16/2453
Abstract: A query that references nested data may have sub-expressions included in a subquery for nested data to perform the query. A first query engine may rewrite the query to include one or more sub-expressions that reference the nested data into a subquery for the nested data as part of generating a plan to perform the query. A second query engine may then be caused to perform the subquery for the nested data. A result of the query may be inclusive of the performance of the subquery at the second query engine.