-
1.
公开(公告)号:US10114846B1
公开(公告)日:2018-10-30
申请号:US15192945
申请日:2016-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Mehul Shah , Jakub Kulesza , James Thomas Kiraly , Benjamin Albert Sowell , Anurag Windlass Gupta
IPC: G06F17/30
Abstract: A balanced distribution of sort order values may be implemented for a multi-column sort order of a database table. Columns of the database table to be included in the multi-column sort order may be identified. Some columns containing string data values may be converted to equally-sized integer data values. The data values of columns may be evaluated to determine buckets representing the ranges of data values within the columns for depth-balanced histograms of the columns. Multi-column sort order values may be generated for individual entries in the database table according to bucket values assigned to the buckets that include the columns values of the individual entries. The entries of the database table may then be stored according to a sorted ordering of multi-column sort order values for the entries.
-
公开(公告)号:US11442931B2
公开(公告)日:2022-09-13
申请号:US16585680
申请日:2019-09-27
Applicant: Amazon Technologies, Inc.
Inventor: Anthony A. Virtuoso , Rahul Pathak , Mehul Shah , Akila Tennakoon , Jian Fang , Seth Thomas Denney , Jason Denton
IPC: G06F16/242 , G06F16/2455 , G06F16/248
Abstract: Techniques are described for an interactive query service that enables users to query data stored at a federated collection of data sources. An interactive query service provides interfaces that enable users to configure the interactive query service to query any number of heterogeneous data sources pertinent to a user. In general, the configuration of a data source can include identification of: a data source type, access configurations related to accessing the data source, and in some cases metadata describing a structure of the data stored by the data source (for example, a data catalog describing schemas, tables, columns, partitions, datatypes, or other metadata associated with the stored data). Once configured, an interactive query service can receive and execute queries that involve data stored at any combination of a user's data sources, where the queries may be expressed using a standard query language such as the Structured Query Language (SQL).
-
公开(公告)号:US10713272B1
公开(公告)日:2020-07-14
申请号:US15199505
申请日:2016-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Andrew Edward Caldwell , Anurag Windlass Gupta , Mehul Shah , Prajakta Damle , George Steven McPherson
IPC: G06F16/00 , G06F16/25 , G06F16/28 , G06F16/951 , G06F16/23
Abstract: Dynamic generation of data catalogs may be implemented for accessing data sets in different storage locations. Data sets may be accessed in order to extract portions of data. Structure recognition techniques may be applied to the extracted data in order to determine structural information for the data sets. The structural information may then be stored as part of a data catalog for the data sets. Requests to access the data catalog from different clients may be received and the requested structural data supplied so that the clients may access different data sets utilizing the supplied structural data. Data catalogs may be updated as changes to data sets are made.
-
-