-
公开(公告)号:US11797558B2
公开(公告)日:2023-10-24
申请号:US17491985
申请日:2021-10-01
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Benjamin Albert Sowell , Bohou Li
CPC classification number: G06F16/254 , G06F16/282
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
-
2.
公开(公告)号:US10114846B1
公开(公告)日:2018-10-30
申请号:US15192945
申请日:2016-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Mehul Shah , Jakub Kulesza , James Thomas Kiraly , Benjamin Albert Sowell , Anurag Windlass Gupta
IPC: G06F17/30
Abstract: A balanced distribution of sort order values may be implemented for a multi-column sort order of a database table. Columns of the database table to be included in the multi-column sort order may be identified. Some columns containing string data values may be converted to equally-sized integer data values. The data values of columns may be evaluated to determine buckets representing the ranges of data values within the columns for depth-balanced histograms of the columns. Multi-column sort order values may be generated for individual entries in the database table according to bucket values assigned to the buckets that include the columns values of the individual entries. The entries of the database table may then be stored according to a sorted ordering of multi-column sort order values for the entries.
-
公开(公告)号:US11138220B2
公开(公告)日:2021-10-05
申请号:US15385764
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Benjamin Albert Sowell , Bohou Li
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
-
公开(公告)号:US10963479B1
公开(公告)日:2021-03-30
申请号:US15385777
申请日:2016-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Supratik Chakraborty , Anurag Windlass Gupta , Benjamin Albert Sowell
Abstract: Version controlled Extract, Transform, Load (ETL) code may be hosted for developing or executing the ETL job in an ETL system. A version of ETL code may be obtained from version controlled code store and maintained in a data store. Development or execution clients may submit access requests for the version of ETL code which may be serviced from the version stored in the data store. Updates to the version of the ETL code may be eventually committed to the version controlled code store. The latest version of ETL code may also be obtained from the version controlled code store when providing the ETL code in response to a request to retrieve the ETL code.
-
公开(公告)号:US20220100774A1
公开(公告)日:2022-03-31
申请号:US17491985
申请日:2021-10-01
Applicant: Amazon Technologies, Inc.
Inventor: Mehul A. Shah , George Steven McPherson , Prajakta Datta Damle , Gopinath Duddi , Anurag Windlass Gupta , Benjamin Albert Sowell , Bohou Li
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
-
-
-
-