-
公开(公告)号:US11275608B1
公开(公告)日:2022-03-15
申请号:US16002229
申请日:2018-06-07
Applicant: Amazon Technologies, Inc.
Inventor: Vignesh Natarajan , Shreyas Yagna , Wesley Shawn Davis , Henry A. Spang , Sidharth Bajaj , Sumit Kumar , Jared Alan Bergman , Tejal Borkar , Dmytro Sukhariev
Abstract: Methods, systems, and computer-readable media for scalable job transformation and management are disclosed. A plurality of tasks expressed in a first format are received at a host. The tasks are associated with a shard identifier based at least in part on one or more criteria, and the tasks are acquired by the host based at least in part on the shard identifier. A subset of the tasks having a common characteristic are determined. The subset of the tasks are aggregated into a job expressed in a second format, where the job represents the subset of the tasks. The job is provided to a job manager, and the subset of the tasks are scheduled for processing using the job manager.
-
公开(公告)号:US10929428B1
公开(公告)日:2021-02-23
申请号:US15918989
申请日:2018-03-12
Applicant: Amazon Technologies, Inc.
Inventor: Murali Brahmadesam , Seungmin Wei , Sumit Kumar , Raman Mittal , Crosbie Matthew Smith , Kevin Liu , Aadithya Chandramalle Gowda , Ramesh Shankar
Abstract: Adaptive replication of changes may be performed for copies of a database. Log records may be generated and stored that correspond to changes to a database while a database is being copied. If the changes to be applied to a copy of the database is less than or equal to a threshold number of changes, then the copy of the database may be updated using the stored log records. If the changes to be applied to the copy of the database are greater than the threshold number of changes, then the copy of the database may be updated using data stored in the database.
-
公开(公告)号:US10338958B1
公开(公告)日:2019-07-02
申请号:US14165521
申请日:2014-01-27
Applicant: .Amazon Technologies, Inc.
Inventor: Ankit Kamboj , Peter Sirota , George Steven McPherson , Vageesh Kumar , Sumit Kumar
Abstract: An indication of an input data stream comprising data records, stored at a stream management service, that are to be batched for a computation at a batch-oriented data processing service is received. A set of data records of the input data stream are identified, based on respective sequence numbers associated with the records, for a particular iteration of the computation. Metadata associated with the particular iteration, comprising identification information associated with the set of records on which the computation is performed during the particular iteration, is saved in a repository.
-
-