Invention Grant
- Patent Title: Update and query of a large collection of files that represent a single dataset stored on a blob store
- Application No.: US16941227
- Application Date: 2020-07-28
- Publication No.: US11308071B2
- Publication Date: 2022-04-19
- Inventor: Michael Paul Armbrust, Shixiong Zhu, Burak Yavuz
- Applicant: Databricks Inc.
- Applicant Address: US CA San Francisco
- Assignee: Databricks Inc.
- Current Assignee: Databricks Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Van Pelt, Yi & James LLP
- Main IPC: G06F16/23
- IPC: G06F16/23; G06F16/14; G06F16/22

Abstract:
A system includes an interface and a processor. The interface is configured to receive a table indication of a data table and to receive a transaction indication to perform a transaction. The processor is configured to determine a current position N in a transaction log; determine a current state of the metadata; determine a read set associated with a transaction; attempt to write an update to the transaction log associated with a next position N+1; in response to a transaction determination that a simultaneous transaction associated with the next position N+1 already exists, determine a set of updated files; and in response to a determination that there is not an overlap between the read set associated with the current transaction and the set of updated files associated with the simultaneous transaction, attempt to write the update to the transaction to the transaction log associated with a further position N+2.
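
The commit procedure in the abstract reads as a form of optimistic concurrency control, and a minimal sketch may make the retry step easier to follow. Everything named here is an assumption made for illustration and does not come from the patent text: the `TransactionLog` class, its `try_write` put-if-absent method, the in-memory dictionary standing in for a log on a blob store, and the file names. The abstract only describes moving from position N+1 to N+2; the loop below generalizes that to any number of intervening commits, which is also an assumption.

```python
from dataclasses import dataclass, field


class ConflictError(Exception):
    """Raised when a simultaneous commit updated files this transaction read."""


@dataclass
class TransactionLog:
    # Hypothetical in-memory stand-in for the log:
    # position -> set of files updated by the commit at that position.
    entries: dict = field(default_factory=dict)

    def current_position(self) -> int:
        """Return the latest committed position N (-1 if the log is empty)."""
        return max(self.entries, default=-1)

    def try_write(self, position: int, updated_files: frozenset) -> bool:
        """Put-if-absent: succeed only if nothing is committed at `position`."""
        if position in self.entries:
            return False
        self.entries[position] = updated_files
        return True


def commit(log: TransactionLog, read_set: set, updated_files: set,
           start_position: int) -> int:
    """Try to commit at N+1; on a collision, check for overlap and retry later.

    `start_position` is the position N observed when the transaction began.
    Returns the position at which the update was finally written.
    """
    target = start_position + 1
    while not log.try_write(target, frozenset(updated_files)):
        # A simultaneous transaction already holds this position.
        files_updated_by_other = log.entries[target]
        if files_updated_by_other & read_set:
            raise ConflictError(
                f"commit at position {target} updated files in the read set"
            )
        # No overlap with what we read, so try the next position.
        target += 1
    return target
```

As a usage example under the same assumptions: if a transaction starts at position 0 having read only `a.parquet`, and another writer commits an update to `b.parquet` at position 1 first, then `commit(log, {"a.parquet"}, {"c.parquet"}, 0)` lands at position 2. If the read set had instead included `b.parquet`, the call would raise `ConflictError`.
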
Public/Granted literature
- US20210011901A1 UPDATE AND QUERY OF A LARGE COLLECTION OF FILES THAT REPRESENT A SINGLE DATASET STORED ON A BLOB STORE, Public/Granted day: 2021-01-14