Patent search ap:"Databricks Inc." Page 7

61.

发明授权
Split front end for flexible back end cluster processing 有权

公开(公告)号：US10691433B2

公开(公告)日：2020-06-23

申请号：US16119802

申请日：2018-08-31

Applicant: Databricks Inc.

Inventor： Srinath Shankar , Eric Keng-hao Liang , Gregory George Owen

IPC: G06F9/44 , G06F8/41 , G06F8/54 , G06F8/70 , G06F11/36 , G06F11/07 , G06F21/62 , G06F16/23 , G06F16/907

Abstract: A system for code development and execution includes a client interface and a client processor. The client interface is configured to receive user code for execution and receive an indication of a server that will perform the execution. The client processor is configured to parse the user code to identify one or more data items referred to during the execution. The client processor is also configured to provide the server with an inquiry for metadata regarding the one or more data items, receive the metadata regarding the one or more data items, determine a logical plan based at least in part on the metadata regarding the one or more data items; and provide the logical plan to the server for execution.

62.

发明申请
SPLIT FRONT END FOR FLEXIBLE BACK END CLUSTER PROCESSING 审中-公开

公开(公告)号：US20200073644A1

公开(公告)日：2020-03-05

申请号：US16119802

申请日：2018-08-31

Applicant: Databricks Inc.

Inventor： Srinath Shankar , Eric Keng-hao Liang , Gregory George Owen

IPC: G06F8/41 , G06F8/54 , G06F8/70 , G06F11/36 , G06F11/07 , G06F17/30 , G06F21/62

Abstract: A system for code development and execution includes a client interface and a client processor. The client interface is configured to receive user code for execution and receive an indication of a server that will perform the execution. The client processor is configured to parse the user code to identify one or more data items referred to during the execution. The client processor is also configured to provide the server with an inquiry for metadata regarding the one or more data items, receive the metadata regarding the one or more data items, determine a logical plan based at least in part on the metadata regarding the one or more data items; and provide the logical plan to the server for execution.

63.

发明授权
Independent data processing environments within a big data cluster system 有权

公开(公告)号：US09959337B2

公开(公告)日：2018-05-01

申请号：US15485952

申请日：2017-04-12

Applicant: Databricks Inc.

Inventor： Ali Ghodsi , Ion Stoica

IPC: G06F9/46 , G06F17/30 , G06F9/50

CPC classification number: G06F17/30598 , G06F9/5033 , G06F9/5072 , G06F2209/505

Abstract: A cluster system includes an interface and a processor. The interface is to receive a request from a user associated with one of a plurality of shells. The processor is to determine a plurality of tasks to respond to the request; determine a local set of data and a shared set of data for a task of the plurality of tasks, wherein the local set of data is associated with the one of the plurality of shells; and provide the task, a local set indication, and a shared set indication to a worker associated with the task, wherein the local set indication refers to the local set of data and the shared set indication refers to the shared set of data.

64.

发明授权
Cluster instance management system 有权

公开(公告)号：US09769032B1

公开(公告)日：2017-09-19

申请号：US14663748

申请日：2015-03-20

Applicant: Databricks Inc.

Inventor： Ali Ghodsi , Ion Stoica , Matei Zaharia

IPC: G06F15/16 , H04L12/24 , H04L29/08 , H04L12/26

CPC classification number: H04L41/5051 , G06F11/30 , H04L41/5096 , H04L43/0817

Abstract: A system for cluster management comprises a status monitor and an instance replacement manager. The status monitor is for monitoring status of an instance of a set of instances on a cluster provider. The instance replacement manager is for determining a replacement strategy for the instance in the event the instance does not respond. The replacement strategy for the instance is based at least in part on a management criteria for on-demand instances and spot instances on the cluster provider.

65.

发明授权
System for exploring data in a database 有权

公开(公告)号：US09760602B1

公开(公告)日：2017-09-12

申请号：US14621950

申请日：2015-02-13

Applicant: Databricks Inc.

Inventor： Ali Ghodsi , Ion Stoica , Matei Zaharia

IPC: G06F7/00 , G06F17/30

CPC classification number: G06F17/30424 , G06F17/30389

Abstract: A system for exploring data in a database comprises a query parser, a parameter manager, a query submitter, and a result formatter. The query parser is to receive a base query and determine an input parameter from the base query. The parameter manager is to provide a first request for a value for the input parameter; receive the value for the input parameter; and provide a second request for the value for the input parameter. The query submitter is to determine a first query using the base query and the value for the input parameter; and provide an indication to execute the first query. The result formatter is to receive a result associated with the indication to execute the first query.

66.

发明授权
Independent data processing environments within a big data cluster system 有权

公开(公告)号：US09659081B1

公开(公告)日：2017-05-23

申请号：US14824989

申请日：2015-08-12

Applicant: Databricks Inc.

Inventor： Ali Ghodsi , Ion Stoica

IPC: G06F9/46 , G06F17/30 , G06F9/50

CPC classification number: G06F17/30598 , G06F9/5033 , G06F9/5072 , G06F2209/505

Abstract: A cluster system includes an interface and a processor. The interface is to receive a request from a user associated with one of a plurality of shells. The processor is to determine a plurality of tasks to respond to the request; determine a local set of data and a shared set of data for a task of the plurality of tasks, wherein the local set of data is associated with the one of the plurality of shells; and provide the task, a local set indication, and a shared set indication to a worker associated with the task, wherein the local set indication refers to the local set of data and the shared set indication refers to the shared set of data.

67.

发明授权
Clean room generation for data collaboration and executing clean room task in data processing pipeline 有权

公开(公告)号：US12260003B1

公开(公告)日：2025-03-25

申请号：US18474708

申请日：2023-09-26

Applicant: Databricks, Inc.

Inventor： William Chau , Abhijit Chakankar , Stephen Michael Mahoney , Daniel Seth Morris , Itai Shlomo Weiss

IPC: G06F21/00 , G06F21/62

Abstract: A data processing service facilitates the creation and processing of data processing pipelines that process data processing jobs defined with respect to a set of tasks in a sequence and with data dependencies associated with each separate task such that the output from one task is used as input for a subsequent task. In various embodiments, the set of tasks include at least one cleanroom task that is executed in a cleanroom station and at least one non-cleanroom task executed in an execution environment of a user where each task is configured to read one or more input datasets and transform the one or more input datasets into one or more output datasets.

68.

发明申请
K-D Tree Balanced Splitting 有权

公开(公告)号：US20250086155A1

公开(公告)日：2025-03-13

申请号：US18772758

申请日：2024-07-15

Applicant: Databricks, Inc.

Inventor： Bart Samwel , Prakhar Jain

IPC: G06F16/22 , G06F16/28

Abstract: A system for clustering data into corresponding files comprises one or more processors and a memory. The one or more processors is/are configured to: 1) determine to cluster a set of data into a set of files; 2) determine a set of split points in a corresponding set of dimensions of the set of data to determine the set of files, wherein each file of the set of files has an approximate target size; and 3) store one or more items of the set of data into a corresponding file of the set of files based at least in part on the set of split points. The memory is coupled to the one or more processors and configured to provide the processor with instructions.

69.

发明授权
Clustering key selection based on machine-learned key selection models for data processing service 有权

公开(公告)号：US12229169B1

公开(公告)日：2025-02-18

申请号：US18501830

申请日：2023-11-03

Applicant: Databricks, Inc.

Inventor： Terry Kim , Lin Ma , Rahul Shivu Mahadev , Rahul Potharaju

IPC: G06F16/28 , G06F16/21 , G06F16/22

Abstract: The disclosed configurations provide a method (and/or a computer-readable medium or system) for determining, from a table schema describing keys of a data table, one or more clustering keys that can be used to cluster data files of a data table. The method includes generating features for the data table, generating tokens from the features, generating a prediction for each token by applying to the token a machine-learned transformer model trained to predict a likelihood that the key associated with the token is a clustering key for the data table, determining clustering keys based on the predictions, and clustering data records of the data table into data files based on key-values for the clustering keys.

70.

发明授权
Checkpoint and restore based startup of executor nodes of a distributed computing engine for processing queries 有权

公开(公告)号：US12229137B1

公开(公告)日：2025-02-18

申请号：US18412438

申请日：2024-01-12

Applicant: Databricks, Inc.

Inventor： Xinyang Ge , Lixiang Ao , Haonan Jing , Aaron Daniel Davidson

IPC: G06F16/2453

Abstract: A system performs efficient startup of executors of a distributed computing engine used for processing queries, for example, database queries. The system starts an executor node and processes a set of queries using the executor node to warm up the executor node. The system performs a checkpoint of the warmed-up executor node to create an image. The image is restored in the target executor nodes. The system may store a checkpoint image for each configuration of an executor node. The configuration is determined based on various factors including the hardware of the executor node, memory allocation of the processes, and so on. The user or restore based on checkpoint images improves efficiency of execution of the startup of executor nodes.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification