-
Publication number: US09769032B1
Publication date: 2017-09-19
Application number: US14663748
Application date: 2015-03-20
Applicant: Databricks Inc.
Inventor: Ali Ghodsi , Ion Stoica , Matei Zaharia
CPC classification number: H04L41/5051 , G06F11/30 , H04L41/5096 , H04L43/0817
Abstract: A system for cluster management comprises a status monitor and an instance replacement manager. The status monitor is for monitoring status of an instance of a set of instances on a cluster provider. The instance replacement manager is for determining a replacement strategy for the instance in the event the instance does not respond. The replacement strategy for the instance is based at least in part on a management criteria for on-demand instances and spot instances on the cluster provider.
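The abstract above does not say what the management criteria are. As a minimal sketch, assume a hypothetical criterion: keep at least `min_on_demand` healthy on-demand instances, and replace any other unresponsive instance with a spot instance. The names `Instance`, `plan_replacements`, and `min_on_demand` are illustrative and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Instance:
    instance_id: str
    kind: str         # "on_demand" or "spot"
    responsive: bool  # result of the latest status check


def plan_replacements(cluster, min_on_demand):
    """Map each unresponsive instance id to the kind of its replacement.

    Hypothetical criterion: top up on-demand capacity until the floor
    `min_on_demand` is met, then fall back to spot instances.
    """
    healthy_on_demand = sum(
        1 for inst in cluster if inst.responsive and inst.kind == "on_demand"
    )
    plan = {}
    for inst in cluster:
        if inst.responsive:
            continue
        if healthy_on_demand < min_on_demand:
            plan[inst.instance_id] = "on_demand"
            healthy_on_demand += 1  # the replacement counts toward the floor
        else:
            plan[inst.instance_id] = "spot"
    return plan
```

A real system would also weigh factors such as spot pricing and availability, which this sketch leaves out.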
-
Publication number: US09760602B1
Publication date: 2017-09-12
Application number: US14621950
Application date: 2015-02-13
Applicant: Databricks Inc.
Inventor: Ali Ghodsi , Ion Stoica , Matei Zaharia
CPC classification number: G06F17/30424 , G06F17/30389
Abstract: A system for exploring data in a database comprises a query parser, a parameter manager, a query submitter, and a result formatter. The query parser is to receive a base query and determine an input parameter from the base query. The parameter manager is to provide a first request for a value for the input parameter; receive the value for the input parameter; and provide a second request for the value for the input parameter. The query submitter is to determine a first query using the base query and the value for the input parameter; and provide an indication to execute the first query. The result formatter is to receive a result associated with the indication to execute the first query.
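The parser, parameter manager, and submitter roles described in the abstract above can be illustrated with a small sketch. It assumes input parameters are written as `:name` placeholders and uses Python's built-in `sqlite3` named-parameter binding; the placeholder syntax and the function names are assumptions, not details from the patent.

```python
import re
import sqlite3

# Assumed placeholder syntax: a colon followed by an identifier, e.g. ":city".
# This simple pattern does not skip colons inside SQL string literals.
PARAM = re.compile(r":([A-Za-z_]\w*)")


def find_parameters(base_query):
    """Query-parser role: list the distinct input parameters, in order."""
    seen = []
    for name in PARAM.findall(base_query):
        if name not in seen:
            seen.append(name)
    return seen


def run_query(conn, base_query, values):
    """Query-submitter role: bind the supplied values and execute."""
    missing = [p for p in find_parameters(base_query) if p not in values]
    if missing:
        raise KeyError(f"missing values for: {missing}")
    return conn.execute(base_query, values).fetchall()
```

Calling `run_query` again with a different value for the same parameter corresponds to the second request in the abstract: the base query stays fixed and only the bound value changes.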
-
Publication number: US20240412095A1
Publication date: 2024-12-12
Application number: US18206460
Application date: 2023-06-06
Applicant: Databricks, Inc.
Inventor: Matei Zaharia , Avesh Singh , Mani Parkhe , Maxim Lukiyanov , Xiangrui Meng , Aakrati Talati , Chenen Liang , Kasey Uhlenhuth
IPC: G06N20/00
Abstract: A system performs training and execution of machine learning models that use on-demand features using feature functions. The system receives commands for registering metadata associated with a machine learning model. The machine learning model may process a set of features including on-demand features as well as other features such as batch features. The system executes the command by storing an association between the machine learning model and the feature functions associated with any on-demand features processed by the machine learning model. The feature functions are executed using an end point of a data asset service. The use of the data asset service for invoking the feature functions ensures that the same set of instructions is executed during model training and model inferencing, thereby avoiding model skew.
-
Publication number: US20220374457A1
Publication date: 2022-11-24
Application number: US17514997
Application date: 2021-10-29
Applicant: Databricks Inc.
Inventor: Mani Parkhe , Clemens Mewald , Matei Zaharia , Avesh Singh
Abstract: The present application discloses a method, system, and computer system for managing a plurality of features and storing lineage information pertaining to the features. The method includes obtaining one or more datasets, determining a first feature, wherein the first feature is determined based at least in part on the one or more datasets, and storing the first feature in a feature store. The first feature is stored in association with a dataset indication of the one or more datasets from which the first feature is determined. The feature store comprises a plurality of features.
-
Publication number: US20230141556A1
Publication date: 2023-05-11
Application number: US17976361
Application date: 2022-10-28
Applicant: Databricks, Inc.
Inventor: Michael Paul Armbrust , Tathagata Das , Shi Xin , Matei Zaharia
IPC: G06F16/2453 , G06F16/2455
CPC classification number: G06F16/24542 , G06F16/24568
Abstract: A system for executing a streaming query includes an interface and a processor. The interface is configured to receive a logical query plan. The processor is configured to determine a physical query plan based at least in part on the logical query plan. The physical query plan comprises an ordered set of operators. Each operator of the ordered set of operators comprises an operator input mode and an operator output mode. The processor is further configured to execute the physical query plan using the operator input mode and the operator output mode for each operator of the query.
-
Publication number: US11514045B2
Publication date: 2022-11-29
Application number: US16721402
Application date: 2019-12-19
Applicant: Databricks Inc.
Inventor: Michael Paul Armbrust , Tathagata Das , Shi Xin , Matei Zaharia
IPC: G06F16/2453 , G06F16/2455
Abstract: A system for executing a streaming query includes an interface and a processor. The interface is configured to receive a logical query plan. The processor is configured to determine a physical query plan based at least in part on the logical query plan. The physical query plan comprises an ordered set of operators. Each operator of the ordered set of operators comprises an operator input mode and an operator output mode. The processor is further configured to execute the physical query plan using the operator input mode and the operator output mode for each operator of the query.
-
Publication number: US20200257689A1
Publication date: 2020-08-13
Application number: US16721402
Application date: 2019-12-19
Applicant: Databricks Inc.
Inventor: Michael Paul Armbrust , Tathagata Das , Shi Xin , Matei Zaharia
IPC: G06F16/2453 , G06F16/2455
Abstract: A system for executing a streaming query includes an interface and a processor. The interface is configured to receive a logical query plan. The processor is configured to determine a physical query plan based at least in part on the logical query plan. The physical query plan comprises an ordered set of operators. Each operator of the ordered set of operators comprises an operator input mode and an operator output mode. The processor is further configured to execute the physical query plan using the operator input mode and the operator output mode for each operator of the query.
-
Publication number: US10361928B2
Publication date: 2019-07-23
Application number: US15682397
Application date: 2017-08-21
Applicant: Databricks Inc.
Inventor: Ali Ghodsi , Ion Stoica , Matei Zaharia
IPC: G06F15/173 , H04L12/24 , G06F11/30 , H04L12/26
Abstract: A system for cluster management comprises a status monitor and an instance replacement manager. The status monitor is for monitoring status of an instance of a set of instances on a cluster provider. The instance replacement manager is for determining a replacement strategy for the instance in the event the instance does not respond. The replacement strategy for the instance is based at least in part on a management criteria for on-demand instances and spot instances on the cluster provider.
-
Publication number: US10095735B2
Publication date: 2018-10-09
Application number: US15675619
Application date: 2017-08-11
Applicant: Databricks Inc.
Inventor: Ali Ghodsi , Ion Stoica , Matei Zaharia
Abstract: A system for exploring data in a database comprises a query parser, a parameter manager, a query submitter, and a result formatter. The query parser is to receive a base query and determine an input parameter from the base query. The parameter manager is to provide a first request for a value for the input parameter; receive the value for the input parameter; and provide a second request for the value for the input parameter. The query submitter is to determine a first query using the base query and the value for the input parameter; and provide an indication to execute the first query. The result formatter is to receive a result associated with the indication to execute the first query.
-
Publication number: US20250086177A1
Publication date: 2025-03-13
Application number: US18745847
Application date: 2024-06-17
Applicant: Databricks, Inc.
Inventor: Michael Paul Armbrust , Tathagata Das , Shi Xin , Matei Zaharia
IPC: G06F16/2453 , G06F16/2455
Abstract: A system for executing a streaming query includes an interface and a processor. The interface is configured to receive a logical query plan. The processor is configured to determine a physical query plan based at least in part on the logical query plan. The physical query plan comprises an ordered set of operators. Each operator of the ordered set of operators comprises an operator input mode and an operator output mode. The processor is further configured to execute the physical query plan using the operator input mode and the operator output mode for each operator of the query.
-