-
公开(公告)号:US20230177072A1
公开(公告)日:2023-06-08
申请号:US18162625
申请日:2023-01-31
Applicant: Databricks, Inc.
Inventor: Mani Parkhe , Clemens Mewald , Matei Zaharia , Avesh Singh
CPC classification number: G06F16/288 , G06F30/27
Abstract: The present application discloses a method, system, and computer system for managing a plurality of features and storing lineage information pertaining to the features. The method includes obtaining one or more datasets, determining a first feature, wherein the first feature is determined based at least in part on the one or more datasets, and storing the first feature in a feature store. The first feature is stored in association with a dataset indication of the one or more datasets from which the first feature is determined. The feature store comprises a plurality of features.
-
公开(公告)号:US20240412095A1
公开(公告)日:2024-12-12
申请号:US18206460
申请日:2023-06-06
Applicant: Databricks, Inc.
Inventor: Matei Zaharia , Avesh Singh , Mani Parkhe , Maxim Lukiyanov , Xiangrui Meng , Aakrati Talati , Chenen Liang , Kasey Uhlenhuth
IPC: G06N20/00
Abstract: A system performs training and execution of machine learning models that use on-demand features using feature functions. The system receives commands for registering metadata associated with a machine learning model. The machine learning model may process a set of features including on-demand features as well as other features such as batch features. The system executes the command by storing an association between the machine learning model and the feature functions associated with any on-demand features processed by the machine learning model. The feature functions are executed using an end point of a data asset service. The use of the data asset service for invoking the feature functions ensures that the same set of instructions is executed during model training and model inferencing, thereby avoiding model skew.
-
公开(公告)号:US20220374457A1
公开(公告)日:2022-11-24
申请号:US17514997
申请日:2021-10-29
Applicant: Databricks Inc.
Inventor: Mani Parkhe , Clemens Mewald , Matei Zaharia , Avesh Singh
Abstract: The present application discloses a method, system, and computer system for managing a plurality of features and storing lineage information pertaining to the features. The method includes obtaining one or more datasets, determining a first feature, wherein the first feature is determined based at least in part on the one or more datasets, and storing the first feature in a feature store. The first feature is stored in association with a dataset indication of the one or more datasets from which the first feature is determined. The feature store comprises a plurality of features.
-
-