-
Publication number: US11669377B2
Publication date: 2023-06-06
Application number: US17680859
Filing date: 2022-02-25
Applicant: Palantir Technologies Inc.
Inventor: David Lisuk, Simon Slowik
IPC: G06F9/54, G06N20/00, G06F9/455, H04L67/133
CPC classification number: G06F9/547, G06F9/45558, G06N20/00, H04L67/133, G06F2009/45562, G06F2009/45591, G06F2009/45595
Abstract: One or more virtual machines are launched at an application platform. At each of the one or more virtual machines, a machine learning model execution environment is instantiated for an instance of a machine learning model. A respective instance of the machine learning model is loaded to each machine learning model execution environment. Each loaded instance of the machine learning model is associated with an application programming interface (API) endpoint which can receive input data for the loaded instance of the machine learning model from a client device and return output data produced by the loaded instance of the machine learning model based on the input data.
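The flow in this abstract (launch virtual machines, instantiate an execution environment on each, load a model instance, associate it with an API endpoint) can be illustrated with a minimal in-process sketch. Everything named here (`ExecutionEnvironment`, `Platform`, the `/models/<vm>/predict` path scheme, and the toy `mean_model`) is a hypothetical stand-in chosen for illustration, not the patented implementation; real virtual machines and network endpoints are replaced by plain objects and dictionary lookups.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# A "model" here is any callable from input data to output data (assumption).
Model = Callable[[List[float]], float]


@dataclass
class ExecutionEnvironment:
    """Hypothetical stand-in for a model execution environment on one VM."""
    vm_id: str
    model: Optional[Model] = None

    def load(self, model: Model) -> None:
        # Load one instance of the model into this environment.
        self.model = model

    def predict(self, inputs: List[float]) -> float:
        if self.model is None:
            raise RuntimeError(f"no model loaded on {self.vm_id}")
        return self.model(inputs)


@dataclass
class Platform:
    """Hypothetical application platform mapping endpoint paths to environments."""
    endpoints: Dict[str, ExecutionEnvironment] = field(default_factory=dict)

    def deploy(self, model: Model, vm_ids: List[str]) -> List[str]:
        paths = []
        for vm_id in vm_ids:
            env = ExecutionEnvironment(vm_id)    # instantiate environment per VM
            env.load(model)                      # load a model instance into it
            path = f"/models/{vm_id}/predict"    # illustrative path scheme
            self.endpoints[path] = env           # associate instance with endpoint
            paths.append(path)
        return paths

    def handle(self, path: str, inputs: List[float]) -> float:
        # Receive input data at an endpoint and return the model's output.
        return self.endpoints[path].predict(inputs)


def mean_model(inputs: List[float]) -> float:
    """Toy model used only for this sketch."""
    return sum(inputs) / len(inputs)


platform = Platform()
paths = platform.deploy(mean_model, ["vm-0", "vm-1"])
result = platform.handle(paths[0], [1.0, 2.0, 3.0])
```

In a real deployment the `handle` call would sit behind an HTTP or RPC server; the sketch keeps only the association between each loaded instance and its endpoint.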
-
Publication number: US11221898B2
Publication date: 2022-01-11
Application number: US16675056
Filing date: 2019-11-05
Applicant: Palantir Technologies Inc.
Inventor: David Lisuk, Guodong Xu, Luis Voloch, Matthew Elkherj
Abstract: Systems and methods for validating data in a data set. A data set including data to validate and a validator to use in validating the data is selected based on user input generated based on interactions of a user with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data is valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics is valid or invalid, is generated and selectively presented to the user through the graphical user interface.
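The validation flow described in this abstract (apply a validator to produce statistics, judge each statistic with the validator's routine, then summarize the result in a data quality report) can be sketched as follows. The `Validator` class, the `null_fraction` statistic, the 10% threshold, and the report layout are all assumptions made for illustration; the abstract does not specify them, and the graphical user interface is omitted.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Data = List[Optional[float]]


@dataclass
class Validator:
    """Hypothetical validator: computes statistics, then judges them."""
    name: str
    compute: Callable[[Data], Dict[str, float]]               # data -> statistics
    routine: Callable[[Dict[str, float]], Dict[str, bool]]    # statistics -> verdicts


def null_stats(data: Data) -> Dict[str, float]:
    # Statistic: fraction of missing values in the data.
    missing = sum(1 for value in data if value is None)
    return {"null_fraction": missing / len(data)}


def null_routine(stats: Dict[str, float]) -> Dict[str, bool]:
    # Validation routine: at most 10% missing values (illustrative threshold).
    return {"null_fraction": stats["null_fraction"] <= 0.10}


def build_report(data: Data, validator: Validator) -> Dict[str, object]:
    stats = validator.compute(data)
    verdicts = validator.routine(stats)
    return {
        "validator": validator.name,
        "statistics": stats,
        "verdicts": verdicts,
        # The data set is valid only if every statistic is valid.
        "valid": all(verdicts.values()),
    }


null_check = Validator("null_check", null_stats, null_routine)
report = build_report([1.0, None, 3.0, 4.0], null_check)
```

Here one of four values is missing, so the statistic exceeds the assumed threshold and the report marks the data set invalid.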
-
Publication number: US20200073743A1
Publication date: 2020-03-05
Application number: US16675056
Filing date: 2019-11-05
Applicant: Palantir Technologies Inc.
Inventor: David Lisuk, Guodong Xu, Luis Voloch, Matthew Elkherj
Abstract: Systems and methods for validating data in a data set. A data set including data to validate and a validator to use in validating the data is selected based on user input generated based on interactions of a user with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data is valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics is valid or invalid, is generated and selectively presented to the user through the graphical user interface.
-
Publication number: US10534595B1
Publication date: 2020-01-14
Application number: US15977666
Filing date: 2018-05-11
Applicant: PALANTIR TECHNOLOGIES INC.
Inventor: David Lisuk, Paul Gribelyuk
Abstract: Techniques for configuring and validating a data pipeline system deployment are described. In an embodiment, a template is a file or data object that describes a package of related jobs. For example, a template may describe a set of jobs necessary for deduplication of data records or a set of jobs performing machine learning on a set of data records. The template can be defined in a file, such as a JSON blob or XML file. For each job specified in the template, the template may identify a set of dataset dependencies that are needed as input for the processing of that job. For each job specified in the template, the template may further identify a set of configuration parameters needed for deployment of the job. In an embodiment, a server uses the template and the configuration parameter values collected via the GUI to generate code for the package of jobs. The code may be stored in a version control system. In an embodiment, the code may be compiled, executed, and deployed to a server for processing the data.
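As a rough illustration of the template idea in this abstract, the sketch below defines a JSON template listing jobs with their dataset dependencies and configuration parameters, then generates one line of code per job from collected parameter values. The JSON field names (`jobs`, `dataset_dependencies`, `parameters`), the `run_job` call emitted in the generated code, and the `match_threshold` parameter are hypothetical; the abstract does not specify a schema or the form of the generated code.

```python
import json
from typing import Any, Dict

# Hypothetical template: one deduplication job, one input dataset, one parameter.
TEMPLATE_JSON = """
{
  "jobs": [
    {
      "name": "dedupe_records",
      "dataset_dependencies": ["raw_records"],
      "parameters": ["match_threshold"]
    }
  ]
}
"""


def generate_code(template: Dict[str, Any], values: Dict[str, Any]) -> str:
    """Emit one illustrative `run_job(...)` line per job in the template."""
    lines = []
    for job in template["jobs"]:
        # Every configuration parameter named by the template must be supplied.
        missing = [p for p in job["parameters"] if p not in values]
        if missing:
            raise KeyError(f"missing parameters for {job['name']}: {missing}")
        inputs = ", ".join(repr(d) for d in job["dataset_dependencies"])
        parts = [repr(job["name"]), f"inputs=[{inputs}]"]
        parts += [f"{p}={values[p]!r}" for p in job["parameters"]]
        lines.append("run_job(" + ", ".join(parts) + ")")
    return "\n".join(lines)


template = json.loads(TEMPLATE_JSON)
# Values that would be collected via the GUI in the described system.
code = generate_code(template, {"match_threshold": 0.9})
```

The generated string could then be committed to a version control system, which the abstract mentions as one storage option.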
-