-
公开(公告)号:US10942627B2
公开(公告)日:2021-03-09
申请号:US16660603
申请日:2019-10-22
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Benjamin Duffield , Mark Elliot
IPC: G06F3/0482 , G06F3/0481 , G06N20/00
Abstract: In various example embodiments, a comparative modeling system is configured to receive selections of a data set, a transform scheme, and one or more machine-learning algorithms. In response to a selection of the one or more machine-learning algorithms, the comparative modeling system determines parameters within the one or more machine-learning algorithms. The comparative modeling system generates a plurality of models for the one or more machine-learning algorithms, determines comparison metric values for the plurality of models, and causes presentation of the comparison metric values for the plurality of models.
-
公开(公告)号:US12093279B2
公开(公告)日:2024-09-17
申请号:US18465089
申请日:2023-09-11
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Adam Borochoff , Jared Newman , Joseph Rafidi
CPC classification number: G06F16/26 , G06F16/212 , G06F16/221 , G06F16/2282 , G06F16/258 , G06F16/27
Abstract: A method comprises creating metadata identifying columns of tables and column operations of one or more data transforms of the columns in a data pipeline and including links to code segments in human-readable form corresponding to the one or more data transforms; executing a build job that effects the one or more data transforms on one or more datasets to generate one or more derived datasets; causing, after the executing, a presentation of a graphical user interface (GUI) including a graphical representation of the one or more data transforms based on the metadata, wherein the method is performed by one or more processors.
-
公开(公告)号:US11755614B2
公开(公告)日:2023-09-12
申请号:US17727578
申请日:2022-04-22
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Adam Borochoff , Jared Newman , Joseph Rafidi
CPC classification number: G06F16/26 , G06F16/212 , G06F16/221 , G06F16/2282 , G06F16/258 , G06F16/27
Abstract: Techniques for propagation of deletion operations among a plurality of related datasets are described herein. In an embodiment, a data processing method comprises, using a distributed database system that is programmed to manage a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets: from a first dataset that is stored in the distributed database system, determining a subset of records that are candidates for propagated deletion of specified data values; determining one or more particular raw datasets that contain the subset of records; deleting the specified data values from the particular raw datasets; based on the plurality of derivation relationships and the particular raw datasets, identifying one or more particular derived datasets that have been derived from the particular raw datasets; generating and executing a build of the one or more particular derived datasets to result in creating and storing the one or more particular derived datasets without the specified data values that were deleted from the particular raw datasets; repeating the generating and executing for all derived datasets that have derivation relationships to the particular raw datasets; wherein the method is performed using one or more processors.
-
公开(公告)号:US20220058163A1
公开(公告)日:2022-02-24
申请号:US17463345
申请日:2021-08-31
Applicant: PALANTIR TECHNOLOGIES INC.
Inventor: Robert Fink , Lynn Cuthriell , Adam Anderson , Adam Borochoff , Catherine Lu , Joseph Rafidi , Karanveer Mohan , Matthew Jenny , Matthew Maclean , Michelle Guo , Parvathy Menon , Ryan Rowe
IPC: G06F16/18 , G06F16/182 , G06F16/21 , G06F16/23
Abstract: A computer-implemented system and method for data revision control in a large-scale data analytic systems. In one embodiment, for example, a computer-implemented method comprises the operations of storing a first version of a dataset that is derived by executing a first version of driver program associated with the dataset; and storing a first build catalog entry comprising an identifier of the first version of the dataset and comprising an identifier of the first version of the driver program.
-
公开(公告)号:US20200050329A1
公开(公告)日:2020-02-13
申请号:US16660603
申请日:2019-10-22
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Benjamin Duffield , Mark Elliot
IPC: G06F3/0482 , G06F3/0481 , G06N20/00
Abstract: In various example embodiments, a comparative modeling system is configured to receive selections of a data set, a transform scheme, and one or more machine-learning algorithms. In response to a selection of the one or more machine-learning algorithms, the comparative modeling system determines parameters within the one or more machine-learning algorithms. The comparative modeling system generates a plurality of models for the one or more machine-learning algorithms, determines comparison metric values for the plurality of models, and causes presentation of the comparison metric values for the plurality of models.
-
公开(公告)号:US10552002B1
公开(公告)日:2020-02-04
申请号:US15655408
申请日:2017-07-20
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Benjamin Duffield , Mark Elliot
IPC: G06F3/0482 , G06N99/00 , G06F3/0481 , G06N20/00
Abstract: In various example embodiments, a comparative modeling system is configured to receive selections of a data set, a transform scheme, and one or more machine-learning algorithms. In response to a selection of the one or more machine-learning algorithms, the comparative modeling system determines parameters within the one or more machine-learning algorithms. The comparative modeling system generates a plurality of models for the one or more machine-learning algorithms, determines comparison metric values for the plurality of models, and causes presentation of the comparison metric values for the plurality of models.
-
公开(公告)号:US10007674B2
公开(公告)日:2018-06-26
申请号:US15262207
申请日:2016-09-12
Applicant: Palantir Technologies, Inc.
Inventor: Robert Fink , Lynn Cuthriell , Adam Anderson , Adam Borochoff , Catherine Lu , Joseph Rafidi , Karanveer Mohan , Matthew Jenny , Matthew Maclean , Michelle Guo , Parvathy Menon , Ryan Rowe
IPC: G06F17/30
CPC classification number: G06F16/1873 , G06F16/182 , G06F16/219 , G06F16/2379
Abstract: A computer-implemented system and method for data revision control in a large-scale data analytic systems. In one embodiment, for example, a computer-implemented method comprises the operations of storing a first version of a dataset that is derived by executing a first version of driver program associated with the dataset; and storing a first build catalog entry comprising an identifier of the first version of the dataset and comprising an identifier of the first version of the driver program.
-
-
-
-
-
-