-
公开(公告)号:US11822542B2
公开(公告)日:2023-11-21
申请号:US17208914
申请日:2021-03-22
Applicant: Palantir Technologies Inc.
Inventor: Jeppe Hallgren , Ammaar Reshi , James Thompson
IPC: G06F16/23 , G06F16/24 , G06F16/25 , G06F16/242
CPC classification number: G06F16/2386 , G06F16/2365 , G06F16/2433 , G06F16/254
Abstract: In an embodiment, a data processing method comprises, using a distributed database system that is programmed to manage a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets: from a first dataset that is stored in the distributed database system, determining a subset of records that are candidates for propagated deletion of specified data values; determining one or more particular raw datasets that contain the subset of records; deleting the specified data values from the particular raw datasets; based on the plurality of derivation relationships and the particular raw datasets, identifying one or more particular derived datasets that have been derived from the particular raw datasets; generating and executing a build of the one or more particular derived datasets to result in creating and storing the one or more particular derived datasets without the specified data values that were deleted from the particular raw datasets; repeating the generating and executing for all derived datasets that have derivation relationships to the particular raw datasets; wherein the method is performed using one or more processors.
-
公开(公告)号:US10956406B2
公开(公告)日:2021-03-23
申请号:US15990338
申请日:2018-05-25
Applicant: Palantir Technologies Inc.
Inventor: Jeppe Hallgren , Ammaar Reshi , James Thompson
IPC: G06F16/23 , G06F16/242 , G06F16/25
Abstract: Using a distributed database system that manages a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets, a subset of records that are candidates for propagated deletion of specified data values is determined. One or more particular raw datasets that contain the subset of records is determined. The specified data values from the particular raw datasets is deleted. Based on the plurality of derivation relationships and the particular raw datasets, one or more particular derived datasets that have been derived from the particular raw datasets is identified. A build of one or more particular derived datasets to result in creating and storing one or more particular derived datasets without the specified data values deleted from the particular raw datasets is generated and executed.
-
公开(公告)号:US20240037094A1
公开(公告)日:2024-02-01
申请号:US18486039
申请日:2023-10-12
Applicant: Palantir Technologies Inc.
Inventor: Jeppe Hallgren , Ammaar Reshi , James Thompson
IPC: G06F16/23 , G06F16/242 , G06F16/25
CPC classification number: G06F16/2386 , G06F16/2365 , G06F16/2433 , G06F16/254
Abstract: A method of enabling propagated deletion in a distributed database system is disclosed. The method comprises receiving a request to delete data in a distributed database system; causing a display of a relevant dataset and a switch between applying propagated deletion or not; receiving a first selection of a subset of records from the relevant dataset using one or more filter functions and a second selection of applying propagated deletion to the subset of records; and applying propagated deletion to the subset of records to generate a new dataset.
-
公开(公告)号:US20210271670A1
公开(公告)日:2021-09-02
申请号:US17208914
申请日:2021-03-22
Applicant: Palantir Technologies Inc.
Inventor: Jeppe Hallgren , Ammaar Reshi , James Thompson
IPC: G06F16/23 , G06F16/242 , G06F16/25
Abstract: Techniques for propagation of deletion operations among a plurality of related datasets are described herein. In an embodiment, a data processing method comprises, using a distributed database system that is programmed to manage a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets: from a first dataset that is stored in the distributed database system, determining a subset of records that are candidates for propagated deletion of specified data values; determining one or more particular raw datasets that contain the subset of records; deleting the specified data values from the particular raw datasets; based on the plurality of derivation relationships and the particular raw datasets, identifying one or more particular derived datasets that have been derived from the particular raw datasets; generating and executing a build of the one or more particular derived datasets to result in creating and storing the one or more particular derived datasets without the specified data values that were deleted from the particular raw datasets; repeating the generating and executing for all derived datasets that have derivation relationships to the particular raw datasets; wherein the method is performed using one or more processors.
-
公开(公告)号:US10776382B2
公开(公告)日:2020-09-15
申请号:US15914814
申请日:2018-03-07
Applicant: Palantir Technologies Inc.
Inventor: David Meiklejohn , Jeppe Hallgren , Vitaly Pavlenko
Abstract: Systems and methods are provided for facilitating the transformation of data from a tabular data set organized according to a data schema to an object based data set organized according to a data ontology. The provided systems and methods offer a graphical user interface for mapping the tabular based data to the object based data set according to the data ontology. The tabular based data may be transformed according to the mapping.
-
公开(公告)号:US09922108B1
公开(公告)日:2018-03-20
申请号:US15398958
申请日:2017-01-05
Applicant: Palantir Technologies Inc.
Inventor: David Meiklejohn , Jeppe Hallgren , Vitaly Pavlenko
IPC: G06F17/30
CPC classification number: G06F17/30569 , G06F17/30294 , G06F17/30607
Abstract: Systems and methods are provided for facilitating the transformation of data from a tabular data set organized according to a data schema to an object based data set organized according to a data ontology. The provided systems and methods offer a graphical user interface for mapping the tabular based data to the object based data set according to the data ontology. The tabular based data may be transformed according to the mapping.
-
公开(公告)号:US12229121B2
公开(公告)日:2025-02-18
申请号:US18486039
申请日:2023-10-12
Applicant: Palantir Technologies Inc.
Inventor: Jeppe Hallgren , Ammaar Reshi , James Thompson
IPC: G06F16/23 , G06F16/24 , G06F16/242 , G06F16/25
Abstract: A method of enabling propagated deletion in a distributed database system comprises receiving a request to delete data in a distributed database system; causing a display of a relevant dataset and a switch between applying propagated deletion or not; receiving a first selection of a subset of records from the relevant dataset using one or more filter functions and a second selection of applying propagated deletion to the subset of records; and applying propagated deletion to the subset of records to generate a new dataset.
-
公开(公告)号:US20180196863A1
公开(公告)日:2018-07-12
申请号:US15914814
申请日:2018-03-07
Applicant: Palantir Technologies Inc.
Inventor: David Meiklejohn , Jeppe Hallgren , Vitaly Pavlenko
IPC: G06F17/30
CPC classification number: G06F16/258 , G06F16/212 , G06F16/289
Abstract: Systems and methods are provided for facilitating the transformation of data from a tabular data set organized according to a data schema to an object based data set organized according to a data ontology. The provided systems and methods offer a graphical user interface for mapping the tabular based data to the object based data set according to the data ontology. The tabular based data may be transformed according to the mapping.
-
-
-
-
-
-
-