-
公开(公告)号:US10133782B2
公开(公告)日:2018-11-20
申请号:US15225437
申请日:2016-08-01
Applicant: Palantir Technologies Inc.
Inventor: Huw Pryce , James Neale , Robert Fink , Jared Newman , Graham Dennis , Viktor Nordling , Artur Jonkisz , Daniel Fox , Felix de Souza , Harkirat Singh , Mark Elliot
IPC: G06F17/30
Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
-
公开(公告)号:US12079352B2
公开(公告)日:2024-09-03
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
CPC classification number: G06F21/604 , G06F16/258
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US20220198032A1
公开(公告)日:2022-06-23
申请号:US17226014
申请日:2021-04-08
Applicant: Palantir Technologies Inc.
Inventor: Anton Apostolatos , Adam Lieskovský , Florian Diegruber , Francisco Ferreira , Joseph Kane , Joanna Peller , Kelvin Lau , Maciej Laska , Mikael Ibrahim Mofarrej , Max-Philipp Schrader , Philipp Hoefer , Spencer McCollester , Viktor Nordling
Abstract: A computer-implemented method enforces data security constraints in a data pipeline. The data pipeline takes one or more source datasets as input and performs one or more data transformations on them. The method includes using data defining one or more data security constraints to configure the data pipeline to perform a data transformation on a restricted subset of entries of the source datasets. The restriction is defined by the data defining one or more data security constraints. The method further includes performing the data transformation according to the configuration to produce one or more transformed datasets. The method further includes using the data defining one or more data security constraints to perform a verification on one or more of the transformed datasets to ensure that entries in the one or more of the transformed datasets are restricted as defined by the one or more data security constraints.
-
公开(公告)号:US20200349152A1
公开(公告)日:2020-11-05
申请号:US16933688
申请日:2020-07-20
Applicant: Palantir Technologies Inc.
Inventor: HUW PRYCE , James Neale , Robert Fink , Jared Newman , Graham Dennis , Viktor Nordling , Artur Jonkisz , Daniel Fox , Felix de Souza , Harkirat Singh , Mark Elliot
IPC: G06F16/2455 , G06F16/25 , G06F16/2458
Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
-
公开(公告)号:US10776360B2
公开(公告)日:2020-09-15
申请号:US16147687
申请日:2018-09-29
Applicant: Palantir Technologies Inc.
Inventor: Huw Pryce , James Neale , Robert Fink , Jared Newman , Graham Dennis , Viktor Nordling , Artur Jonkisz , Daniel Fox , Felix de Souza , Harkirat Singh , Mark Elliot
IPC: G06F16/00 , G06F16/2455 , G06F16/25 , G06F16/2458
Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification has a source repository identifier that identifies a source repository including a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction includes a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
-
公开(公告)号:US20200012593A1
公开(公告)日:2020-01-09
申请号:US16572404
申请日:2019-09-16
Applicant: Palantir Technologies, Inc.
Inventor: Peter Maag , Jacob Albertson , Jared Newman , Matthew Lynch , Maciej Albin , Viktor Nordling
Abstract: Discussed herein are embodiments of methods and systems which allow engineers or administrators to create modular plugins which represent the logic for various fault detection tests that can be performed on data pipelines and shared among different software deployments. In some cases, the modular plugins each define a particular test to be executed against data received from the pipeline in addition to one or more configuration points. The configuration points represent configurable arguments, such as variables and/or functions, referenced by the instructions which implement the tests and that can be set according to the specific operation environment of the monitored pipeline.
-
公开(公告)号:US10417120B2
公开(公告)日:2019-09-17
申请号:US15671423
申请日:2017-08-08
Applicant: Palantir Technologies, Inc.
Inventor: Peter Maag , Jacob Albertson , Jared Newman , Matthew Lynch , Maciej Albin , Viktor Nordling
Abstract: Discussed herein are embodiments of methods and systems which allow engineers or administrators to create modular plugins which represent the logic for various fault detection tests that can be performed on data pipelines and shared among different software deployments. In some cases, the modular plugins each define a particular test to be executed against data received from the pipeline in addition to one or more configuration points. The configuration points represent configurable arguments, such as variables and/or functions, referenced by the instructions which implement the tests and that can be set according to the specific operation environment of the monitored pipeline.
-
公开(公告)号:US20170220403A1
公开(公告)日:2017-08-03
申请号:US14877229
申请日:2015-10-07
Applicant: Palantir Technologies, Inc.
Inventor: Peter Maag , Jacob Albertson , Jared Newman , Matthew Lynch , Maciej Albin , Viktor Nordling
IPC: G06F11/07
CPC classification number: G06F11/3692 , G06F11/0751 , G06F11/0775
Abstract: Discussed herein are embodiments of methods and systems which allow engineers or administrators to create modular plugins which represent the logic for various fault detection tests that can be performed on data pipelines and shared among different software deployments. In some cases, the modular plugins each define a particular test to be executed against data received from the pipeline in addition to one or more configuration points. The configuration points represent configurable arguments, such as variables and/or functions, referenced by the instructions which implement the tests and that can be set according to the specific operation environment of the monitored pipeline.
-
公开(公告)号:US11593374B2
公开(公告)日:2023-02-28
申请号:US16933688
申请日:2020-07-20
Applicant: Palantir Technologies Inc.
Inventor: Huw Pryce , James Neale , Robert Fink , Jared Newman , Graham Dennis , Viktor Nordling , Artur Jonkisz , Daniel Fox , Felix de Souza , Harkirat Singh , Mark Elliot
IPC: G06F16/00 , G06F16/2455 , G06F16/25 , G06F16/2458
Abstract: Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
-
公开(公告)号:US20210064645A1
公开(公告)日:2021-03-04
申请号:US17010187
申请日:2020-09-02
Applicant: Palantir Technologies Inc.
Inventor: Francisco Ferreira , Ryan Norris , Viktor Nordling , Kelvin Lau
IPC: G06F16/36 , G06F16/182 , G06F16/176
Abstract: A method, performed by one or more processors, is disclosed, comprising providing, to a plurality of parties permitted to communicate data via a shared database, an ontology application associated with a common core ontology, the core ontology defining constraints required to be met for producing, from one or more received datasets, one or more data objects for storing in the shared database. The ontology application may be configured to receive one or more datasets from one or more parties and to use the core database ontology to determine if the received one or more datasets conform to the constraints of the core ontology, and store the received one or more datasets as data objects in the shared database, conditional on the constraints being met.
-
-
-
-
-
-
-
-
-