-
Publication number: US12050907B2
Publication date: 2024-07-30
Application number: US18473520
Filing date: 2023-09-25
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan, Andrew Kaier, Eric Lee, Jasjit Grewal, Mark Elliot, Nitish Kulkarni, Robert Fink, Samuel Rogerson, Thomas Pearson, Thomas Powell, Lawrence Manning, Corey Garvey
CPC classification number: G06F8/71, G06F9/3885, G06F9/4494, G06F9/4881, G06F16/2329, G06F16/2358
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method receives a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method generates a task identifier for the data processing task, and provides, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method receives the second data set and metadata from the agent associated with the external system and stores the second data set and associated metadata.
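Illustrative sketch (not part of the patent record): the flow in the abstract above could look roughly like the following Python. Every name here (`Platform`, `Agent`, `TaskResult`, `handle_build_request`) is hypothetical, the external system is simulated by an in-process function, and the metadata fields are assumptions chosen only to show the shape of the exchange.

```python
import uuid
from dataclasses import dataclass
from typing import Callable

Rows = list[dict]


@dataclass
class TaskResult:
    task_id: str      # identifier generated by the platform
    rows: Rows        # the "second data set"
    metadata: dict    # describes the transformation that was applied


class Agent:
    """Hypothetical stand-in for the agent associated with the external system."""

    def __init__(self, external_tasks: dict[str, Callable[[Rows], Rows]]):
        # Maps a task name to a function simulating the external system.
        self.external_tasks = external_tasks

    def run(self, task_id: str, task_name: str, rows: Rows) -> TaskResult:
        result_rows = self.external_tasks[task_name](rows)
        metadata = {
            "task": task_name,
            "input_rows": len(rows),
            "output_rows": len(result_rows),
        }
        return TaskResult(task_id, result_rows, metadata)


class Platform:
    """Hypothetical data processing platform holding data sets in memory."""

    def __init__(self, agent: Agent):
        self.agent = agent
        self.datasets: dict[str, Rows] = {}
        self.metadata: dict[str, dict] = {}

    def handle_build_request(self, source: str, task_name: str, target: str) -> str:
        task_id = str(uuid.uuid4())           # generate a task identifier
        first_data_set = self.datasets[source]
        result = self.agent.run(task_id, task_name, first_data_set)
        if result.task_id != task_id:         # result must belong to this task
            raise ValueError("agent returned a result for a different task")
        # Store the second data set together with its metadata.
        self.datasets[target] = result.rows
        self.metadata[target] = {"task_id": task_id, **result.metadata}
        return task_id
```

A caller would register a task with the agent, place a data set in `platform.datasets`, and call `handle_build_request("raw", "double", "doubled")`; the returned identifier then links the stored output to the build request.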
-
Publication number: US20220391202A1
Publication date: 2022-12-08
Application number: US17820062
Filing date: 2022-08-16
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan, Andrew Kaier, Eric Lee, Jasjit Grewal, Mark Elliot, Nitish Kulkarni, Robert Fink, Samuel Rogerson, Thomas Pearson, Thomas Powell, Lawrence Manning, Corey Garvey
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method may comprise receiving a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method may also comprise generating a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method may also comprise receiving the second data set and metadata from the agent associated with the external system and storing the second data set and associated metadata.
-
Publication number: US11093634B1
Publication date: 2021-08-17
Application number: US16219504
Filing date: 2018-12-13
Applicant: Palantir Technologies Inc.
Inventor: Samuel Szuflita, Alice Yu, Emily Wang, Hao Dang, Megha Arora, Nicholas Gates, Samuel Rogerson
IPC: G06F16/901, G06F21/62, G06F21/60, G06F16/36, G06F16/903
Abstract: A computer system is configured to receive a data set from a data provider and automatically save the data set in a quarantine database where copying, moving, and sharing of the data set are restricted until the data set is released by a data provider. The data set is parsed to find and mark portions with potentially sensitive information. At least those parts are reviewed by a data governor, who can confirm, add, edit, or remove markers. Those parts can be visually indicated to the data governor, along with a preview of, metadata about, and analysis of the data set. After reviewing at least the automatically marked portions, the data governor can release the data set to a non-quarantine database where another user can use the data set. The user is restricted from accessing the quarantine database.
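Illustrative sketch (not part of the patent record): a minimal Python model of the quarantine workflow described in the abstract above. The class and method names (`Store`, `ingest`, `review`, `release`, `read`) and the two regular expressions are hypothetical; the abstract does not say how sensitive portions are detected, so pattern matching is used here purely as an assumption.

```python
import re
from dataclasses import dataclass, field

# Assumed detectors for "potentially sensitive information".
SENSITIVE_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "ssn_like": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


@dataclass
class QuarantinedDataSet:
    name: str
    rows: list[str]
    markers: set[tuple[int, str]] = field(default_factory=set)  # (row index, label)
    reviewed: bool = False


class Store:
    """Hypothetical store with a quarantine area and a released area."""

    def __init__(self):
        self.quarantine: dict[str, QuarantinedDataSet] = {}
        self.released: dict[str, QuarantinedDataSet] = {}

    def ingest(self, name: str, rows: list[str]) -> QuarantinedDataSet:
        # New data always lands in quarantine and is marked automatically.
        ds = QuarantinedDataSet(name, list(rows))
        for index, row in enumerate(ds.rows):
            for label, pattern in SENSITIVE_PATTERNS.items():
                if pattern.search(row):
                    ds.markers.add((index, label))
        self.quarantine[name] = ds
        return ds

    def review(self, name: str, add=(), remove=()) -> None:
        # The data governor confirms, adds, or removes markers.
        ds = self.quarantine[name]
        ds.markers |= set(add)
        ds.markers -= set(remove)
        ds.reviewed = True

    def release(self, name: str) -> None:
        # Release is refused until the governor has reviewed the markers.
        if not self.quarantine[name].reviewed:
            raise PermissionError("data set must be reviewed before release")
        self.released[name] = self.quarantine.pop(name)

    def read(self, name: str) -> list[str]:
        # Ordinary users can only reach released data sets.
        return self.released[name].rows
```

In this sketch an ordinary user calling `read` on a data set that is still quarantined gets a `KeyError`, which models the restriction on accessing the quarantine database.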
-
Publication number: US20200301701A1
Publication date: 2020-09-24
Application number: US16900071
Filing date: 2020-06-12
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan, Andrew Kaier, Eric Lee, Jasjit Grewal, Mark Elliot, Nitish Kulkarni, Robert Fink, Samuel Rogerson, Thomas Pearson, Thomas Powell, Lawrence Manning, Corey Garvey
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method may comprise receiving a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method may also comprise generating a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method may also comprise receiving the second data set and metadata from the agent associated with the external system and storing the second data set and associated metadata.
-
Publication number: US20200167151A1
Publication date: 2020-05-28
Application number: US16251578
Filing date: 2019-01-18
Applicant: Palantir Technologies Inc.
Inventor: Audrey Kuan, Andrew Kaier, Eric Lee, Jasjit Grewal, Mark Elliot, Nitish Kulkarni, Robert Fink, Samuel Rogerson, Thomas Pearson, Thomas Powell, Lawrence Manning, Corey Garvey
Abstract: An apparatus, and a method, performed by one or more processors are disclosed. The method may comprise receiving a build request associated with performing an external data processing task on a first data set, the first data set being stored in memory associated with a data processing platform to be performed at a system external to the data processing platform. The method may also comprise generating a task identifier for the data processing task, and providing, in association with the task identifier, the first data set to an agent associated with the external system with an indication of the data processing task, the agent being arranged to cause performance of the task at the external system, to receive a second data set resulting from performance of the task, and to provide the second data set and associated metadata indicative of the transformation. The method may also comprise receiving the second data set and metadata from the agent associated with the external system and storing the second data set and associated metadata.
-
Publication number: US20160253672A1
Publication date: 2016-09-01
Application number: US14726353
Filing date: 2015-05-29
Applicant: Palantir Technologies, Inc.
Inventor: Sean Hunter, Samuel Rogerson, Anirvan Mukherjee
CPC classification number: G06Q20/4016, G06Q40/06, H04L67/10
Abstract: A computer system implements a risk model for detecting outliers in a large plurality of transaction data, which can encompass millions or billions of transactions in some instances. The computing system comprises a non-transitory computer readable storage medium storing program instructions for execution by a computer processor in order to cause the computing system to receive first features for an entity in the transaction data, receive second features for a benchmark set, the second features corresponding with the first features, determine an outlier value of the entity based on a Mahalanobis distance from the first features to a benchmark value representing an average for the second features. The output of the risk model can be used to prioritize review by a human data analyst. The data analyst's review of the underlying data can be used to improve the model.
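Illustrative sketch (not part of the patent record): the outlier value in the abstract above is based on a Mahalanobis distance from an entity's features to the benchmark average. A minimal pure-Python version follows. The function names are hypothetical, and estimating the covariance matrix from the benchmark set itself is an assumption, since the abstract does not say where the covariance comes from.

```python
import math


def mean_vector(rows):
    """Column-wise average of the benchmark feature rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def covariance_matrix(rows, mu):
    """Sample covariance (divides by n - 1) of the benchmark features."""
    n, k = len(rows), len(mu)
    cov = [[0.0] * k for _ in range(k)]
    for row in rows:
        d = [row[j] - mu[j] for j in range(k)]
        for a in range(k):
            for b in range(k):
                cov[a][b] += d[a] * d[b]
    return [[v / (n - 1) for v in line] for line in cov]


def solve(a, b):
    """Solve a @ y = b by Gaussian elimination with partial pivoting."""
    k = len(b)
    m = [line[:] + [b[i]] for i, line in enumerate(a)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, k):
            factor = m[r][col] / m[col][col]
            for c in range(col, k + 1):
                m[r][c] -= factor * m[col][c]
    y = [0.0] * k
    for i in reversed(range(k)):
        tail = sum(m[i][j] * y[j] for j in range(i + 1, k))
        y[i] = (m[i][k] - tail) / m[i][i]
    return y


def mahalanobis(entity, benchmark):
    """sqrt((x - mu)^T S^-1 (x - mu)), with mu and S taken from the benchmark."""
    mu = mean_vector(benchmark)
    cov = covariance_matrix(benchmark, mu)
    d = [x - m for x, m in zip(entity, mu)]
    y = solve(cov, d)  # y = S^-1 d, computed without forming the inverse
    return math.sqrt(sum(di * yi for di, yi in zip(d, y)))
```

Entities could then be sorted by this distance in descending order so that an analyst reviews the most unusual ones first, which is how the abstract describes the output being used.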
-