-
公开(公告)号:US09965937B2
公开(公告)日:2018-05-08
申请号:US14473920
申请日:2014-08-29
Applicant: Palantir Technologies Inc.
Inventor: David Cohen , Jason Ma , Bing Jie Fu , Ilya Nepomnyashchiy , Steven Berler , Alex Smaliy , Jack Grossman , James Thompson , Julia Boortz , Matthew Sprague , Parvathy Menon , Michael Kross , Michael Harris , Adam Borochoff
CPC classification number: G08B21/18 , G06F3/04842 , H04L63/0281 , H04L63/1433 , H04L63/145
Abstract: Embodiments of the present disclosure relate to a data analysis system that may automatically generate memory-efficient clustered data structures, automatically analyze those clustered data structures, and provide results of the automated analysis in an optimized way to an analyst. The automated analysis of the clustered data structures (also referred to herein as data clusters) may include an automated application of various criteria or rules so as to generate a compact, human-readable analysis of the data clusters. The human-readable analyzes (also referred to herein as “summaries” or “conclusions”) of the data clusters may be organized into an interactive user interface so as to enable an analyst to quickly navigate among information associated with various data clusters and efficiently evaluate those data clusters in the context of, for example, a fraud investigation. Embodiments of the present disclosure also relate to automated scoring of the clustered data structures.
-
公开(公告)号:US08712906B1
公开(公告)日:2014-04-29
申请号:US13968213
申请日:2013-08-15
Applicant: Palantir Technologies, Inc.
Inventor: Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: G06Q40/00
Abstract: Techniques are disclosed for prioritizing a plurality of clusters. Prioritizing clusters may generally include identifying a scoring strategy for prioritizing the plurality of clusters. Each cluster is generated from a seed and stores a collection of data retrieved using the seed. For each cluster, elements of the collection of data stored by the cluster are evaluated according to the scoring strategy and a score is assigned to the cluster based on the evaluation. The clusters may be ranked according to the respective scores assigned to the plurality of clusters. The collection of data stored by each cluster may include financial data evaluated by the scoring strategy for a risk of fraud. The score assigned to each cluster may correspond to an amount at risk.
-
公开(公告)号:US12238136B2
公开(公告)日:2025-02-25
申请号:US18504392
申请日:2023-11-08
Applicant: Palantir Technologies Inc.
Inventor: Harkirat Singh , Geoffrey Stowe , Stefan Bach , Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: G06Q40/00 , G06F16/23 , G06F16/242 , G06F16/2457 , G06F16/2458 , G06F16/26 , G06F16/28 , G06F16/335 , G06F16/35 , G06F16/355 , G06F16/9535 , G06Q10/10 , G06Q20/38 , G06Q20/40 , G06Q30/018 , G06Q40/02 , G06Q40/03 , G06Q40/10 , G06Q40/12 , H04L9/40
Abstract: In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
-
公开(公告)号:US12197514B2
公开(公告)日:2025-01-14
申请号:US18239690
申请日:2023-08-29
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Adam Borochoff , Joseph Rafidi , Matthew Jenny , Parvathy Menon , Ryan Rowe
IPC: G06F16/904 , G06F3/0484 , G06F16/21 , G06F16/23 , G06F21/60 , G06T11/20
Abstract: Systems and methods are provided for creating and managing a data integration workspace. The workspace may comprise one or more views of data (or datasets) stored in or accessible by the system. Models may be generated and updated based on the plurality of datasets and presented via a graphical user interface. Feedback received via a graphical user interface presenting a model may be used to annotate an underlying dataset associated with the model. Responsive to a modification of the underlying dataset or the rules for using the underlying dataset to generate the model, other related datasets and/or models may be automatically updated accordingly. Templates associated with one or more types of users may be defined. Each template may comprise one or more specific models related to a specific type of user.
-
公开(公告)号:US20240146761A1
公开(公告)日:2024-05-02
申请号:US18504392
申请日:2023-11-08
Applicant: Palantir Technologies Inc.
Inventor: Harkirat Singh , Geoffrey Stowe , Stefan Bach , Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: H04L9/40 , G06F16/23 , G06F16/242 , G06F16/2457 , G06F16/2458 , G06F16/26 , G06F16/28 , G06F16/335 , G06F16/35 , G06F16/9535 , G06Q10/10 , G06Q20/38 , G06Q20/40 , G06Q30/018 , G06Q40/00 , G06Q40/02 , G06Q40/03 , G06Q40/10 , G06Q40/12
CPC classification number: H04L63/145 , G06F16/23 , G06F16/244 , G06F16/24578 , G06F16/2465 , G06F16/26 , G06F16/283 , G06F16/285 , G06F16/287 , G06F16/288 , G06F16/335 , G06F16/35 , G06F16/355 , G06F16/9535 , G06Q10/10 , G06Q20/382 , G06Q20/4016 , G06Q30/0185 , G06Q40/00 , G06Q40/02 , G06Q40/03 , G06Q40/10 , G06Q40/123
Abstract: In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
-
公开(公告)号:US20230333888A1
公开(公告)日:2023-10-19
申请号:US17826972
申请日:2022-05-27
Applicant: Palantir Technologies Inc.
Inventor: Adam Borochoff , John Mathews , Joseph Rafidi , James Thompson , Kamran Khan , Morten Telling , Parvathy Menon , Patrick Szmucer , Robert Kruszewski , Rahij Ramsharan , Katherine Ketsdever
CPC classification number: G06F9/4881 , G06F11/3495
Abstract: Computing systems methods, and non-transitory storage media are provided for retrieving information regarding an operation to be performed by a platform, performing a preliminary validation of the operation, generating details regarding the preliminary validation, transmitting at least a subset of the details of the preliminary validation to the platform, and populating the generated details on an interface. If the preliminary validation fails, the platform refrains from performing the operation. Furthermore, the logic describing the operation can be executed on different platforms and is not bound or limited to one platform.
-
公开(公告)号:US11336681B2
公开(公告)日:2022-05-17
申请号:US16898850
申请日:2020-06-11
Applicant: Palantir Technologies Inc.
Inventor: Harkirat Singh , Geoffrey Stowe , Brendan Weickert , Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: G06Q40/00 , H04L29/06 , G06F16/2457 , G06F16/23 , G06F16/242 , G06F16/28 , G06F16/9535 , G06Q10/10 , G06Q40/02 , G06F16/335 , G06F16/35 , G06F16/26 , G06F16/2458 , G06Q20/40 , G06Q30/00 , G06Q20/38
Abstract: In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
-
公开(公告)号:US11314769B2
公开(公告)日:2022-04-26
申请号:US16014005
申请日:2018-06-21
Applicant: Palantir Technologies Inc.
Inventor: Matthew MacLean , Adam Borochoff , Jared Newman , Joseph Rafidi
Abstract: Techniques for propagation of deletion operations among a plurality of related datasets are described herein. In an embodiment, a data processing method comprises, using a distributed database system that is programmed to manage a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets: from a first dataset that is stored in the distributed database system, determining a subset of records that are candidates for propagated deletion of specified data values; determining one or more particular raw datasets that contain the subset of records; deleting the specified data values from the particular raw datasets; based on the plurality of derivation relationships and the particular raw datasets, identifying one or more particular derived datasets that have been derived from the particular raw datasets; generating and executing a build of the one or more particular derived datasets to result in creating and storing the one or more particular derived datasets without the specified data values that were deleted from the particular raw datasets; repeating the generating and executing for all derived datasets that have derivation relationships to the particular raw datasets; wherein the method is performed using one or more processors.
-
公开(公告)号:US20210209158A1
公开(公告)日:2021-07-08
申请号:US17210274
申请日:2021-03-23
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Adam Borochoff , Joseph Rafidi , Matthew Jenny , Parvathy Menon , Ryan Rowe
IPC: G06F16/904 , G06T11/20 , G06F3/0484 , G06F21/60 , G06F16/23 , G06F16/21
Abstract: Systems and methods are provided for creating and managing a data integration workspace. The workspace may comprise one or more views of data (or datasets) stored in or accessible by the system. Models may be generated and updated based on the plurality of datasets and presented via a graphical user interface. Feedback received via a graphical user interface presenting a model may be used to annotate an underlying dataset associated with the model. Responsive to a modification of the underlying dataset or the rules for using the underlying dataset to generate the model, other related datasets and/or models may be automatically updated accordingly. Templates associated with one or more types of users may be defined. Each template may comprise one or more specific models related to a specific type of user.
-
公开(公告)号:US10956508B2
公开(公告)日:2021-03-23
申请号:US15956600
申请日:2018-04-18
Applicant: Palantir Technologies Inc.
Inventor: Matthew Maclean , Adam Borochoff , Joseph Rafidi , Matthew Jenny , Parvathy Menon , Ryan Rowe
IPC: G06F16/904 , G06T11/20 , G06F3/0484 , G06F21/60 , G06F16/23 , G06F16/21
Abstract: Systems and methods are provided for creating and managing a data integration workspace. The workspace may comprise one or more views of data (or datasets) stored in or accessible by the system. Models may be generated and updated based on the plurality of datasets and presented via a graphical user interface. Feedback received via a graphical user interface presenting a model may be used to annotate an underlying dataset associated with the model. Responsive to a modification of the underlying dataset or the rules for using the underlying dataset to generate the model, other related datasets and/or models may be automatically updated accordingly. Templates associated with one or more types of users may be defined. Each template may comprise one or more specific models related to a specific type of user.
-
-
-
-
-
-
-
-
-