-
公开(公告)号:US20200334236A1
公开(公告)日:2020-10-22
申请号:US16919951
申请日:2020-07-02
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , John McRaven , Andrew Pettit , Lucas Lemanowicz , Benedict Cappellacci , Arjun Mathur , Jonathan Victor , Nabeel Qureshi , Anshuman Prasad , Joy Tao , Mikhail Proniushkin , Casey Patton
IPC: G06F16/248 , G06F3/0482 , G06T11/20 , G06F16/26
Abstract: A system and method for processing data wherein one or more user selections of source data and an input defining one or more operations to be performed on the selected source data are received to generate processed data for display as a chart; the source data is retrieved from at least one data source, the source data is processed according to the defined one or more operations to generate processed data for output for display as a chart, the chart is stored as data defining the one or more operations and data identifying the source data operated on, a further user selection is received to redisplay the chart; retrieving the source data from the at least one data source; and the source data is processed according to the defined one or more operations to generate the processed data for output for redisplay as the chart.
-
公开(公告)号:US10423582B2
公开(公告)日:2019-09-24
申请号:US15824096
申请日:2017-11-28
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
IPC: G06F16/174 , G06F16/10 , G06F16/13 , G06F16/17 , G06F16/35 , G06F16/14 , G06F16/248 , G06F16/25 , G06F16/901 , G06F16/23 , G06F16/9535 , G06F16/2457 , G06F17/00 , G06F11/20
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US09639578B2
公开(公告)日:2017-05-02
申请号:US14961830
申请日:2015-12-07
Applicant: PALANTIR TECHNOLOGIES, INC.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US20230394054A1
公开(公告)日:2023-12-07
申请号:US18452815
申请日:2023-08-21
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , John McRaven , Andrew Pettit , Lucas Lemanowicz , Benedict Cappellacci , Arjun Mathur , Jonathan Victor , Nabeel Qureshi , Anshuman Prasad , Joy Tao , Mikhail Proniushkin , Casey Patton
IPC: G06F16/248 , G06F3/0482 , G06T11/20 , G06F16/26
CPC classification number: G06F16/248 , G06F3/0482 , G06T11/206 , G06F16/26 , G06T2200/24
Abstract: A system and method for processing data wherein one or more user selections of source data and an input defining one or more operations to be performed on the selected source data are received to generate processed data for display as a chart; the source data is retrieved from at least one data source, the source data is processed according to the defined one or more operations to generate processed data for output for display as a chart, the chart is stored as data defining the one or more operations and data identifying the source data operated on, a further user selection is received to redisplay the chart; retrieving the source data from the at least one data source; and the source data is processed according to the defined one or more operations to generate the processed data for output for redisplay as the chart.
-
公开(公告)号:US11775542B2
公开(公告)日:2023-10-03
申请号:US17662142
申请日:2022-05-05
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , John McRaven , Andrew Pettit , Lucas Lemanowicz , Benedict Cappellacci , Arjun Mathur , Jonathan Victor , Nabeel Qureshi , Anshuman Prasad , Joy Tao , Mikhail Proniushkin , Casey Patton
IPC: G06F16/248 , G06F3/0482 , G06T11/20 , G06F16/26
CPC classification number: G06F16/248 , G06F3/0482 , G06F16/26 , G06T11/206 , G06T2200/24
Abstract: A system and method for processing data wherein one or more user selections of source data and an input defining one or more operations to be performed on the selected source data are received to generate processed data for display as a chart; the source data is retrieved from at least one data source, the source data is processed according to the defined one or more operations to generate processed data for output for display as a chart, the chart is stored as data defining the one or more operations and data identifying the source data operated on, a further user selection is received to redisplay the chart; retrieving the source data from the at least one data source; and the source data is processed according to the defined one or more operations to generate the processed data for output for redisplay as the chart.
-
公开(公告)号:US10740344B2
公开(公告)日:2020-08-11
申请号:US16720813
申请日:2019-12-19
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , John McRaven , Andrew Pettit , Lucas Lemanowicz , Benedict Cappellacci , Arjun Mathur , Jonathan Victor , Nabeel Qureshi , Anshuman Prasad , Joy Tao , Mikhail Proniushkin , Casey Patton
IPC: G06F16/248 , G06F16/26 , G06F3/0482 , G06T11/20
Abstract: A system and method for processing data wherein one or more user selections of source data and an input defining one or more operations to be performed on the selected source data are received to generate processed data for display as a chart; the source data is retrieved from at least one data source, the source data is processed according to the defined one or more operations to generate processed data for output for display as a chart, the chart is stored as data defining the one or more operations and data identifying the source data operated on, a further user selection is received to redisplay the chart; retrieving the source data from the at least one data source; and the source data is processed according to the defined one or more operations to generate the processed data for output for redisplay as the chart.
-
公开(公告)号:US09852144B2
公开(公告)日:2017-12-26
申请号:US15446917
申请日:2017-03-01
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US11848760B2
公开(公告)日:2023-12-19
申请号:US17658893
申请日:2022-04-12
Applicant: Palantir Technologies Inc.
Inventor: Harkirat Singh , Geoffrey Stowe , Brendan Weickert , Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: G06Q40/00 , H04L9/40 , G06F16/2457 , G06F16/23 , G06F16/242 , G06F16/28 , G06F16/9535 , G06Q10/10 , G06Q40/02 , G06Q40/10 , G06F16/335 , G06F16/35 , G06F16/26 , G06F16/2458 , G06Q40/03 , G06Q20/40 , G06Q30/018 , G06Q40/12 , G06Q20/38
CPC classification number: H04L63/145 , G06F16/23 , G06F16/244 , G06F16/2465 , G06F16/24578 , G06F16/26 , G06F16/283 , G06F16/285 , G06F16/287 , G06F16/288 , G06F16/335 , G06F16/35 , G06F16/355 , G06F16/9535 , G06Q10/10 , G06Q20/382 , G06Q20/4016 , G06Q30/0185 , G06Q40/00 , G06Q40/02 , G06Q40/03 , G06Q40/10 , G06Q40/123
Abstract: In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
-
公开(公告)号:US20200304522A1
公开(公告)日:2020-09-24
申请号:US16898850
申请日:2020-06-11
Applicant: Palantir Technologies Inc.
Inventor: Harkirat Singh , Geoffrey Stowe , Brendan Weickert , Matthew Sprague , Michael Kross , Adam Borochoff , Parvathy Menon , Michael Harris
IPC: H04L29/06 , G06Q40/00 , G06F16/2457 , G06F16/23 , G06F16/242 , G06F16/28 , G06F16/9535 , G06Q10/10 , G06Q40/02 , G06F16/335 , G06F16/35 , G06F16/26 , G06F16/2458 , G06Q20/40 , G06Q30/00 , G06Q20/38
Abstract: In various embodiments, systems, methods, and techniques are disclosed for generating a collection of clusters of related data from a seed. Seeds may be generated based on seed generation strategies or rules. Clusters may be generated by, for example, retrieving a seed, adding the seed to a first cluster, retrieving a clustering strategy or rules, and adding related data and/or data entities to the cluster based on the clustering strategy. Various cluster scores may be generated based on attributes of data in a given cluster. Further, cluster metascores may be generated based on various cluster scores associated with a cluster. Clusters may be ranked based on cluster metascores. Various embodiments may enable an analyst to discover various insights related to data clusters, and may be applicable to various tasks including, for example, tax fraud detection, beaconing malware detection, malware user-agent detection, and/or activity trend detection, among various others.
-
公开(公告)号:US20190384747A1
公开(公告)日:2019-12-19
申请号:US16548803
申请日:2019-08-22
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
IPC: G06F16/174 , G06F11/20 , G06F17/00 , G06F16/2457 , G06F16/9535 , G06F16/23 , G06F16/901 , G06F16/25 , G06F16/248 , G06F16/14 , G06F16/35 , G06F16/17 , G06F16/13 , G06F16/10
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
-
-
-
-
-
-
-
-