-
公开(公告)号:US20170177606A1
公开(公告)日:2017-06-22
申请号:US15446917
申请日:2017-03-01
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
IPC: G06F17/30
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US09852144B2
公开(公告)日:2017-12-26
申请号:US15446917
申请日:2017-03-01
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US20180081896A1
公开(公告)日:2018-03-22
申请号:US15824096
申请日:2017-11-28
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US20190384747A1
公开(公告)日:2019-12-19
申请号:US16548803
申请日:2019-08-22
Applicant: Palantir Technologies Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
IPC: G06F16/174 , G06F11/20 , G06F17/00 , G06F16/2457 , G06F16/9535 , G06F16/23 , G06F16/901 , G06F16/25 , G06F16/248 , G06F16/14 , G06F16/35 , G06F16/17 , G06F16/13 , G06F16/10
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
5.
公开(公告)号:US09208159B2
公开(公告)日:2015-12-08
申请号:US14451221
申请日:2014-08-04
Applicant: PALANTIR TECHNOLOGIES, INC.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
Abstract translation: 提出了一种数据分析系统,用于从可能的多个异构输入数据源提供细粒度的低延迟访问大容量输入数据。 输入数据被解析,可选地变换,索引并存储在水平可扩展的键值数据存储库中,在该存储库中可以使用低延迟搜索进行访问。 输入数据可以在存储之前被压缩成块,以最小化存储要求。 搜索结果以原始形式显示输入数据。 输入数据可以包括访问日志,呼叫数据记录(CDR),电子邮件消息等。该系统允许数据分析者在大小上达到多PB的非常大的动态数据集中有效地识别感兴趣的信息。 一旦确定了感兴趣的信息,大数据集的该子集可以被导入到专门的或专门的数据分析系统中以进行进一步的深入调查和上下文分析。
-
公开(公告)号:US10423582B2
公开(公告)日:2019-09-24
申请号:US15824096
申请日:2017-11-28
Applicant: Palantir Technologies, Inc.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
IPC: G06F16/174 , G06F16/10 , G06F16/13 , G06F16/17 , G06F16/35 , G06F16/14 , G06F16/248 , G06F16/25 , G06F16/901 , G06F16/23 , G06F16/9535 , G06F16/2457 , G06F17/00 , G06F11/20
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
公开(公告)号:US09639578B2
公开(公告)日:2017-05-02
申请号:US14961830
申请日:2015-12-07
Applicant: PALANTIR TECHNOLOGIES, INC.
Inventor: Geoffrey Stowe , Chris Fischer , Paul George , Eli Bingham , Rosco Hill
CPC classification number: G06F17/30153 , G06F11/2025 , G06F17/00 , G06F17/30067 , G06F17/30091 , G06F17/30106 , G06F17/30129 , G06F17/30371 , G06F17/30528 , G06F17/30554 , G06F17/30569 , G06F17/30705 , G06F17/30867 , G06F17/30955
Abstract: A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
-
-
-
-
-
-