-
公开(公告)号:US20160314155A1
公开(公告)日:2016-10-27
申请号:US14693792
申请日:2015-04-22
Applicant: Palantir Technologies Inc.
Inventor: Timothy Wilson , Brian Dorne , Christopher Lockfort , Thomas Kleingarn
IPC: G06F17/30
CPC classification number: G06F16/2272 , G06F16/1774 , G06F16/254 , G06F16/951
Abstract: A data integration pipeline is configured to receive and integrate raw data into a data processing system. Raw data may be defined as an electronic collection of data composed of information from multiple records, whose primary relationship to each other is their shared origin from a single or multiple databases. In integrating the raw data, the data integration pipeline may split the integration into two phases, more specifically: an ingest phase, and an import phase.
Abstract translation: 数据集成流水线被配置为接收并将原始数据集成到数据处理系统中。 原始数据可以被定义为由来自多个记录的信息组成的数据的电子收集,它们彼此的主要关系是它们来自单个或多个数据库的共享来源。 在整合原始数据时,数据集成管道可将集成分为两个阶段,更具体地说就是:进入阶段和导入阶段。