COLUMN BASED DATA TRANSFER IN EXTRACT, TRANSFORM AND LOAD (ETL) SYSTEMS
    21.
    发明申请
    COLUMN BASED DATA TRANSFER IN EXTRACT, TRANSFORM AND LOAD (ETL) SYSTEMS 有权
    提取,变换和负载(ETL)系统中的基于数据的数据传输

    公开(公告)号:US20130297557A1

    公开(公告)日:2013-11-07

    申请号:US13936508

    申请日:2013-07-08

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30563

    摘要: Executing a plurality of transform stages in an extract, transform and load (ETL) job including, for each of the transform stages, receiving a plurality of input row identifiers (RIDs) corresponding to a first plurality of source database table rows in a source database table. Data is retrieved directly from a subset of the source database table columns in the first plurality of source database table rows based on the input RIDs and transform logic. Partial row data including data from the subset of the source database table columns is generated for each of the first plurality of source database table rows. Transformed data is generated based on the partial row data and to the transform logic. Output RIDs corresponding to a second plurality of rows in the source database table that include a least a subset of the transformed data are output to a downstream stage.

    摘要翻译: 在提取,变换和加载(ETL)作业中执行多个变换阶段,包括对于每个变换阶段,接收与源数据库中的第一多个源数据库表行相对应的多个输入行标识符(RID) 表。 基于输入的RID和变换逻辑,直接从第一多个源数据库表行中的源数据库表列的子集中检索数据。 对于第一多个源数据库表行中的每一个生成包括源数据库表列的子集的数据的部分行数据。 基于部分行数据和变换逻辑生成变换数据。 包含源数据库表中包含变换数据的至少一个子集的第二多行的输出RID被输出到下游阶段。

    Shipping of data through ETL stages

    公开(公告)号:US10216815B2

    公开(公告)日:2019-02-26

    申请号:US15987375

    申请日:2018-05-23

    IPC分类号: G06F17/30 H04L29/06

    摘要: Performing an extract, transform, and load (ETL) process. Column data is received by a stage of the ETL process. The size of the received column data is ascertained by the stage. In response to determining that the size of the column data exceeds a predefined threshold, the stage saves the column data and creates a data locator associated with the column data. The created data locator advances through successive downstream stages of the ETL process as a replacement for the column data.