Method and Apparatus for Loading Data Files into a Data-Warehouse System
    1.
    发明申请
    Method and Apparatus for Loading Data Files into a Data-Warehouse System 失效
    将数据文件加载到数据仓库系统中的方法和装置

    公开(公告)号:US20090172047A1

    公开(公告)日:2009-07-02

    申请号:US12171991

    申请日:2008-07-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30563

    摘要: Date-warehouse systems are populated using an enhanced Extraction-Load-Transform (ETL) process and system by employing three ideas: Out-of-order-fill ETL, relative-ordering index (ROI), and dependent queries. Out-of-order-fill ETL allows a data warehouse to accept the loading of data files in any order, and does not require the loading of any previous backup data files in order to provide some functionality to end users under the view that some functionality or data access is better than none at all. Dependent queries are processes that use defined data structures for use in constructing, extracting, and validating each record to be written in said data-warehouse system in order to ensure that referential integrity is maintained and that no orphaned data is pushed into the data warehouse. Finally, ROI is a process wherein a value is determined, based on the constraints of the source data, which indicates the relative newness of the data.

    摘要翻译: 日期仓库系统使用增强的提取 - 负载变换(ETL)流程和系统填充,采用三个想法:无序填充ETL,相对排序索引(ROI)和依赖查询。 无序填充ETL允许数据仓库以任何顺序接受数据文件的加载,并且不需要加载任何先前的备份数据文件,以便在最终用户的一些功能下提供一些功能 或数据访问比没有一个更好。 依赖查询是使用定义的数据结构用于构建,提取和验证要写入所述数据仓库系统中的每个记录的过程,以确保参照完整性得到维护,并且没有孤立数据被推入数据仓库。 最后,ROI是基于指示数据的相对新颖性的源数据的约束来确定值的过程。

    Method and apparatus for loading data files into a data-warehouse system

    公开(公告)号:US07984019B2

    公开(公告)日:2011-07-19

    申请号:US12171991

    申请日:2008-07-11

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/30563

    摘要: Date-warehouse systems are populated using an enhanced Extraction-Load-Transform (ETL) process and system by employing three ideas: Out-of-order-fill ETL, relative-ordering index (ROI), and dependent queries. Out-of-order-fill ETL allows a data warehouse to accept the loading of data files in any order, and does not require the loading of any previous backup data files in order to provide some functionality to end users under the view that some functionality or data access is better than none at all. Dependent queries are processes that use defined data structures for use in constructing, extracting, and validating each record to be written in said data-warehouse system in order to ensure that referential integrity is maintained and that no orphaned data is pushed into the data warehouse. Finally, ROI is a process wherein a value is determined, based on the constraints of the source data, which indicates the relative newness of the data.