Slowly changing dimension attributes in extract, transform, load processes
    1.
    发明授权
    Slowly changing dimension attributes in extract, transform, load processes 有权
    在提取,转换,加载过程中缓慢改变维属性

    公开(公告)号:US09031902B2

    公开(公告)日:2015-05-12

    申请号:US13293196

    申请日:2011-11-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30563

    摘要: A computer-implemented method, computer program product and a system for identifying and handling slowly changing dimension (SCD) attributes for use with an Extract, Transform, Load (ETL) process, comprising importing a data model for dimensional data into a data integration system, where the dimensional data comprises a plurality of attributes, identifying via a data discovery analyzer one or more attributes in the data model as SCD attributes, importing the identified SCD attributes into the data integration system, selecting a data source comprising dimensional data, automatically generating an ETL job for the dimensional data utilizing the imported SCD attributes, and executing the automatically generated ETL to extract the dimensional data from the data source and loading the dimensional data into the imported SCD attributes in a target data system.

    摘要翻译: 一种计算机实现的方法,计算机程序产品和用于识别和处理与提取,变换,加载(ETL)过程一起使用的缓慢变化的维度(SCD)属性的系统,包括将维数据的数据模型导入数据集成系统 其中尺寸数据包括多个属性,通过数据发现分析器将数据模型中的一个或多个属性识别为SCD属性,将所识别的SCD属性导入到数据集成系统中,选择包括尺寸数据的数据源,自动生成 用于使用导入的SCD属性的维数据的ETL作业,以及执行自动生成的ETL以从数据源提取尺寸数据,并将维数据加载到目标数据系统中的导入的SCD属性中。

    SLOWLY CHANGING DIMENSION ATTRIBUTES IN EXTRACT, TRANSFORM, LOAD PROCESSES
    2.
    发明申请
    SLOWLY CHANGING DIMENSION ATTRIBUTES IN EXTRACT, TRANSFORM, LOAD PROCESSES 有权
    在提取,变换,加载过程中快速更改尺寸属性

    公开(公告)号:US20130124453A1

    公开(公告)日:2013-05-16

    申请号:US13293196

    申请日:2011-11-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30563

    摘要: A computer-implemented method, computer program product and a system for identifying and handling slowly changing dimension (SCD) attributes for use with an Extract, Transform, Load (ETL) process, comprising importing a data model for dimensional data into a data integration system, where the dimensional data comprises a plurality of attributes, identifying via a data discovery analyzer one or more attributes in the data model as SCD attributes, importing the identified SCD attributes into the data integration system, selecting a data source comprising dimensional data, automatically generating an ETL job for the dimensional data utilizing the imported SCD attributes, and executing the automatically generated ETL to extract the dimensional data from the data source and loading the dimensional data into the imported SCD attributes in a target data system.

    摘要翻译: 一种计算机实现的方法,计算机程序产品和用于识别和处理与提取,变换,加载(ETL)过程一起使用的缓慢变化的维度(SCD)属性的系统,包括将维数据的数据模型导入数据集成系统 其中尺寸数据包括多个属性,通过数据发现分析器将数据模型中的一个或多个属性识别为SCD属性,将所识别的SCD属性导入到数据集成系统中,选择包括尺寸数据的数据源,自动生成 用于使用导入的SCD属性的维数据的ETL作业,以及执行自动生成的ETL以从数据源提取尺寸数据,并将维数据加载到目标数据系统中的导入的SCD属性中。

    Finding Partition Boundaries for Parallel Processing of Markup Language Documents
    4.
    发明申请
    Finding Partition Boundaries for Parallel Processing of Markup Language Documents 有权
    查找用于并行处理标记语言文档的分区边界

    公开(公告)号:US20120079364A1

    公开(公告)日:2012-03-29

    申请号:US12893248

    申请日:2010-09-29

    IPC分类号: G06F17/27

    摘要: A method, a computer program product and a system identify partition locations within an extended markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.

    摘要翻译: 方法,计算机程序产品和系统识别扩展标记语言(XML)文档中的分区位置,而不进行解析,以便并行处理所述文档的部分。 XML文档包含保持连续性所需的部分。 文档扫描连续部分而不进行解析,初始分区的边界将被调整为驻留在连续部分之外,以确定文档的结果分区。 所得到的分区可以并行处理以提供用于存储的文档信息。

    System and method for generating code for an integrated data system
    7.
    发明申请
    System and method for generating code for an integrated data system 有权
    用于生成集成数据系统代码的系统和方法

    公开(公告)号:US20070214111A1

    公开(公告)日:2007-09-13

    申请号:US11372540

    申请日:2006-03-10

    IPC分类号: G06F17/30

    摘要: A computer implemented method, apparatus, and computer usable program code for generating code for an integrated data system. A mixed data flow is received. The mixed data flow contains mixed data flow operators, which are associated with multiple runtime environments. A graph is generated containing logical operators based on the mixed data flow in response to receiving the mixed data flow. The logical operators are independent of the plurality of runtime environments. The graph is converted to a model. The logical operators are converted to model operators associated with the multiple runtime environments. The model operators allow for analysis of operations for the mixed data flow. The model is converted into an execution plan graph. The execution plan graph is executable on different runtime environments.

    摘要翻译: 一种用于生成集成数据系统的代码的计算机实现的方法,装置和计算机可用程序代码。 接收到混合数据流。 混合数据流包含与多个运行时环境相关联的混合数据流操作符。 生成包含基于混合数据流的响应于接收到混合数据流的逻辑运算符的图形。 逻辑运算符独立于多个运行时环境。 图形转换为模型。 逻辑运算符被转换为与多个运行时环境相关联的模型运算符。 模型运算符允许对混合数据流的操作进行分析。 该模型转换为执行计划图。 执行计划图可在不同的运行时环境中执行。

    System and method for a multi-level locking hierarchy in a database with multi-dimensional clustering
    8.
    发明授权
    System and method for a multi-level locking hierarchy in a database with multi-dimensional clustering 失效
    具有多维聚类的数据库中多级锁定层次结构的系统和方法

    公开(公告)号:US07236974B2

    公开(公告)日:2007-06-26

    申请号:US10425760

    申请日:2003-04-29

    IPC分类号: G06F17/30

    摘要: A multi-level locking hierarchy for a relational database includes a locking level applied to a multi-dimensionally clustering table, a locking level applied to blocks within the table, and a locking level applied to rows within the blocks. The hierarchy leverages the multi-dimensional clustering of the table data for efficiency and to reduce lock overhead. Data is normally locked in order of coarser to finer granularity to limit deadlock. When data of finer granularity is locked, data of coarser granularity containing the finer granularity data is also locked. Block lock durations may be employed to ensure that a block remains locked if any contained row remains locked. Block level lock attributes may facilitate detection of at least one of a concurrent scan and a row deletion within a block. Detection of the emptying of a block during a scan of the block may bar scan completion in that block.

    摘要翻译: 关系数据库的多级锁定层次结构包括应用于多维聚类表的锁定级别,应用于表中块的锁定级别以及应用于块内的行的锁定级别。 层次结构利用表数据的多维聚类来提高效率并减少锁定开销。 数据通常以更细和更细粒度的顺序锁定,以限制死锁。 当更细粒度的数据被锁定时,包含更细粒度数据的较粗粒度的数据也被锁定。 可以使用块锁定持续时间来确保如果任何包含的行保持锁定,则块保持锁定。 块级锁定属性可以有助于检测块内的并行扫描和行删除中的至少一个。 在块的扫描期间检测块的排空可能会阻止该块中的扫描完成。