Finding Partition Boundaries for Parallel Processing of Markup Language Documents
    34.
    发明申请
    Finding Partition Boundaries for Parallel Processing of Markup Language Documents 有权
    查找用于并行处理标记语言文档的分区边界

    公开(公告)号:US20120079364A1

    公开(公告)日:2012-03-29

    申请号:US12893248

    申请日:2010-09-29

    IPC分类号: G06F17/27

    摘要: A method, a computer program product and a system identify partition locations within an extended markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.

    摘要翻译: 方法,计算机程序产品和系统识别扩展标记语言(XML)文档中的分区位置,而不进行解析,以便并行处理所述文档的部分。 XML文档包含保持连续性所需的部分。 文档扫描连续部分而不进行解析,初始分区的边界将被调整为驻留在连续部分之外,以确定文档的结果分区。 所得到的分区可以并行处理以提供用于存储的文档信息。

    METHOD AND APPARATUS FOR USING SET BASED STRUCTURED QUERY LANGUAGE (SQL) TO IMPLEMENT EXTRACT, TRANSFORM, AND LOAD (ETL) SPLITTER OPERATION
    38.
    发明申请
    METHOD AND APPARATUS FOR USING SET BASED STRUCTURED QUERY LANGUAGE (SQL) TO IMPLEMENT EXTRACT, TRANSFORM, AND LOAD (ETL) SPLITTER OPERATION 失效
    使用基于组合的查询语言(SQL)来实现提取,变换和加载(ETL)分离器操作的方法和装置

    公开(公告)号:US20080147707A1

    公开(公告)日:2008-06-19

    申请号:US11610480

    申请日:2006-12-13

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30454

    摘要: Methods and systems for implementing a splitter operation in an extract, transform, and load (ETL) process are provided. In one implementation, the method includes receiving a data flow including a splitter operation, and generating an execution plan graph based on the data flow. The execution plan graph includes structured query language (SQL) code for implementing the splitter operation, in which the structured query language (SQL) code is respectively executable among database servers associated with different vendors.

    摘要翻译: 提供了在提取,转换和加载(ETL)过程中实现分离器操作的方法和系统。 在一个实现中,该方法包括接收包括分离器操作的数据流,以及基于数据流生成执行计划图。 执行计划图包括用于实现分离器操作的结构化查询语言(SQL)代码,其中结构化查询语言(SQL)代码可以分别在与不同供应商相关联的数据库服务器之间执行。