TECHNIQUES FOR EFFICIENT AND SCALABLE PROCESSING OF COMPLEX SETS OF XML SCHEMAS
    81.
    发明申请
    TECHNIQUES FOR EFFICIENT AND SCALABLE PROCESSING OF COMPLEX SETS OF XML SCHEMAS 有权
    用于XML方案的复杂集合的有效和可扩展处理的技术

    公开(公告)号:US20110179089A1

    公开(公告)日:2011-07-21

    申请号:US12689729

    申请日:2010-01-19

    IPC分类号: G06F17/30

    摘要: Techniques are described herein for efficient and scalable processing of complex sets of XML schemas. The techniques described herein provide for reducing duplication of schema elements in volatile memory by building an XML schema in-memory model that stores repeating schema elements in in-memory data structures that are separate from in-memory data structures that store the parent schema elements which logically include or otherwise refer to the repeating schema elements. The techniques described herein also provide for faster generation of an in-memory model of an XML schema by pre-loading, in data structures on persistent storage, of schema elements from dependent XML schemas that are referenced and/or incorporated by the XML schema. The techniques described herein also provide for efficient processing of inter-dependent XML schemas by tracking all unresolved schema elements from dependent XML schemas and freeing the portions of volatile memory, which are used to process schema elements from the dependent XML schemas, as soon as the dependent schema elements being processed are stored in data structures on persistent storage.

    摘要翻译: 这里描述了用于复杂的XML模式集合的有效和可缩放处理的技术。 本文描述的技术提供了通过构建XML模式内存中模型来减少易失性存储器中的模式元素的重复,该模式将内存模型元素存储在与存储父模式元素的内存中数据结构分开的内存中数据结构中, 逻辑上包括或以其他方式引用重复的模式元素。 本文描述的技术还提供了通过在由XML模式引用和/或并入的依赖XML模式中预先加载在持久存储器上的模式元素的数据结构中来更快地生成XML模式的内存中模型。 本文描述的技术还提供了通过从依赖的XML模式跟踪所有未解决的模式元素并释放用于从依赖的XML模式处理模式元素的易失性存储器的部分的快速处理相互依赖的XML模式,一旦 正在处理的依赖模式元素存储在持久存储器上的数据结构中。

    Cache-Based Predicate Handling For Queries On XML Data Using Uncorrelated Path-Based Row Sources
    83.
    发明申请
    Cache-Based Predicate Handling For Queries On XML Data Using Uncorrelated Path-Based Row Sources 有权
    基于缓存的谓词处理使用不相关的基于路径的行源的XML数据查询

    公开(公告)号:US20100306219A1

    公开(公告)日:2010-12-02

    申请号:US12474124

    申请日:2009-05-28

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30935

    摘要: The approaches described herein provide an efficient way to process certain kinds of queries that retrieve XML data stored in an object-relational database. An XML query may include a predicate condition that joins two row sources. One or both of the row sources required for predicate evaluation may be path-based; that is, consume and produce rows of XMLType data. Such a path-based row source may be modified to use the predicate condition to filter the rows it generates so that only rows satisfying the predicate condition need to be further processed. A small cache structure introduced into the path-based row source may be used to optimize the evaluation of the predicate row. Once populated with row data, the cache structure may be reorganized to optimize the performance of predicate evaluation.

    摘要翻译: 本文描述的方法提供了处理检索存储在对象关系数据库中的XML数据的某些类型的查询的有效方式。 XML查询可能包括连接两个行源的谓词条件。 谓词评估所需的一个或两个行来源可能是基于路径的; 也就是消耗和生成XMLType数据行。 这样的基于路径的行源可以被修改为使用谓词条件来过滤其生成的行,使得仅需要进一步处理满足谓词条件的行。 引入基于路径的行源中的小缓存结构可用于优化谓词行的评估。 一旦填充行数据,缓存结构可能被重组,以优化谓词评估的性能。

    LOADING AN INDEX WITH MINIMAL EFFECT ON AVAILABILITY OF APPLICATIONS USING THE CORRESPONDING TABLE
    84.
    发明申请
    LOADING AN INDEX WITH MINIMAL EFFECT ON AVAILABILITY OF APPLICATIONS USING THE CORRESPONDING TABLE 有权
    使用相应表加载对应用可用性的最小影响的索引

    公开(公告)号:US20100235348A1

    公开(公告)日:2010-09-16

    申请号:US12401397

    申请日:2009-03-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30911

    摘要: A database server provides a partitioned table with a corresponding locally partitioned index. When a request is made to load data into the table by an application, a table partition, along with a corresponding index partition, is created for the application. The corresponding index partition in marked as asynchronously maintained, allowing other index partitions to be used by the query optimizer in formulating query plans for other table partitions. Once the loading of data is complete, a synchronization operation is called with the result of conforming the data in the asynchronously maintained index partition with its corresponding table in which the data was loaded. Once the data in the corresponding table partition and the data in the asynchronously maintained index partition are synchronized, the asynchronously maintained index partition is marked as synchronously maintained.

    摘要翻译: 数据库服务器提供具有相应的本地分区索引的分区表。 当应用程序将数据加载到表中时,会为应用程序创建表分区以及相应的索引分区。 相应的索引分区被标记为异步维护,允许查询优化器使用其他索引分区来为其他表分区制定查询计划。 一旦数据加载完成,就会调用同步操作,使异步维护的索引分区中的数据与其中加载数据的对应表相符。 一旦相应表分区中的数据和异步维护的索引分区中的数据同步,则异步维护的索引分区被标记为同步维护。

    Incremental maintenance of an XML index on binary XML data
    85.
    发明授权
    Incremental maintenance of an XML index on binary XML data 有权
    对二进制XML数据的XML索引的增量维护

    公开(公告)号:US07739251B2

    公开(公告)日:2010-06-15

    申请号:US11715603

    申请日:2007-03-07

    IPC分类号: G06F17/00

    CPC分类号: G06F17/30911 G06F17/30569

    摘要: Techniques are provided for incrementally maintaining an XML index built to access XML data that is encoded in binary XML form. Rather than delete and reinsert index entries of all the nodes of a modified XML document, only the index entries of the affected nodes are modified. Consequently, the order key values stored in the index may become inconsistent with the current hierarchical locations of the nodes to which the order key values correspond. Techniques are described for resolving the inconsistencies, and for addressing additional problems that result when the XML index is path-subsetted.

    摘要翻译: 提供了用于增量维护构建以访问以二进制XML形式编码的XML数据的XML索引的技术。 修改的XML文档的所有节点的删除和重新插入索引条目,而不是修改受影响的节点的索引条目。 因此,存储在索引中的订单键值可能与订单键值对应的节点的当前分层位置变得不一致。 描述了解决不一致的技术,以及解决当XML索引是路径子集化时产生的其他问题。

    Estimating Cardinalities of XML Table Constructs Within Queries
    86.
    发明申请
    Estimating Cardinalities of XML Table Constructs Within Queries 有权
    估计查询中XML表构造的基数

    公开(公告)号:US20100145957A1

    公开(公告)日:2010-06-10

    申请号:US12328642

    申请日:2008-12-04

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30935

    摘要: Techniques are provided for estimating the cardinality of a virtual result table that is produced by executing path-based table functions within a query, such as the XMLTABLE function. Some path-based table functions apply a path expression to input from a base table of XML documents to select rows to produce the result table. Path statistics are collected for the path expressions for the base table. The path statistics are used to estimate the cardinalities of the result table. The estimated cardinality of the result table is useful for estimating costs of query execution plans that are generated for the query.

    摘要翻译: 提供了用于估计通过在查询中执行基于路径的表函数(例如XMLTABLE函数)产生的虚拟结果表的基数的技术。 一些基于路径的表函数将路径表达式应用于从XML文档的基表中输入,以选择行以生成结果表。 针对基表的路径表达式收集路径统计信息。 路径统计信息用于估计结果表的基数。 结果表的估计基数对于为查询生成的查询执行计划的估计成本很有用。

    FAST EXTRACTION OF SCALAR VALUES FROM BINARY ENCODED XML
    88.
    发明申请
    FAST EXTRACTION OF SCALAR VALUES FROM BINARY ENCODED XML 有权
    从二进制编码XML快速提取标量值

    公开(公告)号:US20090307239A1

    公开(公告)日:2009-12-10

    申请号:US12134196

    申请日:2008-06-06

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30923

    摘要: Techniques are provided for efficiently extracting scalar values from binary-encoded XML data. Node information is stored in association with binary-encoded XML data to indicate whether one or more nodes of an XML document are simple or complex. A node is simple if the node has no child elements and no attributes. The node information of a particular node is used to determine whether a particular node, identified in a query, is simple or complex. If the particular node is simple, then the scalar value of the particular node is identified without performing any operations other than possibly converting the scalar value to a non-binary-encoded format or converting the scalar value to a value of a different data type.

    摘要翻译: 提供了从二进制编码的XML数据高效地提取标量值的技术。 节点信息与二进制编码的XML数据相关联地存储以指示XML文档的一个或多个节点是简单还是复杂的。 如果节点没有子元素而没有属性,则节点很简单。 特定节点的节点信息用于确定在查询中标识的特定节点是简单还是复杂。 如果特定节点是简单的,则特定节点的标量值被识别,而不执行除可能将标量值转换为非二进制编码格式之外的任何操作,或者将标量值转换为不同数据类型的值。

    EFFICIENT STREAMING EVALUATION OF XPATHS ON BINARY-ENCODED XML SCHEMA-BASED DOCUMENTS
    89.
    发明申请
    EFFICIENT STREAMING EVALUATION OF XPATHS ON BINARY-ENCODED XML SCHEMA-BASED DOCUMENTS 有权
    XPATHS对二进制编码的XML样本文档的有效流程评估

    公开(公告)号:US20090150412A1

    公开(公告)日:2009-06-11

    申请号:US11950642

    申请日:2007-12-05

    IPC分类号: G06F17/30

    摘要: A database system may utilize XML schema information to increase the efficiency of an XPath streaming evaluation. The database system may access XML schema or translation information during the evaluation of an element, attribute, or value in an XML data source. Based on the XML schema or translation information, the database system may determine matches to an XPath expression without decoding any binary-encoded data in the XML data source. Also, based on the XML schema information, the database may selectively skip or evaluate portions of the XML data source depending on whether those portions are defined in the XML schema so as to possibly contain a match to one or more unmatched steps in the XPath expression. XML schema information may be compiled into a compiled representation of the XPath expression for additional efficiencies.

    摘要翻译: 数据库系统可以利用XML模式信息来提高XPath流评估的效率。 在XML数据源中的元素,属性或值的评估期间,数据库系统可以访问XML模式或翻译信息。 基于XML模式或翻译信息,数据库系统可以确定与XPath表达式的匹配,而不对XML数据源中的任何二进制编码数据进行解码。 而且,基于XML模式信息,数据库可以有选择地跳过或评估XML数据源的部分,这取决于这些部分是否在XML模式中定义,以便可能包含与XPath表达式中的一个或多个不相匹配的步骤的匹配 。 可以将XML模式信息编译为XPath表达式的编译表达式,以获得更多的效率。

    DOCUMENT FIDELITY WITH BINARY XML STORAGE
    90.
    发明申请
    DOCUMENT FIDELITY WITH BINARY XML STORAGE 有权
    具有二进制XML存储的文档清晰度

    公开(公告)号:US20090112902A1

    公开(公告)日:2009-04-30

    申请号:US12180760

    申请日:2008-07-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30569 G06F17/30908

    摘要: Techniques are provided for ensuring lexical fidelity when an XML document is stored in a binary format. Operations, on the XML data, that would cause the loss of lexical fidelity between the original XML document and the binary-encoded version of the XML document are not performed. Such operations include the removal of unnecessary whitespace characters, certain data type conversions, CRLF normalization, the “collapsing” of two-tag empty elements into a single tag empty element, and the replacing of entity references or numeric character references with another value. An XML schema, to which the XML document conforms, may indicate that the XML document is to be stored in a lexical fidelity mode. Additionally, or alternatively, the database statement that (when executed) causes the XML document to be stored in a binary format may so indicate.

    摘要翻译: 当XML文档以二进制格式存储时,提供了用于确保词汇保真度的技术。 不会执行对原始XML文档和XML文档的二进制编码版本之间的词法保真度的XML数据的操作。 这样的操作包括删除不必要的空白字符,某些数据类型转换,CRLF归一化,将双标签空元素“折叠”到单个标签空元素中,以及用另一个值替换实体引用或数字字符引用。 XML文档符合的XML模式可以指示XML文档将以词法保真模式存储。 另外或替代地,数据库语句(当被执行时)导致XML文档以二进制格式存储可以这样表示。