    1. Document fidelity with binary XML storage
    Type: Granted patent
    Status: In force

    Publication No.: US08090731B2

    Publication date: 2012-01-03

    Application No.: US12180760

    Filing date: 2008-07-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30569 G06F17/30908

    Abstract: Techniques are provided for ensuring lexical fidelity when an XML document is stored in a binary format. Operations on the XML data that would cause a loss of lexical fidelity between the original XML document and the binary-encoded version of the XML document are not performed. Such operations include the removal of unnecessary whitespace characters, certain data type conversions, CRLF normalization, the “collapsing” of two-tag empty elements into a single-tag empty element, and the replacing of entity references or numeric character references with another value. An XML schema, to which the XML document conforms, may indicate that the XML document is to be stored in a lexical fidelity mode. Additionally, or alternatively, the database statement that (when executed) causes the XML document to be stored in a binary format may so indicate.
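The loss of lexical fidelity that the abstract describes can be seen with an ordinary parse-and-serialize round trip. The sketch below is only an illustration using Python's standard library; it does not reproduce the patent's binary encoding, and `store_with_fidelity` is a hypothetical stand-in for a fidelity-mode store that simply leaves the characters untouched.

```python
import xml.etree.ElementTree as ET

# Source text with a two-tag empty element and a numeric character reference.
original = "<doc><empty></empty>&#65;&amp;B</doc>"

# A conventional round trip keeps the data model but not the exact characters:
# the empty element is collapsed and the character reference is replaced.
round_tripped = ET.tostring(ET.fromstring(original), encoding="unicode")


def store_with_fidelity(text: str) -> bytes:
    """Hypothetical fidelity-mode store: skip every normalizing operation."""
    return text.encode("utf-8")


preserved = store_with_fidelity(original).decode("utf-8")

print(round_tripped == original)  # the round trip changed the text
print(preserved == original)      # the fidelity-mode copy did not
```

In a real system the choice between the two paths would come from the XML schema or the storing database statement, as the abstract states; here it is just a choice of function.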

    2. TECHNIQUES FOR EFFICIENT AND SCALABLE PROCESSING OF COMPLEX SETS OF XML SCHEMAS
    Type: Patent application
    Status: In force

    Publication No.: US20110179089A1

    Publication date: 2011-07-21

    Application No.: US12689729

    Filing date: 2010-01-19

    IPC classification: G06F17/30

    Abstract: Techniques are described herein for efficient and scalable processing of complex sets of XML schemas. The techniques reduce duplication of schema elements in volatile memory by building an in-memory model of an XML schema that stores repeating schema elements in in-memory data structures separate from the in-memory data structures that store the parent schema elements which logically include or otherwise refer to the repeating schema elements. The techniques also provide for faster generation of an in-memory model of an XML schema by pre-loading, into data structures on persistent storage, schema elements from dependent XML schemas that are referenced and/or incorporated by the XML schema. The techniques also provide for efficient processing of inter-dependent XML schemas by tracking all unresolved schema elements from dependent XML schemas and freeing the portions of volatile memory used to process schema elements from the dependent XML schemas as soon as the dependent schema elements being processed are stored in data structures on persistent storage.
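The first idea in the abstract, keeping repeating schema elements apart from the parents that refer to them, can be sketched as a registry of shared definitions. Everything here (`ElementDef`, `SchemaModel`, the element names) is hypothetical and chosen for illustration; the patent's actual data structures are not shown in the abstract.

```python
from dataclasses import dataclass, field


@dataclass
class ElementDef:
    name: str
    type_name: str
    # Parents hold the names of their children, not copies of them.
    child_refs: list = field(default_factory=list)


class SchemaModel:
    def __init__(self):
        # Exactly one stored definition per schema element name.
        self.shared = {}

    def define(self, name, type_name, children=()):
        element = self.shared.setdefault(name, ElementDef(name, type_name))
        element.child_refs = list(children)
        return element

    def children_of(self, name):
        return [self.shared[ref] for ref in self.shared[name].child_refs]


model = SchemaModel()
model.define("address", "AddressType")
model.define("customer", "CustomerType", ["address"])
model.define("supplier", "SupplierType", ["address"])

# Both parents resolve to the same in-memory object for "address".
same_object = model.children_of("customer")[0] is model.children_of("supplier")[0]
```

The design choice is the usual one for shared definitions: a parent costs one reference per child, so a definition that repeats under many parents is held in memory once.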

    3. DOCUMENT FIDELITY WITH BINARY XML STORAGE
    Type: Patent application
    Status: In force

    Publication No.: US20090112902A1

    Publication date: 2009-04-30

    Application No.: US12180760

    Filing date: 2008-07-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30569 G06F17/30908

    Abstract: Techniques are provided for ensuring lexical fidelity when an XML document is stored in a binary format. Operations on the XML data that would cause a loss of lexical fidelity between the original XML document and the binary-encoded version of the XML document are not performed. Such operations include the removal of unnecessary whitespace characters, certain data type conversions, CRLF normalization, the “collapsing” of two-tag empty elements into a single-tag empty element, and the replacing of entity references or numeric character references with another value. An XML schema, to which the XML document conforms, may indicate that the XML document is to be stored in a lexical fidelity mode. Additionally, or alternatively, the database statement that (when executed) causes the XML document to be stored in a binary format may so indicate.

    4. Validation of XML content in a streaming fashion
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20080092037A1

    Publication date: 2008-04-17

    Application No.: US11729943

    Filing date: 2007-03-28

    IPC classification: G06F17/00 G06F7/00

    CPC classification: G06F17/2725 G06F17/2247

    Abstract: A method and apparatus for validating XML documents in a streaming fashion is provided. A streaming validator validates an XML document by comparing the contents of the XML document to an XML schema. Tokens are generated for each element or attribute of the XML schema and for each element or attribute of the XML document using the same token generator function. The elements and attributes of the XML document and XML schema are compared using tokens rather than string comparisons to perform the validation more efficiently.
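A minimal sketch of token-based streaming validation, assuming a drastically simplified "schema" that only lists which child elements each parent may contain. The names `token_for`, `SCHEMA`, and `validate_stream` are hypothetical; the point is only that schema names and document names pass through the same token function, so the checks compare integers instead of strings.

```python
import io
import xml.etree.ElementTree as ET

_tokens = {}


def token_for(name: str) -> int:
    """One token function shared by the schema and the document."""
    return _tokens.setdefault(name, len(_tokens))


# Hypothetical simplified schema: parent token -> set of allowed child tokens.
SCHEMA = {
    token_for("order"): {token_for("item"), token_for("note")},
    token_for("item"): set(),
    token_for("note"): set(),
}


def validate_stream(source) -> bool:
    stack = []  # tokens of the currently open elements
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            tok = token_for(elem.tag)
            if stack:
                if tok not in SCHEMA.get(stack[-1], set()):
                    return False  # child not allowed under this parent
            elif tok not in SCHEMA:
                return False      # unknown root element
            stack.append(tok)
        else:
            stack.pop()
            elem.clear()          # discard content already validated
    return True


ok = validate_stream(io.BytesIO(b"<order><item/><note/></order>"))
bad = validate_stream(io.BytesIO(b"<order><price/></order>"))
```

Attributes, data types, and occurrence constraints are left out; a real validator would tokenize those in the same way.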

    5. Techniques for efficient and scalable processing of complex sets of XML schemas
    Type: Granted patent
    Status: In force

    Publication No.: US08209361B2

    Publication date: 2012-06-26

    Application No.: US12689729

    Filing date: 2010-01-19

    IPC classification: G06F17/30

    Abstract: Techniques are described herein for efficient and scalable processing of complex sets of XML schemas. The techniques reduce duplication of schema elements in volatile memory by building an in-memory model of an XML schema that stores repeating schema elements in in-memory data structures separate from the in-memory data structures that store the parent schema elements which logically include or otherwise refer to the repeating schema elements. The techniques also provide for faster generation of an in-memory model of an XML schema by pre-loading, into data structures on persistent storage, schema elements from dependent XML schemas that are referenced and/or incorporated by the XML schema. The techniques also provide for efficient processing of inter-dependent XML schemas by tracking all unresolved schema elements from dependent XML schemas and freeing the portions of volatile memory used to process schema elements from the dependent XML schemas as soon as the dependent schema elements being processed are stored in data structures on persistent storage.

    6. Incremental maintenance of an XML index on binary XML data
    Type: Patent application
    Status: In force

    Publication No.: US20080098020A1

    Publication date: 2008-04-24

    Application No.: US11715603

    Filing date: 2007-03-07

    IPC classification: G06F17/30

    CPC classification: G06F17/30911 G06F17/30569

    Abstract: Techniques are provided for incrementally maintaining an XML index built to access XML data that is encoded in binary XML form. Rather than deleting and reinserting the index entries of all the nodes of a modified XML document, only the index entries of the affected nodes are modified. Consequently, the order key values stored in the index may become inconsistent with the current hierarchical locations of the nodes to which the order key values correspond. Techniques are described for resolving the inconsistencies, and for addressing additional problems that result when the XML index is path-subsetted.
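The cost difference between the two maintenance strategies can be sketched with a toy index that counts how many entries it writes. `XmlIndex` and its methods are hypothetical, and the sketch deliberately leaves out order keys and path subsetting, which are the parts of the technique the abstract says need additional handling.

```python
class XmlIndex:
    """Toy index: node id -> (path, value). Counts entry writes."""

    def __init__(self):
        self.entries = {}
        self.writes = 0

    def put(self, node_id, path, value):
        self.entries[node_id] = (path, value)
        self.writes += 1

    def delete(self, node_id):
        self.entries.pop(node_id, None)
        self.writes += 1

    def rebuild(self, nodes):
        """Delete and reinsert the entries of every node in the document."""
        for node_id in list(self.entries):
            self.delete(node_id)
        for node_id, (path, value) in nodes.items():
            self.put(node_id, path, value)

    def apply_changes(self, changed):
        """Touch only the entries of the affected nodes."""
        for node_id, (path, value) in changed.items():
            self.put(node_id, path, value)


nodes = {i: ("/order/item", f"v{i}") for i in range(1000)}
change = {7: ("/order/item", "changed")}

full = XmlIndex()
full.rebuild(nodes)
full.writes = 0
full.rebuild({**nodes, **change})
full_cost = full.writes

inc = XmlIndex()
inc.rebuild(nodes)
inc.writes = 0
inc.apply_changes(change)
inc_cost = inc.writes
```

Both indexes end in the same state; only the number of entry writes differs.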

    7. Efficient partitioning technique while managing large XML documents
    Type: Patent application
    Status: In force

    Publication No.: US20080091714A1

    Publication date: 2008-04-17

    Application No.: US11716074

    Filing date: 2007-03-08

    IPC classification: G06F17/00

    CPC classification: G06F17/30923

    Abstract: A method is provided to efficiently evaluate an expression to determine the partition key for an XML document stored in a database without first storing the entire XML document in temporary memory storage. The partition key is determined using streaming evaluation, or incrementally using a DOM node tree, as a portion of the document is read and stored in a buffer. The XML document is stored in the partition using the read portion of the document stored in the buffer and the remaining portion from the original source.
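The streaming variant can be sketched with Python's incremental `XMLPullParser`: read the source in chunks, keep the chunks in a buffer, and stop as soon as the element holding the partition key has been closed. The function name `find_partition_key`, the element name `region`, and the tiny chunk size are assumptions made for the example.

```python
import io
import xml.etree.ElementTree as ET


def find_partition_key(source, key_tag, chunk_size=16):
    """Return (key, buffered_bytes) after reading as little as possible."""
    parser = ET.XMLPullParser(events=("end",))
    buffered = []
    while True:
        chunk = source.read(chunk_size)
        if not chunk:
            return None, b"".join(buffered)  # key element never appeared
        buffered.append(chunk)
        parser.feed(chunk)
        for _event, elem in parser.read_events():
            if elem.tag == key_tag:
                return elem.text, b"".join(buffered)


doc = b"<order><region>EMEA</region>" + b"<line>x</line>" * 100 + b"</order>"
stream = io.BytesIO(doc)

key, head = find_partition_key(stream, "region")

# Store the document from the buffered head plus the unread rest of the source.
stored = head + stream.read()
```

Only the first few chunks are held in memory when the key is found; the rest of the document flows from the original source straight into the chosen partition.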

    8. In-place evolution of XML schemas
    Type: Patent application
    Status: In force

    Publication No.: US20050050105A1

    Publication date: 2005-03-03

    Application No.: US10648749

    Filing date: 2003-08-25

    IPC classification: G06F12/00 G06F17/30

    CPC classification: G06F17/30297 G06F17/30908

    Abstract: A method and system for the in-place evolution of XML schemas is disclosed. To automatically evolve an existing XML schema, a schema evolver receives both an existing XML schema and an XML document as input. The XML document indicates changes to be made to the existing XML schema. Based on the existing XML schema and the XML document, the schema evolver evolves the existing XML schema into a new XML schema that incorporates the changes indicated in the XML document. According to one aspect, the schema evolver generates one or more SQL statements based on the new XML schema. The SQL statements, when executed by a database server, cause the database server to evolve database structures that were based on the formerly existing XML schema so that the database structures conform to the new XML schema. This is accomplished “in place,” without copying the data in the database structures.
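The flow in the abstract (existing schema plus a change document in, new schema plus SQL statements out) can be sketched as follows. The change-document vocabulary (`<add>`, `<drop>`), the flat column-to-type "schema", the table name, and the generated SQL are all assumptions made for illustration; the abstract does not specify any of them.

```python
import xml.etree.ElementTree as ET


def evolve(schema, change_doc, table):
    """Return (new_schema, sql_statements); stored rows are never copied."""
    new_schema = dict(schema)
    statements = []
    for change in ET.fromstring(change_doc):
        name = change.get("name")
        if change.tag == "add":
            sql_type = change.get("type")
            new_schema[name] = sql_type
            statements.append(f"ALTER TABLE {table} ADD ({name} {sql_type})")
        elif change.tag == "drop":
            new_schema.pop(name, None)
            statements.append(f"ALTER TABLE {table} DROP COLUMN {name}")
    return new_schema, statements


old = {"id": "NUMBER", "fax": "VARCHAR2(20)"}
changes = (
    "<changes>"
    '<add name="email" type="VARCHAR2(100)"/>'
    '<drop name="fax"/>'
    "</changes>"
)

new, sql = evolve(old, changes, "customers")
```

The statements alter the existing structures rather than creating new ones and moving the data across, which is the sense of "in place" used in the abstract.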

    9. Efficient partitioning technique while managing large XML documents
    Type: Granted patent
    Status: In force

    Publication No.: US07933935B2

    Publication date: 2011-04-26

    Application No.: US11716074

    Filing date: 2007-03-08

    IPC classification: G06F17/00

    CPC classification: G06F17/30923

    Abstract: A method is provided to efficiently evaluate an expression to determine the partition key for an XML document stored in a database without first storing the entire XML document in temporary memory storage. The partition key is determined using streaming evaluation, or incrementally using a DOM node tree, as a portion of the document is read and stored in a buffer. The XML document is stored in the partition using the read portion of the document stored in the buffer and the remaining portion from the original source.

    10. Mechanism For Deferred Rewrite Of Multiple Xpath Evaluations Over Binary XML
    Type: Patent application
    Status: In force

    Publication No.: US20100030726A1

    Publication date: 2010-02-04

    Application No.: US12181725

    Filing date: 2008-07-29

    IPC classification: G06F17/30

    CPC classification: G06F17/30935 G06F17/30911

    Abstract: A mechanism is provided for improving the performance of particular database queries operating on a view comprising binary-encoded XML. The key to improving performance is to determine whether an XML index is available to improve the computation of XPath data before deriving an optimized execution plan for a particular query. When an XML index is not available to help expedite the computation of binary-encoded XML, deferring the computation of the XPath data until after the view merge stage avoids unnecessary computation of XPaths that are part of the view but not needed for the particular query.
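The decision described in the abstract can be sketched as a small planner: use the index when one exists, and otherwise evaluate only the XPaths the query actually needs instead of every XPath in the view. `VIEW_COLUMNS`, `run_query`, and the dictionary standing in for an XML index are hypothetical, and the paths use only the limited XPath subset that `xml.etree.ElementTree` supports.

```python
import xml.etree.ElementTree as ET

# Hypothetical view: column name -> XPath over each stored document.
VIEW_COLUMNS = {
    "name": "./name",
    "city": "./address/city",
    "total": "./total",
}

evaluated = []  # records which XPaths were actually computed


def xpath_value(root, column):
    evaluated.append(column)
    return root.findtext(VIEW_COLUMNS[column])


def run_query(doc, needed, index=None):
    if index is not None:
        # An index is available: answer from it, no XPath evaluation needed.
        return {col: index[col] for col in needed}
    # No index: defer XPath evaluation until the needed columns are known.
    root = ET.fromstring(doc)
    return {col: xpath_value(root, col) for col in needed}


doc = (
    "<customer><name>Ann</name>"
    "<address><city>Oslo</city></address>"
    "<total>42</total></customer>"
)

result = run_query(doc, ["city"])
```

An eager plan would compute all three view columns for every document before the query's projection is applied; the deferred plan computes one.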