TECHNIQUES FOR EFFICIENT AND SCALABLE PROCESSING OF COMPLEX SETS OF XML SCHEMAS
    1.
    Patent application
    TECHNIQUES FOR EFFICIENT AND SCALABLE PROCESSING OF COMPLEX SETS OF XML SCHEMAS (status: in force)

    Publication No.: US20110179089A1

    Publication date: 2011-07-21

    Application No.: US12689729

    Filing date: 2010-01-19

    IPC classification: G06F17/30

    Abstract: Techniques are described herein for efficient and scalable processing of complex sets of XML schemas. The techniques described herein provide for reducing duplication of schema elements in volatile memory by building an XML schema in-memory model that stores repeating schema elements in in-memory data structures that are separate from in-memory data structures that store the parent schema elements which logically include or otherwise refer to the repeating schema elements. The techniques described herein also provide for faster generation of an in-memory model of an XML schema by pre-loading, in data structures on persistent storage, of schema elements from dependent XML schemas that are referenced and/or incorporated by the XML schema. The techniques described herein also provide for efficient processing of inter-dependent XML schemas by tracking all unresolved schema elements from dependent XML schemas and freeing the portions of volatile memory, which are used to process schema elements from the dependent XML schemas, as soon as the dependent schema elements being processed are stored in data structures on persistent storage.
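
    The first idea in this abstract, keeping repeating schema elements in a structure separate from the parents that refer to them, can be illustrated with a minimal sketch. It is not the patented implementation: the names ElementDecl, SchemaModel and add_child are hypothetical, and a real schema model would carry far more detail than a name and a type.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class ElementDecl:
    """Hypothetical, heavily simplified description of a schema element."""
    name: str
    type_name: str


@dataclass
class SchemaModel:
    # Shared store: each distinct declaration is kept exactly once.
    decls: List[ElementDecl] = field(default_factory=list)
    index: Dict[ElementDecl, int] = field(default_factory=dict)
    # Parents hold only integer references into the shared store.
    children: Dict[str, List[int]] = field(default_factory=dict)

    def add_child(self, parent: str, decl: ElementDecl) -> int:
        ref = self.index.get(decl)
        if ref is None:
            # First time this declaration is seen: store it once.
            ref = len(self.decls)
            self.decls.append(decl)
            self.index[decl] = ref
        self.children.setdefault(parent, []).append(ref)
        return ref


model = SchemaModel()
model.add_child("customer", ElementDecl("address", "AddressType"))
model.add_child("supplier", ElementDecl("address", "AddressType"))
model.add_child("customer", ElementDecl("name", "xs:string"))
```

    Here "customer" and "supplier" both refer to the same stored "address" declaration, so the repeated element occupies memory once regardless of how many parents include it.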

    Techniques for efficient and scalable processing of complex sets of XML schemas
    2.
    Granted patent
    Techniques for efficient and scalable processing of complex sets of XML schemas (status: in force)

    Publication No.: US08209361B2

    Publication date: 2012-06-26

    Application No.: US12689729

    Filing date: 2010-01-19

    IPC classification: G06F17/30

    Abstract: Techniques are described herein for efficient and scalable processing of complex sets of XML schemas. The techniques described herein provide for reducing duplication of schema elements in volatile memory by building an XML schema in-memory model that stores repeating schema elements in in-memory data structures that are separate from in-memory data structures that store the parent schema elements which logically include or otherwise refer to the repeating schema elements. The techniques described herein also provide for faster generation of an in-memory model of an XML schema by pre-loading, in data structures on persistent storage, of schema elements from dependent XML schemas that are referenced and/or incorporated by the XML schema. The techniques described herein also provide for efficient processing of inter-dependent XML schemas by tracking all unresolved schema elements from dependent XML schemas and freeing the portions of volatile memory, which are used to process schema elements from the dependent XML schemas, as soon as the dependent schema elements being processed are stored in data structures on persistent storage.

    Document fidelity with binary XML storage
    5.
    Granted patent
    Document fidelity with binary XML storage (status: in force)

    Publication No.: US08090731B2

    Publication date: 2012-01-03

    Application No.: US12180760

    Filing date: 2008-07-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30569 G06F17/30908

    Abstract: Techniques are provided for ensuring lexical fidelity when an XML document is stored in a binary format. Operations, on the XML data, that would cause the loss of lexical fidelity between the original XML document and the binary-encoded version of the XML document are not performed. Such operations include the removal of unnecessary whitespace characters, certain data type conversions, CRLF normalization, the “collapsing” of two-tag empty elements into a single tag empty element, and the replacing of entity references or numeric character references with another value. An XML schema, to which the XML document conforms, may indicate that the XML document is to be stored in a lexical fidelity mode. Additionally, or alternatively, the database statement that (when executed) causes the XML document to be stored in a binary format may so indicate.
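
    To make the fidelity-losing operations concrete, the sketch below round-trips a small document through Python's standard xml.etree.ElementTree and compares it with a verbatim copy. The store function and its lexical_fidelity flag are hypothetical stand-ins for the mode the abstract describes; no binary encoding is performed here, and a real store would preserve fidelity inside the encoded form rather than by keeping the raw string.

```python
import xml.etree.ElementTree as ET


def store(document: str, lexical_fidelity: bool) -> str:
    """Return the text a hypothetical store would hand back on retrieval."""
    if lexical_fidelity:
        # Skip every normalizing step; keep the characters exactly as received.
        return document
    # Normalizing path: parse and re-serialize, which may alter the lexical form.
    return ET.tostring(ET.fromstring(document), encoding="unicode")


# Contains a two-tag empty element, a numeric character reference and a CRLF.
original = "<doc><empty></empty><t>&#65;B\r\nC</t></doc>"

normalized = store(original, lexical_fidelity=False)
faithful = store(original, lexical_fidelity=True)

print(repr(normalized))
print(faithful == original)
```

    The normalized result no longer contains the two-tag empty element, the character reference, or the carriage return, which are three of the changes the abstract lists.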

    DOCUMENT FIDELITY WITH BINARY XML STORAGE
    6.
    Patent application
    DOCUMENT FIDELITY WITH BINARY XML STORAGE (status: in force)

    Publication No.: US20090112902A1

    Publication date: 2009-04-30

    Application No.: US12180760

    Filing date: 2008-07-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30569 G06F17/30908

    Abstract: Techniques are provided for ensuring lexical fidelity when an XML document is stored in a binary format. Operations, on the XML data, that would cause the loss of lexical fidelity between the original XML document and the binary-encoded version of the XML document are not performed. Such operations include the removal of unnecessary whitespace characters, certain data type conversions, CRLF normalization, the “collapsing” of two-tag empty elements into a single tag empty element, and the replacing of entity references or numeric character references with another value. An XML schema, to which the XML document conforms, may indicate that the XML document is to be stored in a lexical fidelity mode. Additionally, or alternatively, the database statement that (when executed) causes the XML document to be stored in a binary format may so indicate.

    Validation of XML content in a streaming fashion
    7.
    Patent application
    Validation of XML content in a streaming fashion (status: under examination, published)

    Publication No.: US20080092037A1

    Publication date: 2008-04-17

    Application No.: US11729943

    Filing date: 2007-03-28

    IPC classification: G06F17/00 G06F7/00

    CPC classification: G06F17/2725 G06F17/2247

    Abstract: A method and apparatus for validating XML documents in a streaming fashion is provided. A streaming validator validates an XML document by comparing the contents of the XML document to an XML schema. Tokens are generated for each element or attribute of the XML schema and for each element or attribute of the XML document using the same token generator function. The elements and attributes of the XML document and XML schema are compared using tokens rather than string comparisons to perform the validation more efficiently.
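
    A rough sketch of token-based streaming validation follows. It assumes a toy schema format (a dictionary of allowed children and attributes per element) rather than real XSD, and the names Tokenizer, compile_schema and validate are hypothetical. The point it illustrates is that schema names and document names pass through the same token function, so the checks compare integers instead of strings.

```python
import io
import xml.etree.ElementTree as ET
from typing import Dict, Set, Tuple


class Tokenizer:
    """Maps each distinct name to a small integer token."""

    def __init__(self) -> None:
        self._tokens: Dict[str, int] = {}

    def token(self, name: str) -> int:
        return self._tokens.setdefault(name, len(self._tokens))


Compiled = Dict[int, Tuple[Set[int], Set[int]]]


def compile_schema(tok: Tokenizer, schema: dict) -> Compiled:
    # element token -> (allowed child tokens, allowed attribute tokens)
    return {
        tok.token(name): (
            {tok.token(c) for c in rule["children"]},
            {tok.token(a) for a in rule["attrs"]},
        )
        for name, rule in schema.items()
    }


def validate(xml_text: str, tok: Tokenizer, compiled: Compiled, root: str) -> bool:
    stack = []  # tokens of the currently open elements
    root_token = tok.token(root)
    source = io.BytesIO(xml_text.encode("utf-8"))
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "end":
            stack.pop()
            elem.clear()  # drop processed content to keep memory use low
            continue
        t = tok.token(elem.tag)
        if t not in compiled:
            return False
        if stack:
            if t not in compiled[stack[-1]][0]:
                return False
        elif t != root_token:
            return False
        if any(tok.token(a) not in compiled[t][1] for a in elem.attrib):
            return False
        stack.append(t)
    return True


# Toy schema (an assumption of this sketch, not XSD).
schema = {
    "order": {"children": ["item"], "attrs": ["id"]},
    "item": {"children": [], "attrs": ["sku"]},
}
tok = Tokenizer()
compiled = compile_schema(tok, schema)

ok = validate('<order id="1"><item sku="a"/></order>', tok, compiled, "order")
bad_child = validate('<order id="1"><note/></order>', tok, compiled, "order")
bad_attr = validate('<order><item color="red"/></order>', tok, compiled, "order")
```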

    Techniques for efficient queries on a file system-like repository
    8.
    Granted patent
    Techniques for efficient queries on a file system-like repository (status: in force)

    Publication No.: US09146961B2

    Publication date: 2015-09-29

    Application No.: US12618356

    Filing date: 2009-11-13

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30471 G06F17/30477

    Abstract: Techniques are provided to achieve performance improvements for path-based access to hierarchical data and for utilizing an extensible indexing framework. Extensible operators may be optimized by evaluating the operators using functions native to the database system instead of using functions defined through an extensible indexing mechanism. Furthermore, the database system may resolve a resource path of a query when the query is compiled, and may store the mapping of the resource path to one or more resources in a cursor for the query execution plan to avoid resolving the resource path each time the cursor is used to run the query. The cursor may be made dependent on the one or more resources to which the resource path refers at compile time. This takes advantage of the observation that the mapping of the resource path to particular resources rarely changes between evaluations of a particular query.
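
    The cursor behaviour described in this abstract can be sketched as a cache with dependency-based invalidation. The Repository and Cursor classes below are hypothetical and in-memory only; they show a path being resolved once at compile time, reused on later executions, and resolved again only after a resource the cursor depends on has changed.

```python
from typing import Dict, Optional, Set


class Repository:
    """Toy path-to-resource mapping with dependency tracking."""

    def __init__(self, paths: Dict[str, int]) -> None:
        self.paths = dict(paths)          # resource path -> resource id
        self.resolve_calls = 0            # counts the expensive resolutions
        self.dependents: Dict[int, Set["Cursor"]] = {}

    def resolve(self, path: str) -> int:
        self.resolve_calls += 1
        return self.paths[path]

    def rename(self, old_path: str, new_path: str) -> None:
        resource_id = self.paths.pop(old_path)
        self.paths[new_path] = resource_id
        # Invalidate every cursor that was compiled against this resource.
        for cursor in self.dependents.pop(resource_id, set()):
            cursor.valid = False


class Cursor:
    """Caches the path resolution made when the query was compiled."""

    def __init__(self, repo: Repository, path: str) -> None:
        self.repo = repo
        self.path = path
        self.valid = False
        self.resource_id: Optional[int] = None

    def _compile(self) -> None:
        self.resource_id = self.repo.resolve(self.path)
        self.repo.dependents.setdefault(self.resource_id, set()).add(self)
        self.valid = True

    def execute(self) -> int:
        if not self.valid:
            self._compile()
        return self.resource_id


repo = Repository({"/docs/a.xml": 101})
cursor = Cursor(repo, "/docs/a.xml")
first = cursor.execute()
second = cursor.execute()
calls_before_change = repo.resolve_calls

# The path now points at a different resource, so the cursor must recompile.
repo.rename("/docs/a.xml", "/docs/old.xml")
repo.paths["/docs/a.xml"] = 202
third = cursor.execute()
```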

    Technique to gather statistics on very large hierarchical collections in a scalable and efficient manner
    9.
    Granted patent
    Technique to gather statistics on very large hierarchical collections in a scalable and efficient manner (status: in force)

    Publication No.: US09582525B2

    Publication date: 2017-02-28

    Application No.: US12555198

    Filing date: 2009-09-08

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30306

    Abstract: Techniques are provided for efficiently collecting statistics for hierarchically-organized collections of data. A database system leverages container-level modification time stamps and stored subtree-level change information to gather statistical information from only those resources in a hierarchical collection for which the statistics have changed since the last time that statistics were gathered for the collection. A lockless data structure is also described for storing the subtree-level change information in which an identifier corresponding to each subtree in a collection containing a changed resource may be stored. This data structure may be a table that is distinct from one or more tables representing the collection. In one embodiment of the invention, the immediate parent resource of a particular modified resource may be omitted from the subtree table by leveraging modification time stamps while gathering statistics based on tracked subtree-level information.
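
    The incremental part of this technique can be sketched with a flat, single-level simplification. The Collection class is hypothetical: a Python set stands in for the separate change table, and the sketch omits both the modification time stamps and the lockless structure. It only shows that a statistics pass rescans the subtrees recorded as changed and reuses cached figures for the rest.

```python
from typing import Dict, Set, Tuple


class Collection:
    """Toy hierarchical collection with one level of subtrees."""

    def __init__(self) -> None:
        self.subtrees: Dict[str, Dict[str, int]] = {}   # subtree -> {resource: size}
        self.changed: Set[str] = set()                  # stand-in for the change table
        self.cached: Dict[str, Tuple[int, int]] = {}    # subtree -> (count, total size)
        self.scans = 0                                  # subtrees rescanned so far

    def put(self, subtree: str, name: str, size: int) -> None:
        self.subtrees.setdefault(subtree, {})[name] = size
        # Record only the identifier of the subtree that contains the change.
        self.changed.add(subtree)

    def gather(self) -> Tuple[int, int]:
        # Rescan only the subtrees touched since the previous gather.
        for subtree in self.changed:
            sizes = self.subtrees[subtree].values()
            self.cached[subtree] = (len(sizes), sum(sizes))
            self.scans += 1
        self.changed.clear()
        count = sum(c for c, _ in self.cached.values())
        total = sum(t for _, t in self.cached.values())
        return count, total


coll = Collection()
coll.put("/a", "x", 10)
coll.put("/a", "y", 5)
coll.put("/b", "z", 7)
first_stats = coll.gather()
scans_after_first = coll.scans

coll.put("/b", "w", 1)           # only /b changes
second_stats = coll.gather()
scans_after_second = coll.scans
```

    After the second pass only one additional subtree has been scanned, even though the collection holds two.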
