TECHNIQUES FOR MORE EFFICIENT GENERATION OF XML EVENTS FROM XML DATA SOURCES
    1.
    Patent application
    Status: Active

    Publication No.: US20090125693A1

    Publication date: 2009-05-14

    Application No.: US11938020

    Filing date: 2007-11-09

    IPC classification: G06F13/00

    Abstract: One may increase the efficiency of an XML event-generating process by reducing the number of requests to allocate or deallocate system memory. Such reduction may occur as the result of pre-allocating a memory chunk of sufficient size to contain all of the memory buffers required by a particular event-generating process. Instead of allocating new memory chunks for new memory buffers, an application may store any required buffers within the pre-allocated memory chunk. A sufficient memory size may be estimated by performing the event-generating process on a training set of XML documents. Also, an application may re-use buffers during the process or between different iterations of the process, thus avoiding the need to deallocate and reallocate memory that is essentially being used for the same purpose.

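    The pre-allocation idea in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: `BufferArena`, `estimate_chunk_size`, and the `buffer_sizes_for` callback are hypothetical names, and the sizing rule (largest total seen over the training set) is an assumption.

```python
from typing import Callable, Iterable, List


class BufferArena:
    """Hands out buffers carved from one pre-allocated chunk (illustrative only)."""

    def __init__(self, chunk_size: int) -> None:
        self._chunk = bytearray(chunk_size)      # the single up-front allocation
        self._view = memoryview(self._chunk)
        self._offset = 0

    def take(self, size: int) -> memoryview:
        """Return a buffer of `size` bytes located inside the chunk."""
        end = self._offset + size
        if end > len(self._chunk):
            raise MemoryError("pre-allocated chunk is too small")
        buf = self._view[self._offset:end]       # a slice of the chunk, no copy
        self._offset = end
        return buf

    def reset(self) -> None:
        """Make the whole chunk available again for the next iteration."""
        self._offset = 0


def estimate_chunk_size(
    training_docs: Iterable[str],
    buffer_sizes_for: Callable[[str], List[int]],
) -> int:
    """Assumed sizing rule: the largest total buffer demand over the training set."""
    return max(sum(buffer_sizes_for(doc)) for doc in training_docs)
```

    Calling `reset()` between documents re-uses the same memory for the same purpose, so no further allocation requests are issued after the arena is built.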

    OPTIMIZED STREAMING EVALUATION OF XML QUERIES
    2.
    Patent application
    Status: Active

    Publication No.: US20090125495A1

    Publication date: 2009-05-14

    Application No.: US11938017

    Filing date: 2007-11-09

    IPC classification: G06F17/30

    Abstract: A database system may perform a streaming evaluation of an XPath expression by utilizing an XPath evaluation component in tandem with an XML event-streaming component. For a more optimal filtered streaming evaluation, the XML event-streaming component may provide an interface whereby the evaluation component sends certain criteria to the event-streaming component when requesting an XML event. The criteria may be based on a next unmatched step in the XPath expression. In response to the request for an XML event, the event-streaming component may only return events that match the criteria. The evaluation component may be, for example, a compiled state machine for the XPath expression. The criteria may be pre-compiled for each possible state in the state machine. The event-streaming component may also utilize the criteria along with schema information to skip parsing of certain segments of XML data.

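    The interaction between the two components can be sketched as follows. This is a toy illustration, not the patented design: it handles only child-axis paths such as /a/b/c, the names `FilteredEventStream` and `count_matches` are hypothetical, and the criteria are derived from the evaluator state on the fly rather than pre-compiled. The sketch still parses every event; skipping the parse itself with schema information is not shown.

```python
import io
import xml.etree.ElementTree as ET
from typing import Iterator, List, Optional, Tuple

Event = Tuple[str, str, int]  # (kind, tag, depth)


def raw_events(xml_bytes: bytes) -> Iterator[Event]:
    """Yield every start/end event together with the element depth."""
    depth = 0
    for kind, elem in ET.iterparse(io.BytesIO(xml_bytes), events=("start", "end")):
        if kind == "start":
            depth += 1
            yield kind, elem.tag, depth
        else:
            yield kind, elem.tag, depth
            depth -= 1


class FilteredEventStream:
    """Event-streaming side: returns only events that match the criteria."""

    def __init__(self, xml_bytes: bytes) -> None:
        self._events = raw_events(xml_bytes)
        self.skipped = 0  # events the evaluator never had to look at

    def next_event(self, want_tag: str, want_depth: int) -> Optional[Event]:
        """Next start of `want_tag` at `want_depth`, or the end of its parent."""
        for kind, tag, depth in self._events:
            if kind == "start" and tag == want_tag and depth == want_depth:
                return kind, tag, depth
            if kind == "end" and depth == want_depth - 1:
                return kind, tag, depth
            self.skipped += 1
        return None


def count_matches(xml_bytes: bytes, steps: List[str]) -> int:
    """Evaluator side: the state is the number of path steps matched so far."""
    stream = FilteredEventStream(xml_bytes)
    matched = 0
    hits = 0
    while True:
        # Criteria are based on the next unmatched step of the path.
        event = stream.next_event(steps[matched], matched + 1)
        if event is None:
            return hits
        if event[0] == "end":
            matched -= 1            # the matched parent closed; step back
        elif matched + 1 == len(steps):
            hits += 1               # full path matched; keep looking for siblings
        else:
            matched += 1
```

    For example, `count_matches(doc, ["a", "b", "c"])` counts the elements selected by /a/b/c while the evaluator only ever sees the events it asked for.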

    Optimized streaming evaluation of XML queries
    3.
    Granted patent
    Status: Active

    Publication No.: US08250062B2

    Publication date: 2012-08-21

    Application No.: US11938017

    Filing date: 2007-11-09

    IPC classification: G06F7/00 G06F17/00

    Abstract: A database system may perform a streaming evaluation of an XPath expression by utilizing an XPath evaluation component in tandem with an XML event-streaming component. For a more optimal filtered streaming evaluation, the XML event-streaming component may provide an interface whereby the evaluation component sends certain criteria to the event-streaming component when requesting an XML event. The criteria may be based on a next unmatched step in the XPath expression. In response to the request for an XML event, the event-streaming component may only return events that match the criteria. The evaluation component may be, for example, a compiled state machine for the XPath expression. The criteria may be pre-compiled for each possible state in the state machine. The event-streaming component may also utilize the criteria along with schema information to skip parsing of certain segments of XML data.


    Estimating Cardinalities of XML Table Constructs Within Queries
    4.
    Patent application
    Status: Active

    Publication No.: US20100145957A1

    Publication date: 2010-06-10

    Application No.: US12328642

    Filing date: 2008-12-04

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30935

    Abstract: Techniques are provided for estimating the cardinality of a virtual result table that is produced by executing path-based table functions within a query, such as the XMLTABLE function. Some path-based table functions apply a path expression to input from a base table of XML documents to select rows to produce the result table. Path statistics are collected for the path expressions for the base table. The path statistics are used to estimate the cardinalities of the result table. The estimated cardinality of the result table is useful for estimating costs of query execution plans that are generated for the query.

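    One way to picture path statistics feeding a cardinality estimate is the sketch below. The abstract does not give a formula, so the rule used here (average occurrences of the row path per document, multiplied by the number of input rows) is an assumption, and `collect_path_stats` and `estimate_cardinality` are hypothetical names.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from typing import Iterable, Tuple

PathStats = Tuple[Counter, int]  # (occurrences per path, number of documents)


def collect_path_stats(docs: Iterable[str]) -> PathStats:
    """Count how often each root-to-node path occurs across the base table."""
    counts: Counter = Counter()
    num_docs = 0

    def walk(elem: ET.Element, prefix: str) -> None:
        path = f"{prefix}/{elem.tag}"
        counts[path] += 1
        for child in elem:
            walk(child, path)

    for doc in docs:
        num_docs += 1
        walk(ET.fromstring(doc), "")
    return counts, num_docs


def estimate_cardinality(stats: PathStats, row_path: str, input_rows: int) -> float:
    """Assumed rule: average rows produced per document times input rows."""
    counts, num_docs = stats
    if num_docs == 0:
        return 0.0
    return counts[row_path] / num_docs * input_rows
```

    A query optimizer could compare such estimates across candidate plans; the statistics are collected once and reused for every query that applies the same path to the base table.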

    FAST EXTRACTION OF SCALAR VALUES FROM BINARY ENCODED XML
    5.
    Patent application
    Status: Active

    Publication No.: US20090307239A1

    Publication date: 2009-12-10

    Application No.: US12134196

    Filing date: 2008-06-06

    IPC classification: G06F17/30

    CPC classification: G06F17/30923

    Abstract: Techniques are provided for efficiently extracting scalar values from binary-encoded XML data. Node information is stored in association with binary-encoded XML data to indicate whether one or more nodes of an XML document are simple or complex. A node is simple if the node has no child elements and no attributes. The node information of a particular node is used to determine whether a particular node, identified in a query, is simple or complex. If the particular node is simple, then the scalar value of the particular node is identified without performing any operations other than possibly converting the scalar value to a non-binary-encoded format or converting the scalar value to a value of a different data type.

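    The simple/complex distinction can be sketched with a toy encoding. This is not the binary XML format the patent refers to: `EncodedNode`, `encode`, and `extract_scalar` are hypothetical names, UTF-8 bytes stand in for the binary payload, and only the first node at each path is recorded.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class EncodedNode:
    is_simple: bool   # node information stored alongside the encoded data
    payload: bytes    # stand-in for the binary-encoded scalar value


def encode(doc: str) -> Dict[str, EncodedNode]:
    """Record, per path, whether the node is simple and its raw payload."""
    nodes: Dict[str, EncodedNode] = {}

    def walk(elem: ET.Element, prefix: str) -> None:
        path = f"{prefix}/{elem.tag}"
        # Simple means: no child elements and no attributes.
        simple = len(elem) == 0 and not elem.attrib
        payload = (elem.text or "").encode("utf-8") if simple else b""
        nodes.setdefault(path, EncodedNode(simple, payload))
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(doc), "")
    return nodes


def extract_scalar(nodes: Dict[str, EncodedNode], path: str,
                   convert: Callable[[str], object] = str) -> object:
    """Fast path: for a simple node, only decode and optionally convert."""
    node = nodes[path]
    if not node.is_simple:
        raise ValueError(f"{path} is complex and needs full decoding")
    return convert(node.payload.decode("utf-8"))
```

    Because the flag is computed once at encoding time, a query such as extracting /order/total as an integer never has to inspect the node's structure again.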

    Techniques for more efficient generation of XML events from XML data sources
    6.
    Granted patent
    Status: Active

    Publication No.: US08543898B2

    Publication date: 2013-09-24

    Application No.: US11938020

    Filing date: 2007-11-09

    IPC classification: G06F17/00

    Abstract: One may increase the efficiency of an XML event-generating process by reducing the number of requests to allocate or deallocate system memory. Such reduction may occur as the result of pre-allocating a memory chunk of sufficient size to contain all of the memory buffers required by a particular event-generating process. Instead of allocating new memory chunks for new memory buffers, an application may store any required buffers within the pre-allocated memory chunk. A sufficient memory size may be estimated by performing the event-generating process on a training set of XML documents. Also, an application may re-use buffers during the process or between different iterations of the process, thus avoiding the need to deallocate and reallocate memory that is essentially being used for the same purpose.


    Fast extraction of scalar values from binary encoded XML
    7.
    Granted patent
    Status: Active

    Publication No.: US08429196B2

    Publication date: 2013-04-23

    Application No.: US12134196

    Filing date: 2008-06-06

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30923

    Abstract: Techniques are provided for efficiently extracting scalar values from binary-encoded XML data. Node information is stored in association with binary-encoded XML data to indicate whether one or more nodes of an XML document are simple or complex. A node is simple if the node has no child elements and no attributes. The node information of a particular node is used to determine whether a particular node, identified in a query, is simple or complex. If the particular node is simple, then the scalar value of the particular node is identified without performing any operations other than possibly converting the scalar value to a non-binary-encoded format or converting the scalar value to a value of a different data type.


    Estimating cardinalities of XML table constructs within queries
    8.
    Granted patent
    Status: Active

    Publication No.: US08200679B2

    Publication date: 2012-06-12

    Application No.: US12328642

    Filing date: 2008-12-04

    IPC classification: G06F17/00

    CPC classification: G06F17/30935

    Abstract: Techniques are provided for estimating the cardinality of a virtual result table that is produced by executing path-based table functions within a query, such as the XMLTABLE function. Some path-based table functions apply a path expression to input from a base table of XML documents to select rows to produce the result table. Path statistics are collected for the path expressions for the base table. The path statistics are used to estimate the cardinalities of the result table. The estimated cardinality of the result table is useful for estimating costs of query execution plans that are generated for the query.


    Incremental maintenance of an XML index on binary XML data
    9.
    Patent application
    Status: Active

    Publication No.: US20080098020A1

    Publication date: 2008-04-24

    Application No.: US11715603

    Filing date: 2007-03-07

    IPC classification: G06F17/30

    CPC classification: G06F17/30911 G06F17/30569

    Abstract: Techniques are provided for incrementally maintaining an XML index built to access XML data that is encoded in binary XML form. Rather than delete and reinsert index entries of all the nodes of a modified XML document, only the index entries of the affected nodes are modified. Consequently, the order key values stored in the index may become inconsistent with the current hierarchical locations of the nodes to which the order key values correspond. Techniques are described for resolving the inconsistencies, and for addressing additional problems that result when the XML index is path-subsetted.


    Efficient partitioning technique while managing large XML documents
    10.
    Patent application
    Status: Active

    Publication No.: US20080091714A1

    Publication date: 2008-04-17

    Application No.: US11716074

    Filing date: 2007-03-08

    IPC classification: G06F17/00

    CPC classification: G06F17/30923

    Abstract: A method is provided to efficiently evaluate an expression to determine the partition key for an XML document stored in a database without the entire XML document first being stored in temporary memory storage. The partition key is determined using streaming evaluation or incrementally using a DOM node tree as a portion of the document is read and stored in the buffer. The XML document is stored in the partition using the read portion of the document stored in the buffer and the remaining portion from the original source.

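    The streaming variant described in this abstract can be sketched as follows. It is an illustration under stated assumptions, not the patented method: the partition key is taken to be the text of a single element, partitions are modeled as an in-memory dictionary, and `store_partitioned` is a hypothetical name.

```python
import xml.etree.ElementTree as ET
from typing import BinaryIO, Dict, List, Optional


def store_partitioned(source: BinaryIO, key_tag: str,
                      partitions: Dict[str, List[bytes]],
                      chunk_size: int = 16) -> Optional[str]:
    """Find the partition key while reading, then store buffer + remainder."""
    parser = ET.XMLPullParser(events=("end",))
    buffered: List[bytes] = []   # the portion of the document read so far
    key: Optional[str] = None

    while key is None:
        chunk = source.read(chunk_size)
        if not chunk:
            break                # source exhausted without finding the key
        buffered.append(chunk)
        parser.feed(chunk)
        for _event, elem in parser.read_events():
            if elem.tag == key_tag:
                key = (elem.text or "").strip()
                break

    if key is None:
        return None
    # Store the read portion from the buffer plus the unread remainder
    # taken directly from the original source.
    document = b"".join(buffered) + source.read()
    partitions.setdefault(key, []).append(document)
    return key
```

    Reading stops being parsed as soon as the key element closes, so only the leading portion of a large document passes through the parser before the partition is chosen.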