Efficient evaluation of XQuery and XPath full text extension
    11.
    发明授权
    Efficient evaluation of XQuery and XPath full text extension 有权
    高效评估XQuery和XPath全文扩展

    公开(公告)号:US08312030B2

    公开(公告)日:2012-11-13

    申请号:US12388249

    申请日:2009-02-18

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30929

    摘要: Techniques are provided for efficiently evaluating XML queries that conform to an extension of an XML language (e.g., XQuery or XPath). The extension allows XML queries to have full-text search capabilities. Such an XML query is compiled to generate a tree of nodes that correspond to one or more conditions in the full-text portion of the query. In one technique, the amount of memory for the execution state of the tree is determined at compile time and allocated only once throughout execution of the query. In another technique, to ensure at most a single scan of a document, all the words or phrases in the full-text portion of an XML query are located before any of the other conditions in the full-text portion are evaluated. In another technique, the elements of the full-text portion of an XML query are analyzed to determine, based at least in part on cost, which evaluation strategy, of a plurality of evaluation strategies, should be employed.

    摘要翻译: 提供了有效评估符合XML语言(例如,XQuery或XPath)的扩展的XML查询的技术。 该扩展允许XML查询具有全文搜索功能。 编译这样的XML查询以生成与查询的全文部分中的一个或多个条件相对应的节点树。 在一种技术中,树的执行状态的内存量在编译时确定,并且在执行查询时只分配一次。 在另一种技术中,为了确保文档的单次扫描,XML查询的全文部分中的所有单词或短语位于全文部分中的任何其他条件之前。 在另一技术中,分析XML查询的全文部分的元素,至少部分地基于成本来确定应当采用多个评估策略的哪个评估策略。

    EFFICIENTLY REGISTERING A RELATIONAL SCHEMA
    12.
    发明申请
    EFFICIENTLY REGISTERING A RELATIONAL SCHEMA 有权
    有效地注册关系图

    公开(公告)号:US20120136868A1

    公开(公告)日:2012-05-31

    申请号:US12956264

    申请日:2010-11-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30312 G06F17/30595

    摘要: A method, device, and non-transitory computer-readable storage medium are provided for efficiently registering a relational schema. In co-compilation and data guide approaches, a subset of entities from schema descriptions are selected for physical registration, and other entities from the schema descriptions are not physically registered. In the co-compilation approach, a first schema description references a second schema description, and the subset includes a set of entities from the second schema description that are used by the first schema description. In the data guide approach, the subset includes entities that are used by a set of structured documents. In a pay-as-you-go approach, schema registration includes logically registering entities without creating relational database structures corresponding to the entities. A database server may execute database commands that reference the logically registered entities. A request to store data for the entities may be executed by creating relational database structures to store the data.

    摘要翻译: 提供了一种用于有效地注册关系模式的方法,设备和非暂时计算机可读存储介质。 在共同编译和数据指南方法中,选择了来自模式描述的实体的子集用于物理注册,并且来自模式描述的其他实体没有物理注册。 在共同编译方法中,第一模式描述引用第二模式描述,并且该子集包括来自第一模式描述使用的第二模式描述的一组实体。 在数据指南方法中,子集包括由一组结构化文档使用的实体。 在现收现付方式中,模式注册包括逻辑注册实体,而不创建与实体相对应的关系数据库结构。 数据库服务器可以执行引用逻辑注册实体的数据库命令。 可以通过创建关系数据库结构来存储数据来执行用于存储实体的数据的请求。

    Indexing strategy with improved DML performance and space usage for node-aware full-text search over XML
    13.
    发明授权
    Indexing strategy with improved DML performance and space usage for node-aware full-text search over XML 有权
    具有改进的DML性能的索引策略和通过XML的节点感知全文搜索的空间使用

    公开(公告)号:US08126932B2

    公开(公告)日:2012-02-28

    申请号:US12346393

    申请日:2008-12-30

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30911

    摘要: Techniques are provided for searching within a collection of XML documents. A relational table stores an entry for each node of a set of nodes in a collection of XML documents. Each entry of the relational table stores an order key and a path identifier along with the atomized value of the node. Instead of storing the atomized value in a full-text index, a virtual column can be created to represent, for each node, the atomized value of the node. Alternately, each entry of the relational table stores an order key and a path identifier along with, for simple nodes, the atomized value, and for complex nodes, a null value. For a complex node with a descendant text node, a separate entry is stored for the descendant text node in the relational table.

    摘要翻译: 提供了在XML文档集合内进行搜索的技术。 关系表存储XML文档集合中的一组节点的每个节点的条目。 关系表的每个条目存储订单密钥和路径标识符以及节点的雾化值。 不必将全部文本索引中的雾化值存储起来,可以创建虚拟列,以便为每个节点表示节点的雾化值。 或者,关系表的每个条目存储订单密钥和路径标识符,对于简单节点,存在雾化值,对于复杂节点存储空值。 对于具有后代文本节点的复杂节点,为关系表中的后代文本节点存储单独的条目。

    Creating storage for XML schemas with limited numbers of columns per table
    14.
    发明授权
    Creating storage for XML schemas with limited numbers of columns per table 有权
    为每个表格的列数量有限的XML模式创建存储

    公开(公告)号:US08103695B2

    公开(公告)日:2012-01-24

    申请号:US12122589

    申请日:2008-05-16

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30917

    摘要: Techniques are described herein for automatically generating multiple interrelated database tables to store XML data, while ensuring that each such table has no more than the maximum DBMS-allowed number of columns. In response to the registration of an XML schema with a database server, the server determines whether any of the elements specified in the XML schema are complex elements that have more than a threshold number of descendant elements. If a complex element has more than the threshold number of descendant elements, then the server automatically generates one or more separate “out-of-line” database tables for storing at least some of those descendant elements, so that the table created to store the complex element will have no more than the permitted number of columns. Each of the out-of-line database tables is similarly generated so as to have no more than the permitted number of columns.

    摘要翻译: 这里描述了用于自动生成多个相互关联的数据库表以存储XML数据的技术,同时确保每个这样的表具有不超过DBMS允许的最大列数。 响应于XML模式与数据库服务器的注册,服务器确定XML模式中指定的任何元素是否是具有多于阈值数量的后代元素的复杂元素。 如果复杂元素具有超过阈值数量的后代元素,则服务器自动生成一个或多个单独的“行外”数据库表,用于存储这些后代元素中的至少一些,使得创建用于存储 复杂元素将不超过允许的列数。 类似地生成每个外联数据库表,以便不超过允许的列数。

    Searching backward to speed up query
    15.
    发明授权
    Searching backward to speed up query 有权
    向后搜索以加快查询速度

    公开(公告)号:US08566343B2

    公开(公告)日:2013-10-22

    申请号:US12871869

    申请日:2010-08-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30539

    摘要: A method, computing device, and a non-transitory computer-readable medium are provided for performing a context-aware search by finding a set of nodes that are mapped to a given text or other value and, for each node in the set of nodes, performing a reverse path lookup to determine whether the node satisfies a given context. The query processor performs the reverse path lookup for a node by traversing up a node tree away from the node, using a stored mapping from the node to a parent of the node. Using mappings from nodes to parent nodes, the node tree is traversed backwards from the node up to distant ancestor nodes through parent nodes. An optimizer instructs the query processor to perform a value-based portion of the search before a path-based portion of the search based on value distribution statistics and path distribution statistics.

    摘要翻译: 提供了一种方法,计算设备和非暂时计算机可读介质,用于通过找到映射到给定文本或其他值的节点集合来执行上下文感知搜索,并且对于节点集合中的每个节点 ,执行反向路径查找以确定节点是否满足给定的上下文。 查询处理器通过使用存储的从节点到节点的父节点的映射来遍历节点之外的节点树来执行对节点的反向路径查找。 使用从节点到父节点的映射,节点树通过父节点从节点向后穿过远处的祖先节点。 优化器指示查询处理器在基于值分布统计和路径分布统计的基于路径的搜索部分之前执行搜索的基于值的部分。

    Indexing Mechanism for Efficient Node-Aware Full-Text Search Over XML
    16.
    发明申请
    Indexing Mechanism for Efficient Node-Aware Full-Text Search Over XML 有权
    用于高效节点感知的索引机制通过XML进行全文搜索

    公开(公告)号:US20100169354A1

    公开(公告)日:2010-07-01

    申请号:US12346327

    申请日:2008-12-30

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30911

    摘要: Techniques are provided for searching within a collection of XML documents. A relational table in an XML index stores an entry for each node of a set of nodes in the collection. Each entry of the relational table stores an order key and a path identifier along with the atomized value of the node. An index on the atomized value provides a mechanism to perform a node-aware full-text search. Instead of storing the atomized value in the table, a virtual column may be created to represent, for each node, the atomized value of the node. Alternately, each entry of the relational table stores an order key and a path identifier along with, for simple nodes, the atomized value, and for complex nodes, a null value. For a complex node with a descendant text node, a separate entry is stored for the descendant text node in the relational table.

    摘要翻译: 提供了在XML文档集合内进行搜索的技术。 XML索引中的关系表存储集合中一组节点的每个节点的条目。 关系表的每个条目存储订单密钥和路径标识符以及节点的雾化值。 雾化值上的索引提供了执行节点感知全文搜索的机制。 不用在表中存储雾化值,而是可以创建一个虚拟列,以便为每个节点表示节点的雾化值。 或者,关系表的每个条目存储订单密钥和路径标识符,对于简单节点,存在雾化值,对于复杂节点存储空值。 对于具有后代文本节点的复杂节点,为关系表中的后代文本节点存储单独的条目。

    CREATING STORAGE FOR XML SCHEMAS WITH LIMITED NUMBERS OF COLUMNS PER TABLE
    17.
    发明申请
    CREATING STORAGE FOR XML SCHEMAS WITH LIMITED NUMBERS OF COLUMNS PER TABLE 有权
    创建具有每表列的有限数量的XML方案的存储

    公开(公告)号:US20090287719A1

    公开(公告)日:2009-11-19

    申请号:US12122589

    申请日:2008-05-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30917

    摘要: Techniques are described herein for automatically generating multiple interrelated database tables to store XML data, while ensuring that each such table has no more than the maximum DBMS-allowed number of columns. In response to the registration of an XML schema with a database server, the server determines whether any of the elements specified in the XML schema are complex elements that have more than a threshold number of descendant elements. If a complex element has more than the threshold number of descendant elements, then the server automatically generates one or more separate “out-of-line” database tables for storing at least some of those descendant elements, so that the table created to store the complex element will have no more than the permitted number of columns. Each of the out-of-line database tables is similarly generated so as to have no more than the permitted number of columns.

    摘要翻译: 这里描述了用于自动生成多个相互关联的数据库表以存储XML数据的技术,同时确保每个这样的表具有不超过DBMS允许的最大列数。 响应于XML模式与数据库服务器的注册,服务器确定XML模式中指定的任何元素是否是具有多于阈值数量的后代元素的复杂元素。 如果复杂元素具有超过阈值数量的后代元素,则服务器自动生成一个或多个单独的“行外”数据库表,用于存储这些后代元素中的至少一些,使得创建用于存储 复杂元素将不超过允许的列数。 类似地生成每个外联数据库表,以便不超过允许的列数。

    Method and system for implementing efficient updatable relational views over XML data
    18.
    发明授权
    Method and system for implementing efficient updatable relational views over XML data 有权
    用于实现XML数据的高效可更新关系视图的方法和系统

    公开(公告)号:US09396284B2

    公开(公告)日:2016-07-19

    申请号:US13110671

    申请日:2011-05-18

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30926 G06F17/30932

    摘要: Disclosed is a method, system, and program product for providing an efficient mechanism to provide updateable relational views over XML. This will allow application developers to write both queries and updates using a relational (SQL) syntax, even though the underlying data is stored as XML. According to some approaches, this is accomplished by implementing a reverse mapping from the data within the views to the node addresses of the corresponding data in the underlying XML document.

    摘要翻译: 公开了一种方法,系统和程序产品,用于提供有效的机制以通过XML提供可更新的关系视图。 这将允许应用程序开发人员使用关系(SQL)语法编写查询和更新,即使底层数据存储为XML。 根据一些方法,这通过实现从视图内的数据到底层XML文档中相应数据的节点地址的反向映射来实现。

    Leveraging structured XML index data for evaluating database queries
    19.
    发明授权
    Leveraging structured XML index data for evaluating database queries 有权
    利用结构化XML索引数据来评估数据库查询

    公开(公告)号:US08880506B2

    公开(公告)日:2014-11-04

    申请号:US12580923

    申请日:2009-10-16

    IPC分类号: G06F17/30 G06F15/16

    CPC分类号: G06F17/30932 G06F17/30935

    摘要: A query may be rewritten to leverage information stored in a structured XML index. An operator in the query may be analyzed to determine an input source database object for the operator by traversing an operator tree rooted at the operator. The path expressions associated with the operator tree may be fused together to form an effective path expression for the operator. If the effective path expression directly matches a path expression derived from the index, the query may be rewritten using references to the index. Operators in a query that have effective paths that refer to data in the same index table may be grouped together. A single subquery may be written for a group of operators. Also, a structured XML index may be used as an implied schema for indexed XML data. This implied schema may be used to optimize queries that refer to the indexed XML data.

    摘要翻译: 可以重写查询以利用存储在结构化XML索引中的信息。 可以分析查询中的运算符以通过遍历运算符的操作符树来确定运算符的输入源数据库对象。 与运算符树相关联的路径表达式可以被融合在一起以形成操作者的有效路径表达式。 如果有效路径表达式直接匹配从索引导出的路径表达式,则可以使用对索引的引用来重写查询。 具有指向相同索引表中的数据的有效路径的查询中的运算符可以被分组在一起。 可以为一组运营商编写单个子查询。 此外,结构化XML索引可以用作索引XML数据的隐含模式。 此隐含模式可用于优化引用索引XML数据的查询。

    Providing context aware search adaptively
    20.
    发明授权
    Providing context aware search adaptively 有权
    自适应地提供上下文感知搜索

    公开(公告)号:US08447785B2

    公开(公告)日:2013-05-21

    申请号:US12792659

    申请日:2010-06-02

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30929

    摘要: A method, computing device, and a non-transitory computer-readable medium is provided for facilitating a context-aware search on documents stored in an unstructured database column. The database server determines whether a document stored in an unstructured database object is marked up according to a recognized markup language. Documents that conform to the markup language are indexed with value-to-node mappings and node-to-physical-location mappings to provide a context-aware search of marked up nodes within the documents. Documents that do not conform to the markup language are indexed with value-to-document mappings to provide a full-text search of the documents. The database server evaluates a context-aware search against documents that conform to the markup language within the unstructured column without requiring the user identify the documents as XML documents and store the documents in an XMLType structured column.

    摘要翻译: 提供了一种方法,计算设备和非暂时计算机可读介质,用于便于对存储在非结构化数据库列中的文档的上下文感知搜索。 数据库服务器确定存储在非结构化数据库对象中的文档是否根据识别的标记语言进行标记。 符合标记语言的文档使用值到节点映射和节点到物理位置映射进行索引,以提供文档中标记节点的上下文感知搜索。 不符合标记语言的文档使用值到文档的映射进行索引,以提供文档的全文搜索。 数据库服务器根据非结构化列中符合标记语言的文档评估上下文感知搜索,而不需要用户将文档标识为XML文档,并将文档存储在XMLType结构化列中。