Inferring schemas from XML document collections
    182.
    发明授权
    Inferring schemas from XML document collections 有权
    从XML文档集合推导模式

    公开(公告)号:US08868482B2

    公开(公告)日:2014-10-21

    申请号:US12052510

    申请日:2008-03-20

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30914

    摘要: In an automated fashion, a collection of XML documents are traversed and statistical data is generated describing hierarchical relationships of the elements, the elements data types and values. Based the analysis, an XML schema is derived such that the collection of XML documents conforms to the schema to a certain degree.

    摘要翻译: 以自动方式,遍历XML文档的集合,并生成描述元素的层次关系,元素数据类型和值的统计数据。 基于分析,导出XML模式,使得XML文档的集合在一定程度上符合模式。

    Mechanisms for ranking XML tags
    183.
    发明授权
    Mechanisms for ranking XML tags 有权
    排序XML标签的机制

    公开(公告)号:US08560535B2

    公开(公告)日:2013-10-15

    申请号:US12358610

    申请日:2009-01-23

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F7/04

    CPC分类号: G06F17/30908

    摘要: XML Schema design often involves repeating work already done by others. Schema designers benefit from a system that helps designers choose from a set of candidate XML tags already in use by others. Mechanisms are described for providing decision support in the selection of candidate XML tags based on tags that are in use within a collection of XML documents. The approaches presented include a static analysis of tags which reflects the use of such tags in documents considered to be of high quality. A real-time analysis of tags considers the shared context of candidate XML tags with the set of tags already selected for use in the schema under design. Measure values resulting from the static analysis and/or the real-time analysis are used to derive an overall ranking measure for each candidate tag, and the candidate tags may be displayed in rank order of their measure values.

    摘要翻译: XML Schema设计通常涉及重复他人已经完成的工作。 模式设计师从一个系统中受益,这个系统可帮助设计者从已经被其他人使用的一组候选XML标签中进行选择。 描述了基于在XML文档集合内使用的标签来选择候选XML标签的决策支持的机制。 所提出的方法包括反映在被认为是高质量的文件中使用这种标签的标签的静态分析。 标签的实时分析考虑了候选XML标签的共享上下文与已经选择在设计中使用的模式中的标签集合。 由静态分析和/或实时分析得到的测量值用于导出每个候选标签的总体排名测量,并且可以以其测量值的等级顺序显示候选标签。

    Method and mechanism for handling arbitrarily-sized XML in SQL operator tree
    184.
    发明授权
    Method and mechanism for handling arbitrarily-sized XML in SQL operator tree 有权
    在SQL运算符树中处理任意大小的XML的方法和机制

    公开(公告)号:US07873645B2

    公开(公告)日:2011-01-18

    申请号:US10656800

    申请日:2003-09-05

    IPC分类号: G06F7/00 G06F17/30

    摘要: Disclosed is a method and mechanism for processing expressions and operator trees. An approach is particularly useful to optimize processing of XML statements with respect to SQL operators. A top-down processing approach can be taken to directly output data from operators to a data stream. In addition, multiple processing approaches can be taken within a single expression tree, with some operators processed using the top-down approach and other operators processed with the bottom-up approach. Even data that can not be streamed is copied fewer times using this approach, intermediate values from bottom-up processing may still be streamed if it is used by an operator that is eligible for top-down processing.

    摘要翻译: 公开了一种处理表达式和运算符树的方法和机制。 一种方法对于优化与SQL操作符相关的XML语句的处理特别有用。 可以采用自上而下的处理方式将数据从运算符直接输出到数据流。 此外,可以在单个表达式树中采用多种处理方法,一些运算符使用自上而下的方法进行处理,其他运算符也使用自下而上的方法进行处理。 即使不能流式传输的数据也可以使用这种方法被复制较少次,如果由符合自上而下处理的操作符使用,则自下而上处理的中间值可能仍然被流式传输。

    MECHANISM FOR EFFICIENTLY SEARCHING XML DOCUMENT COLLECTIONS
    185.
    发明申请
    MECHANISM FOR EFFICIENTLY SEARCHING XML DOCUMENT COLLECTIONS 有权
    有效搜索XML文档收集的机制

    公开(公告)号:US20100228734A1

    公开(公告)日:2010-09-09

    申请号:US12391818

    申请日:2009-02-24

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30569 G06F17/30911

    摘要: The techniques presented herein are directed towards providing a user-directed keyword-based search on a large collection of XML documents, and displaying a summary of results to the user. Prior to receiving search requests from a user, an offline analysis of a large collection of XML documents is performed to construct an inverted index of keywords. For each keyword, the index stores a set of location indicators that identify all the instances of the keyword found in the collection of documents. A location indicator may comprise a document identifier, an indication of the position of the node in the hierarchy of nodes within the XML document containing the keyword, and an indication of the pathname of the node containing the keyword. Once the index is constructed, keyword searching can be done efficiently by a keyword lookup in the index. Various display strategies enable the user to see the specific portion of a large XML document containing the keyword and/or path frequency information allowing the user to easily refine the search to specific paths within the collection of documents.

    摘要翻译: 本文提出的技术旨在提供对大量XML文档集合的用户导向的基于关键字的搜索,并向用户显示结果摘要。 在从用户接收到搜索请求之前,执行大量XML文档的离线分析以构造关键字的反向索引。 对于每个关键字,索引存储一组位置指示符,用于标识在文档集合中找到的关键字的所有实例。 位置指示符可以包括文档标识符,在包含关键字的XML文档内的节点的层次结构中节点的位置的指示,以及包含该关键字的节点的路径名的指示。 一旦构建了索引,可以通过索引中的关键字查找来高效地进行关键字搜索。 各种显示策略使得用户能够看到包含关键字和/或路径频率信息的大型XML文档的特定部分,允许用户容易地将搜索精简到文档集合内的特定路径。

    Inferring Schemas From XML Document Collections
    186.
    发明申请
    Inferring Schemas From XML Document Collections 有权
    从XML文档集合推导模式

    公开(公告)号:US20090240712A1

    公开(公告)日:2009-09-24

    申请号:US12052510

    申请日:2008-03-20

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30914

    摘要: In an automated fashion, a collection of XML documents are traversed and statistical data is generated describing hierarchical relationships of the elements, the elements data types and values. Based the analysis, an XML schema is derived such that the collection of XML documents conforms to the schema to a certain degree.

    摘要翻译: 以自动方式,遍历XML文档的集合,生成描述元素的层次关系,元素数据类型和值的统计数据。 基于分析,导出XML模式,使得XML文档的集合在一定程度上符合模式。

    Techniques for efficient integration of text searching with queries over XML data
    187.
    发明授权
    Techniques for efficient integration of text searching with queries over XML data 有权
    通过XML数据高效集成文本搜索与查询的技术

    公开(公告)号:US07305414B2

    公开(公告)日:2007-12-04

    申请号:US11099918

    申请日:2005-04-05

    IPC分类号: G06F17/30

    摘要: A method and apparatus for rewriting a database command containing an embedded XML expression such that the rewritten database command recites a text function, in lieu of the embedded XML expression, is provided. Advantageously, a DBMS may take advantage of the efficiencies in storing XML data within the database, while avoiding the generation of unnecessary XML elements in processing the query when the XML elements contribute nothing to the outcome of the query. Cost-base or rule-based analysis may be performed to determine how to rewrite a received database command. The database server may functionally evaluate the text function or may use an index defined on a column of the database. The text function may function as a primary filter or may reference a column upon which an index is defined, wherein the index operates at the same or higher level than a column being referenced in the embedded XML expression.

    摘要翻译: 提供一种用于重写包含嵌入式XML表达式的数据库命令的方法和装置,使得重写的数据库命令背景文本功能代替嵌入的XML表达。 有利地,DBMS可以利用在数据库中存储XML数据的效率,同时避免在XML元素对查询的结果没有贡献时在处理查询时生成不必要的XML元素。 可以执行基于成本或基于规则的分析来确定如何重写接收到的数据库命令。 数据库服务器可以在功能上评估文本功能,或者可以使用在数据库的列上定义的索引。 文本函数可以用作主过滤器,或者可以引用定义索引的列,其中索引在与嵌入式XML表达式中引用的列相同或更高的级别上操作。

    Techniques for efficient access control in a database system
    188.
    发明申请
    Techniques for efficient access control in a database system 审中-公开
    数据库系统中高效访问控制的技术

    公开(公告)号:US20070276835A1

    公开(公告)日:2007-11-29

    申请号:US11442002

    申请日:2006-05-26

    申请人: Ravi Murthy

    发明人: Ravi Murthy

    IPC分类号: G06F17/30

    CPC分类号: G06F21/6227

    摘要: Access control rewrites generate rewritten queries that may be executed more efficiently using index evaluation to determine which rows satisfy one or more access control conditions.

    摘要翻译: 访问控制重写生成可以使用索引评估更有效地执行的重写查询,以确定哪些行满足一个或多个访问控制条件。