ELEMENT QUERY METHOD AND SYSTEM
    1.
    Patent application
    ELEMENT QUERY METHOD AND SYSTEM (under examination, published)

    Publication No.: US20080010256A1

    Publication date: 2008-01-10

    Application No.: US11758306

    Filing date: 2007-06-05

    IPC classification: G06F17/30

    CPC classification: G06F16/81

    Abstract: Methods, systems, and computer-readable media for representing and querying positional information for a hierarchical document (such as an XML document) are disclosed. In one set of embodiments, at least one word in the hierarchical document is associated with one or more word positions, and at least one element in the hierarchical document is associated with one or more word position ranges. The word positions and word position ranges are analyzed to determine whether a particular word or phrase is a direct or indirect descendant of a particular element in the hierarchical document. In various embodiments, the word positions are indexed in a first index and the word position ranges are indexed in a second index. Thus, the analysis may be efficiently performed by intersecting the first and second indexes. In further embodiments, the word position ranges may be encoded in a space-efficient format for storage or transmittal.

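    The containment test described in the abstract can be illustrated with a minimal sketch. It assumes a first index mapping each word to its sorted word positions and a second index mapping each element name to inclusive (start, end) word position ranges. The names `word_index`, `element_index`, and `word_in_element` and the sample data are hypothetical, not taken from the application.

```python
from bisect import bisect_left

# Hypothetical first index: word -> sorted word positions in the document.
word_index = {
    "query": [2, 7],
    "index": [4],
}

# Hypothetical second index: element name -> inclusive (start, end) position ranges.
element_index = {
    "title": [(0, 3)],
    "body": [(4, 9)],
    "para": [(4, 6), (7, 9)],
}


def word_in_element(word, element):
    """Return the positions of `word` that fall inside any range of `element`."""
    positions = word_index.get(word, [])
    hits = []
    for start, end in element_index.get(element, []):
        # Jump to the first position that is not before the range start.
        i = bisect_left(positions, start)
        while i < len(positions) and positions[i] <= end:
            hits.append(positions[i])
            i += 1
    # Nested same-named elements could report a position twice, so deduplicate.
    return sorted(set(hits))
```

    A non-empty result means the word occurs somewhere below the element, directly or indirectly, because an enclosing element's range covers the ranges of everything nested in it.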

    POINT-IN-TIME QUERY METHOD AND SYSTEM
    2.
    Patent application
    POINT-IN-TIME QUERY METHOD AND SYSTEM (under examination, published)

    Publication No.: US20070271242A1

    Publication date: 2007-11-22

    Application No.: US11750966

    Filing date: 2007-05-18

    IPC classification: G06F17/30

    CPC classification: G06F16/83 G06F16/24526

    Abstract: Embodiments of the present invention include storing a plurality of subtrees in a database, the plurality of subtrees representing one or more structured documents. At least one subtree has a birth timestamp indicating a time at which the at least one subtree was created. If a subtree has been obsoleted, the subtree has a death timestamp indicating a time at which the subtree was obsoleted. Embodiments further include receiving a database query comprising a query string and a query timestamp, the query timestamp indicating a historical time for which the query is to apply, and determining an intermediate result list of subtrees. The intermediate result list is filtered to generate a final result list responsive to the database query, the filtering comprising removing subtrees that do not have a birth timestamp, have a birth timestamp later than the query timestamp, or have a death timestamp earlier than the query timestamp.

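    The filtering step in the abstract can be sketched as follows. This is an illustration under stated assumptions, not the applicant's implementation: timestamps are plain integers, a missing timestamp is `None`, and the `Subtree` class and `filter_at` function are hypothetical names. The boundary handling follows the abstract's wording literally, so a subtree whose birth or death timestamp equals the query timestamp is kept.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Subtree:
    name: str
    birth: Optional[int]          # None means no birth timestamp
    death: Optional[int] = None   # None means the subtree was never obsoleted


def filter_at(intermediate: List[Subtree], query_ts: int) -> List[Subtree]:
    """Keep only the subtrees that were visible at `query_ts`."""
    final = []
    for subtree in intermediate:
        if subtree.birth is None:
            continue  # no birth timestamp
        if subtree.birth > query_ts:
            continue  # created after the queried time
        if subtree.death is not None and subtree.death < query_ts:
            continue  # obsoleted before the queried time
        final.append(subtree)
    return final


# Hypothetical intermediate result list.
results = [
    Subtree("a", birth=10),
    Subtree("b", birth=10, death=20),
    Subtree("c", birth=30),
    Subtree("d", birth=None),
]
```

    With this data, `filter_at(results, 25)` keeps only subtree "a", while `filter_at(results, 15)` keeps "a" and "b".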

    PARENT-CHILD QUERY INDEXING FOR XML DATABASES
    3.
    Patent application
    PARENT-CHILD QUERY INDEXING FOR XML DATABASES (in force)

    Publication No.: US20070168327A1

    Publication date: 2007-07-19

    Application No.: US11567676

    Filing date: 2006-12-06

    IPC classification: G06F17/30

    Abstract: A method for processing queries for a document of elements is provided. The document includes a plurality of subsections where each subsection includes at least a portion of elements in the document. The method comprises: receiving a query for a path of elements in the document of elements; determining a plurality of step queries from the query, each step query including at least a part of the path of elements; for each step query in the plurality of step queries, determining one or more subsections that include elements that correspond to a step query; and determining at least one subsection that includes the path of elements of the query. A result for the query is generated using the at least one subsection.

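    One way to picture the step-query approach in the abstract is the sketch below. It assumes that each step query is a (parent, child) pair of adjacent element names in the path and that an index maps each pair to the IDs of the subsections containing it; both are illustrative assumptions suggested by the title, and `step_queries` and `candidate_subsections` are hypothetical names.

```python
def step_queries(path):
    """Split a path such as '/book/chapter/title' into (parent, child) steps."""
    names = [name for name in path.split("/") if name]
    return list(zip(names, names[1:]))


def candidate_subsections(path, index):
    """Intersect the subsection sets of every step query in `path`."""
    steps = step_queries(path)
    if not steps:
        return set()
    result = None
    for step in steps:
        ids = index.get(step, set())
        result = set(ids) if result is None else result & ids
        if not result:
            return set()  # some step matches no remaining subsection
    return result


# Hypothetical index: (parent, child) -> IDs of subsections containing that pair.
parent_child_index = {
    ("book", "chapter"): {1, 2, 3},
    ("chapter", "title"): {2, 3, 5},
}
```

    The intersection yields only candidates: a subsection that contains every step is not guaranteed to contain the steps as one connected path, so a complete system would still verify the path inside each candidate before generating the result.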

    Apparatus and Method for Forming and Using a Tree Structured Database with Top-Down Trees and Bottom-Up Indices
    7.
    Patent application
    Apparatus and Method for Forming and Using a Tree Structured Database with Top-Down Trees and Bottom-Up Indices (under examination, published)

    Publication No.: US20130297657A1

    Publication date: 2013-11-07

    Application No.: US13461701

    Filing date: 2012-05-01

    IPC classification: G06F17/30

    CPC classification: G06F16/9027

    Abstract: A method for loading information into a tree structured database includes receiving a document and forming a top-down tree characterizing the document. Leaf nodes in the top-down tree are identified. Bottom-up indices are formed for the leaf nodes, where the bottom-up indices characterize paths from selected leaf nodes to a root node of the top-down tree. The top-down tree and bottom-up indices are stored as separately searchable entities in the tree structured database.

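    The loading method in the abstract can be sketched roughly as follows. The sketch assumes the top-down tree is a nested dictionary whose non-dictionary values are the leaf nodes, and that a bottom-up index maps each leaf value to its leaf-to-root paths. `leaf_paths`, `build_bottom_up_index`, and the sample document are hypothetical.

```python
def leaf_paths(tree, path=()):
    """Yield (leaf_value, top_down_path) for every leaf of a nested dict."""
    for key, child in tree.items():
        here = path + (key,)
        if isinstance(child, dict):
            yield from leaf_paths(child, here)
        else:
            yield child, here


def build_bottom_up_index(tree):
    """Map each leaf value to the list of its leaf-to-root paths."""
    index = {}
    for value, top_down in leaf_paths(tree):
        index.setdefault(value, []).append(tuple(reversed(top_down)))
    return index


# Hypothetical document, already parsed into a top-down tree.
top_down_tree = {"book": {"title": "XML", "author": {"name": "Ann"}}}

# The tree and the index are kept as two separately searchable structures.
database = {
    "tree": top_down_tree,
    "bottom_up": build_bottom_up_index(top_down_tree),
}
```

    A lookup that starts from a value, such as `database["bottom_up"]["Ann"]`, returns the path `("name", "author", "book")` without walking the tree from the root.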

    Parent-child query indexing for XML databases
    8.
    Granted patent
    Parent-child query indexing for XML databases (in force)

    Publication No.: US07171404B2

    Publication date: 2007-01-30

    Application No.: US10462019

    Filing date: 2003-06-13

    IPC classification: G06F17/30 G06F15/16

    Abstract: A method for processing queries for a document of elements is provided. The document includes a plurality of subsections where each subsection includes at least a portion of elements in the document. The method comprises: receiving a query for a path of elements in the document of elements; determining a plurality of step queries from the query, each step query including at least a part of the path of elements; for each step query in the plurality of step queries, determining one or more subsections that include elements that correspond to a step query; and determining at least one subsection that includes the path of elements of the query. A result for the query is generated using the at least one subsection.


    Parent-child query indexing for xml databases
    9.
    Granted patent
    Parent-child query indexing for xml databases (in force)

    Publication No.: US07756858B2

    Publication date: 2010-07-13

    Application No.: US11567676

    Filing date: 2006-12-06

    IPC classification: G06F17/30

    Abstract: A method for processing queries for a document of elements is provided. The document includes a plurality of subsections where each subsection includes at least a portion of elements in the document. The method comprises: receiving a query for a path of elements in the document of elements; determining a plurality of step queries from the query, each step query including at least a part of the path of elements; for each step query in the plurality of step queries, determining one or more subsections that include elements that correspond to a step query; and determining at least one subsection that includes the path of elements of the query. A result for the query is generated using the at least one subsection.


    XML Database Mixed Structural-Textual Classification System
    10.
    Patent application
    XML Database Mixed Structural-Textual Classification System (under examination, published)

    Publication No.: US20070136250A1

    Publication date: 2007-06-14

    Application No.: US11531738

    Filing date: 2006-09-14

    IPC classification: G06F17/30

    Abstract: One aspect of the present invention is a system for classifying element nodes in a subtree-structured XML database. The XQE structural-textual classification system is sensitive to both the textual resemblance and the structural resemblance between document elements. The XQE structural-textual classification system might use the XQE parent-child index described in Lindblad II-A for the purpose of forming vectors of “terms” which encode both the structural and the textual content of XML elements. The element vectors are processed by a classifier to create class prototype vectors which can be used to classify elements as they are added to the database.

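    The idea of mixed structural-textual term vectors and class prototype vectors can be illustrated with a small sketch. The abstract does not specify the term encoding or the classifier, so everything here is an assumption: structural terms are "parent/child" strings, textual terms are lowercased words, a prototype is the sum of a class's element vectors, and classification picks the prototype with the highest cosine similarity. All function names are hypothetical.

```python
from collections import Counter
from math import sqrt


def element_terms(parent, name, text):
    """Build a term vector mixing one structural term with the element's words."""
    terms = Counter()
    terms[f"struct:{parent}/{name}"] += 1
    for word in text.lower().split():
        terms[f"word:{word}"] += 1
    return terms


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_prototypes(labeled_vectors):
    """Sum the vectors of each class into one prototype vector per class."""
    prototypes = {}
    for label, vector in labeled_vectors:
        prototypes.setdefault(label, Counter()).update(vector)
    return prototypes


def classify(vector, prototypes):
    """Return the label whose prototype is most similar to `vector`."""
    return max(prototypes, key=lambda label: cosine(vector, prototypes[label]))


# Hypothetical training elements.
prototypes = build_prototypes([
    ("recipe", element_terms("cookbook", "recipe", "mix flour and sugar")),
    ("review", element_terms("blog", "review", "great film and acting")),
])
```

    A new element such as `element_terms("cookbook", "recipe", "sugar and butter")` is assigned to "recipe" because it shares both the structural term and some words with that prototype.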