System and method to enable parallel text search using in-charge index ranges
    1.
    Granted invention patent
    Legal status: Lapsed

    Publication number: US07689545B2

    Publication date: 2010-03-30

    Application number: US11185733

    Filing date: 2005-07-21

    IPC classification: G06F7/00

    Abstract: When a document to be searched is registered, two tables are created: a document identifier management table, which manages the range of document identifiers stored on each page together with that page's page identifier, and a per-search-server search range management table, which manages the range of document identifiers that each search server is in charge of. When searching, each search server refers to the search range management table to acquire its allocated range of document identifiers. For each index key forming a query term specified as a query condition, the server refers to the document identifier management table to acquire the identifiers of the pages that store document identifiers in the allocated range. The search is then carried out by referring to the pages indicated by the acquired page identifiers.
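
    The two tables described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the table contents, the server names, and the functions `pages_for_server` and `plan_search` are hypothetical, and the overlap test is only one plausible way to match a page's identifier range against a server's allocated range.

```python
from typing import Dict, List, NamedTuple, Tuple


class PageRange(NamedTuple):
    page_id: int
    first_doc_id: int  # smallest document identifier stored on the page
    last_doc_id: int   # largest document identifier stored on the page


# Hypothetical document identifier management table: index key -> page ranges.
DOC_ID_TABLE: Dict[str, List[PageRange]] = {
    "search": [PageRange(1, 1, 100), PageRange(2, 101, 250), PageRange(3, 251, 400)],
    "index": [PageRange(7, 1, 180), PageRange(8, 181, 400)],
}

# Hypothetical per-server search range table: server -> (first, last) document id.
SERVER_RANGES: Dict[str, Tuple[int, int]] = {
    "server_a": (1, 200),
    "server_b": (201, 400),
}


def pages_for_server(server: str, index_key: str) -> List[int]:
    """Return the pages whose identifier range overlaps the server's range."""
    lo, hi = SERVER_RANGES[server]
    return [
        page.page_id
        for page in DOC_ID_TABLE.get(index_key, [])
        if page.first_doc_id <= hi and page.last_doc_id >= lo
    ]


def plan_search(server: str, query_keys: List[str]) -> Dict[str, List[int]]:
    """For each index key in the query, list the pages this server must read."""
    return {key: pages_for_server(server, key) for key in query_keys}


if __name__ == "__main__":
    for name in SERVER_RANGES:
        print(name, plan_search(name, ["search", "index"]))
```

    Because each server reads only the pages that overlap its own range, the servers can process the same query in parallel without scanning the full posting data for each index key.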


    DATA MANAGEMENT METHOD, DATA MANAGEMENT PROGRAM, AND DATA MANAGEMENT DEVICE
    2.
    Invention application
    Legal status: Under examination (published)

    Publication number: US20090271435A1

    Publication date: 2009-10-29

    Application number: US12388726

    Filing date: 2009-02-19

    IPC classification: G06F17/30

    Abstract: Provided is a data management method. Each piece of data corresponds to an entry that includes a reference to another entry, and the data is managed in a set, which is a collection of pieces of data. The set corresponds to a linked list in which the entries corresponding to the data are linked in the order in which the data was added. Each entry includes an insertion time sequence number, assigned when the entry is inserted into the linked list, and information indicating whether the data has been deleted from the set. When the data has been deleted, the entry is separated from the linked list at a predetermined timing. The data is referred to by tracing the linked list. When the insertion time sequence number of a referenced entry is later than that of the entry already referred to, the referenced entry is judged to have been separated from the linked list.
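
    The sequence-number check in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed method: the list is assumed to be linked newest-first so that sequence numbers decrease during a normal trace, and the names `LinkedSet`, `sweep`, `reuse`, and `step` are hypothetical. The `reuse` method models one way a separated entry could reappear with a later sequence number.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entry:
    seq: int                        # insertion time sequence number
    data: str
    next: Optional["Entry"] = None  # reference to the entry added just before
    deleted: bool = False           # set when the data is deleted from the set


class SeparatedEntryError(Exception):
    """Raised when a trace reaches an entry that was separated from the list."""


class LinkedSet:
    def __init__(self) -> None:
        self.head: Optional[Entry] = None
        self._next_seq = 1

    def add(self, data: str) -> Entry:
        entry = Entry(self._next_seq, data, self.head)
        self._next_seq += 1
        self.head = entry
        return entry

    def sweep(self) -> None:
        """Separate entries flagged as deleted (the 'predetermined timing')."""
        while self.head is not None and self.head.deleted:
            self.head = self.head.next
        cur = self.head
        while cur is not None:
            while cur.next is not None and cur.next.deleted:
                cur.next = cur.next.next
            cur = cur.next

    def reuse(self, entry: Entry, data: str) -> None:
        """Hypothetical recycling: reinsert a separated entry object at the head."""
        entry.seq = self._next_seq
        self._next_seq += 1
        entry.data = data
        entry.deleted = False
        entry.next = self.head
        self.head = entry


def step(current: Entry) -> Optional[Entry]:
    """Follow one reference, rejecting entries with a later sequence number."""
    nxt = current.next
    if nxt is not None and nxt.seq > current.seq:
        raise SeparatedEntryError(f"entry {nxt.seq} reached from entry {current.seq}")
    return nxt
```

    A reader that is standing on an entry while that entry and its successor are separated and reused would follow a stale reference; the comparison in `step` lets it notice this and, for example, restart the trace from the head.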


    Method, system and program for creating an index
    3.
    Granted invention patent
    Legal status: Lapsed

    Publication number: US07711726B2

    Publication date: 2010-05-04

    Application number: US11689562

    Filing date: 2007-03-22

    IPC classification: G06F17/00

    CPC classification: G06F17/30911

    Abstract: A method, performed in a computer, for creating an index for searching a stored structured document that has a document data structure. The method includes the steps of: analyzing the structured document to extract its document data structure; normalizing the extracted document data structure to create a logical structure index composed of a plurality of elements having a hierarchical structure; extracting the number of appearances of each element in the created logical structure index; and extracting the elements to be used for creating the index, based on the logical structure index, by comparing the extracted number of appearances of each element with a first predetermined threshold.
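
    The counting and threshold steps in this abstract can be sketched as follows. The sketch makes several assumptions that the abstract does not state: the structured document is XML, the normalized logical structure is represented as slash-separated element paths, and an element is selected when its count is at least the threshold (the abstract says only that the count is compared with a threshold). The function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from typing import List


def count_element_paths(xml_text: str) -> Counter:
    """Count how often each hierarchical element path appears in the document."""
    root = ET.fromstring(xml_text)
    counts: Counter = Counter()

    def walk(elem: ET.Element, parent_path: str) -> None:
        path = f"{parent_path}/{elem.tag}"
        counts[path] += 1
        for child in elem:
            walk(child, path)

    walk(root, "")
    return counts


def select_index_elements(counts: Counter, threshold: int) -> List[str]:
    """Assumed rule: keep paths that appear at least `threshold` times."""
    return sorted(path for path, n in counts.items() if n >= threshold)


if __name__ == "__main__":
    doc = (
        "<book><title>T</title>"
        "<chapter><para>a</para><para>b</para></chapter>"
        "<chapter><para>c</para></chapter></book>"
    )
    counts = count_element_paths(doc)
    print(select_index_elements(counts, threshold=2))
```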


    DATA MANAGEMENT METHOD FOR ACCESSING DATA STORAGE AREA BASED ON CHARACTERISTIC OF STORED DATA
    4.
    Invention application
    Legal status: Under examination (published)

    Publication number: US20080320053A1

    Publication date: 2008-12-25

    Application number: US12041299

    Filing date: 2008-03-03

    IPC classification: G06F17/30

    CPC classification: G06F16/83 G06F16/27

    Abstract: Provided is a data management method for managing data stored in a parallel database system in which a plurality of data servers manage data. The parallel database system manages correspondence information between a characteristic of the data and each of the data servers that manage the data, and a data area corresponding to the characteristic of the data. The data management method comprises the steps of: extracting the characteristic of the data from the data to be stored in the data area; storing the data in the data area based on the extracted characteristic; specifying the corresponding data area, based on the characteristic of the data stored in the data area, by referring to the correspondence information; and accessing, by each of the plurality of data servers, the specified data area.
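
    The steps in this abstract can be sketched as follows. The sketch is illustrative only: it assumes that the characteristic is a record's `type` field and that each characteristic maps to exactly one data server. The class `ParallelStore`, its methods, and the sample mapping are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List

Record = Dict[str, str]


def extract_characteristic(record: Record) -> str:
    """Assumption: the characteristic of a record is its 'type' field."""
    return record["type"]


class ParallelStore:
    def __init__(self, correspondence: Dict[str, str]) -> None:
        # Correspondence information: characteristic -> data server.
        self.correspondence = dict(correspondence)
        # One data area per characteristic.
        self.areas: Dict[str, List[Record]] = defaultdict(list)

    def store(self, record: Record) -> None:
        """Store the record in the data area chosen by its characteristic."""
        self.areas[extract_characteristic(record)].append(record)

    def areas_for(self, server: str) -> List[str]:
        """Specify the data areas that the given server is responsible for."""
        return [c for c, s in self.correspondence.items() if s == server]

    def access(self, server: str) -> List[Record]:
        """Return the records in the data areas specified for the server."""
        return [r for c in self.areas_for(server) for r in self.areas.get(c, [])]


if __name__ == "__main__":
    store = ParallelStore({"invoice": "server_1", "report": "server_2"})
    store.store({"type": "invoice", "id": "1"})
    store.store({"type": "report", "id": "2"})
    store.store({"type": "invoice", "id": "3"})
    print(store.access("server_1"))
```

    Grouping data by characteristic at store time means each server touches only its own data areas when it later accesses the data.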


    METHOD, SYSTEM AND PROGRAM FOR CREATING AN INDEX
    5.
    Invention application
    Legal status: Lapsed

    Publication number: US20080120280A1

    Publication date: 2008-05-22

    Application number: US11689562

    Filing date: 2007-03-22

    IPC classification: G06F17/30

    CPC classification: G06F17/30911

    Abstract: A method, performed in a computer, for creating an index for searching a stored structured document that has a document data structure. The method includes the steps of: analyzing the structured document to extract its document data structure; normalizing the extracted document data structure to create a logical structure index composed of a plurality of elements having a hierarchical structure; extracting the number of appearances of each element in the created logical structure index; and extracting the elements to be used for creating the index, based on the logical structure index, by comparing the extracted number of appearances of each element with a first predetermined threshold.


    Method and system for retrieving a document
    6.
    Invention application
    Legal status: Lapsed

    Publication number: US20060101004A1

    Publication date: 2006-05-11

    Application number: US11185733

    Filing date: 2005-07-21

    IPC classification: G06F17/30

    Abstract: When a document to be searched is registered, two tables are created: a document identifier management table, which manages the range of document identifiers stored on each page together with that page's page identifier, and a per-search-server search range management table, which manages the range of document identifiers that each search server is in charge of. When searching, each search server refers to the search range management table to acquire its allocated range of document identifiers. For each index key forming a query term specified as a query condition, the server refers to the document identifier management table to acquire the identifiers of the pages that store document identifiers in the allocated range. The search is then carried out by referring to the pages indicated by the acquired page identifiers.
