Method and search method for structured documents
    11.
    发明授权
    Method and search method for structured documents 失效
    结构化文件的方法和搜索方法

    公开(公告)号:US06496820B1

    公开(公告)日:2002-12-17

    申请号:US09300594

    申请日:1999-04-28

    IPC分类号: G06F1730

    摘要: A registration method for structured documents includes the steps of: preparing correspondence data between a string and a string occurrence position within a structured document for each structured document, and additionally storing the correspondence data in an occurrence frequency extracting index; and preparing a list of a character, an element containing the character and a length of the element and additionally storing the list in an element length index. A search method for structured documents includes the steps of: inputting search conditions including a search term and an element for specifying a search range; decomposing the search term into a plurality of substrings, obtaining an occurrence frequency and an occurrence position of the search term using the plurality of substrings from the occurrence frequency extracting index; selecting a character from the search term, obtaining an element containing the character using the character from the element length index, and further extracting a length of the element within the search range; calculating a matching degree for the search conditions from the occurrence frequency and the occurrence position of the search term and the length of the element within the search range; and outputting the element containing the search term and the matching degree.

    摘要翻译: 结构化文档的注册方法包括以下步骤:为每个结构化文档准备在结构化文档内的字符串和字符串发生位置之间的对应数据,并且将对应数据另外存储在发生频率提取索引中; 以及准备字符的列表,包含该元素的字符和长度的元素,并另外将该列表存储在元素长度索引中。 用于结构化文档的搜索方法包括以下步骤:输入包括搜索项和用于指定搜索范围的元素的搜索条件; 将搜索项分解为多个子串,使用来自发生频率提取索引的多个子串来获得搜索项的出现频率和出现位置; 从所述搜索项中选择一个字符,从所述元素长度索引获得包含所述字符的元素,并进一步提取所述元素在所述搜索范围内的长度; 根据搜索项的出现频率和出现位置以及搜索范围内的元素的长度来计算搜索条件的匹配度; 并输出包含搜索项和匹配度的元素。

    Registration method and search method for structured documents
    13.
    发明授权
    Registration method and search method for structured documents 失效
    结构化文件的注册方法和搜索方法

    公开(公告)号:US06826567B2

    公开(公告)日:2004-11-30

    申请号:US10218495

    申请日:2002-08-15

    IPC分类号: G06F1730

    摘要: A registration/search method for structured documents where correspondence data is prepared between a fixed-length-string and a string occurrence position within a structured document for all fixed-length-strings in the document and for each structured document. A list of a character and all hierarchical elements containing the character and element lengths is prepared. An occurrence frequency and an occurrence position of a search term is obtained using the plurality of fixed-length-substrings and the occurrence frequency extracting index. A search character is selected from the search term. A hierarchical element containing the search character is obtained using the character from the element length index. A length of the element corresponding to a search range is extracted using the obtained occurrence position. A matching degree for the search term is calculated from the obtained occurrence frequency of the search term and the extracted element length of the element corresponding to the search range.

    摘要翻译: 一种结构化文档的注册/搜索方法,其中在文档和每个结构化文档中的所有固定长度字符串的结构化文档中的固定长度字符串和字符串发生位置之间准备对应数据。 准备了包含字符和元素长度的字符和所有分层元素的列表。 使用多个固定长度子串和出现频率提取索引来获得搜索项的出现频率和出现位置。 从搜索项中选择搜索字符。 使用元素长度索引中的字符获得包含搜索字符的分层元素。 使用所获得的发生位置提取与搜索范围对应的元素的长度。 从搜索项的获得的出现频率和与搜索范围对应的元素的提取的元素长度计算搜索项的匹配度。

    Similar document retrieving method and system
    14.
    发明授权
    Similar document retrieving method and system 有权
    类似的文件检索方法和系统

    公开(公告)号:US07231388B2

    公开(公告)日:2007-06-12

    申请号:US10206595

    申请日:2002-07-29

    IPC分类号: G06F10/30

    摘要: Similar document retrieving method and system for retrieving similar documents from a document database storing plural documents written in different languages with high accuracy while suppressing retrieval noise even when difference is found in the number of registered documents in dependence on the species of description languages. Statistical information concerning the registration-subjected documents is collected on a language-by-language basis upon registration thereof. Upon retrieval of documents similar to a query document, weights of words extracted from the query document are taken into account and on a language-by-language basis by referencing the statistical information.

    摘要翻译: 相似的文件检索方法和系统,用于从存储多种写入不同语言的多种文件的文件数据库中检索类似的文档,同时抑制检索噪声,即使在依赖于描述语言的种类的登记文件的数量上存在差异的情况下。 有关登记受影响的文件的统计资料,在登记后将逐一收集。 在检索与查询文档类似的文档时,通过参考统计信息考虑从查询文档中提取的单词的权重,并且逐个语言地考虑。

    Method of searching similar document, system for performing the same and program for processing the same
    15.
    发明授权
    Method of searching similar document, system for performing the same and program for processing the same 失效
    搜索类似文档的方法,执行相同的系统和处理程序的方法

    公开(公告)号:US07200587B2

    公开(公告)日:2007-04-03

    申请号:US10081203

    申请日:2002-02-25

    IPC分类号: G06F17/30 G06F17/00

    摘要: A similar document search method includes a step of extracting a characteristic word candidate as a candidate for a characteristic word from a seeds document including desired retrieval contents, a step of extracting as characteristic words of the seeds document, when the characteristic word candidate extracted by the extracting step is a compound characteristic word including a plurality of characteristic words, the compound characteristic word and constituent characteristic words included in the compound characteristic word from the characteristic word candidate, a step of calculating, according to the characteristic words extracted by the extracting step, similarity between the seeds document and a registration document, and a step of outputting as a retrieval result a result of the similarity calculated by the similarity calculating step.

    摘要翻译: 类似的文档搜索方法包括从包括期望的检索内容的种子文档中提取特征词候选作为特征词的候选的步骤,当由所述特征词候选提取的特征词候选提取时,提取种子文档的特征词的步骤 提取步骤是包括多个特征词的复合特征词,来自特征词候选的复合特征词中包括的复合特征词和构成特征词,根据由提取步骤提取的特征词计算的步骤, 种子文档和登记文档之间的相似性,以及作为检索结果输出由相似度计算步骤计算出的相似度的结果的步骤。

    Method and system for storing and managing electronic mail
    16.
    发明授权
    Method and system for storing and managing electronic mail 失效
    存储和管理电子邮件的方法和系统

    公开(公告)号:US07080099B2

    公开(公告)日:2006-07-18

    申请号:US10167011

    申请日:2002-06-10

    IPC分类号: G06F7/00

    摘要: A computerized document management system manages and allows viewing of attachment documents in groups of electronic mail messages. A determination is first made as to whether an electronic mail message is a task message. If so, task history information, including the main text of the electronic mail message, attribute information, and information about relations with other messages, is stored. The attachment documents are then extracted and stored together with attachment document management information. Upon receipt of a search request, a list of attachment documents and task histories can then be displayed.

    摘要翻译: 计算机化的文档管理系统管理并允许以电子邮件消息的组来查看附件文档。 首先确定电子邮件消息是否是任务消息。 如果是,则存储包括电子邮件消息的主文本,属性信息以及与其他消息的关系的信息的任务历史信息。 然后将附件文件与附件文件管理信息一起提取和存储。 在接收到搜索请求后,可以显示附件文档和任务历史列表。

    Document search method and apparatus and portable medium used therefor
    17.
    发明授权
    Document search method and apparatus and portable medium used therefor 失效
    文件检索方法及其使用的便携式媒体

    公开(公告)号:US06377946B1

    公开(公告)日:2002-04-23

    申请号:US09256178

    申请日:1999-02-24

    IPC分类号: G06F1730

    摘要: A document search method and apparatus and a portable medium used therefor are described, in which when registering a document in a data base, the logic structures of each document to be registered are superposed one on another to generate a structure index in which the structure elements having the same position of occurrence in the document are represented by a single meta-node. At the time of document search, a mass of the meta-nodes meeting a specified structural condition is determined with reference to the structure index. A string index is searched with the meta-node identifiers as a key thereby to determine a mass of documents meeting the specified condition. As a result, a highly accurate structure-specified search is made possible on a document data base including a mass of structured documents. In the structure-specified search of structured documents, the conditions for the position of occurrence of the logic elements in the document are specified, thereby making possible a highly accurate structure-specified search.

    摘要翻译: 描述了一种文档搜索方法和装置及其使用的便携式介质,其中当在数据库中注册文档时,将要注册的每个文档的逻辑结构彼此叠加以生成结构元素 在文档中具有相同的出现位置由单个元节点表示。 在文档搜索时,参考结构索引确定满足指定结构条件的大量元节点。 使用元节点标识符作为关键字搜索字符串索引,从而确定满足指定条件的文档的大小。 结果,在包括大量结构化文档的文档数据库上可以进行高度精确的结构指定搜索。 在结构化指定的结构化文档搜索中,指定了文档中逻辑元素的发生位置的条件,从而使得可以进行高精度的结构指定搜索。

    Document searching method using forward and backward citation tables
    18.
    发明授权
    Document searching method using forward and backward citation tables 失效
    使用前向和后向引用表的文档搜索方法

    公开(公告)号:US5832476A

    公开(公告)日:1998-11-03

    申请号:US746905

    申请日:1996-11-19

    IPC分类号: G06F17/30

    摘要: A document searching system searches for other documents having a user-specified document cited therein as its referred document to thereby uncover the latest document associated with the user-specified document. In related document searching method, document information is registered in a text storage region, a referred document table and a related document table are created, and referred documents associated with the user-specified document are searched for with use of the created tables.

    摘要翻译: 文档搜索系统搜索具有其中引用的用户指定文档的其他文档作为其引用文档,从而发现与用户指定文档相关联的最新文档。 在相关文档搜索方法中,将文档信息登记在文本存储区域中,创建参考文档表和相关文档表,并且使用所创建的表来搜索与用户指定的文档相关联的参考文档。

    File server system and file access control method of the same
    19.
    发明授权
    File server system and file access control method of the same 失效
    文件服务器系统和文件访问控制方法相同

    公开(公告)号:US5548724A

    公开(公告)日:1996-08-20

    申请号:US216047

    申请日:1994-03-21

    IPC分类号: G06F9/50 G06F17/30 H01J13/00

    CPC分类号: G06F9/505 G06F17/30067

    摘要: A typical structure of a file server system is a file server system having a plurality of file servers connected in parallel on a network and sharing files placed distributedly in the file servers among a plurality of client computers, and there are provided in a specific file server among the plurality of file servers, a load information monitoring device for measuring respective loads of the plurality of file servers and a file access request distributing device for referring to the loads measured by the load information monitoring device so as to select a file server having a light load from the plurality of file servers having light loads, and distributing a file access request transmitted from client computers to the selected file server.

    摘要翻译: 文件服务器系统的典型结构是文件服务器系统,其具有在网络上并行连接的多个文件服务器,并且共享分散放置在多个客户端计算机中的文件服务器中的文件,并且提供在特定文件服务器 在多个文件服务器中,设置有用于测量多个文件服务器的相应负载的负载信息监视装置和用于参考由负载信息监视装置测量的负载的文件访问请求分发装置,以便选择具有 来自具有轻负载的多个文件服务器的轻负载,以及将从客户端计算机发送的文件访问请求分发到所选择的文件服务器。