Structured-text cataloging method, structured-text searching method, and portable medium used in the methods
    22.
    发明授权
    Structured-text cataloging method, structured-text searching method, and portable medium used in the methods 失效
    结构化文本编目方法,结构化文本搜索方法和方法中使用的便携式媒体

    公开(公告)号:US06226632B1

    公开(公告)日:2001-05-01

    申请号:US09589226

    申请日:2000-06-08

    IPC分类号: G06F1730

    摘要: A text cataloging method includes a step of cataloging already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged in a text database, a step of creating a structure index by sequentially superposing logical structures of texts to be cataloged, wherein a single metaelement is used for representing a group of elements in the texts having the same position of appearance in one of the texts and the same element type, a single piece of meta-character-string data is used for representing a group of pieces of character-string data in the texts having the same position of appearance in one of the texts, and a context identifier is assigned to each metanode composing a tree-like structure of the structure index for uniquely identifying the metanode; a step of generating structured-full-text data composed of definitions of associative relations between all pieces of character-string data included in already-analyzed-text data of each text to be cataloged, and context identifiers of pieces of meta-character-string data in the structure index used for representing the pieces of character-string data; and a character-string-index updating step, including the sub-steps of extracting partial character strings, generating structured-character-position information, and updating a character-string index.

    摘要翻译: 文本编目方法包括对从文本数据库中要编目的文本的逻辑结构的分析获得的已经分析的文本数据进行编目的步骤,通过将文本的逻辑结构顺序叠加来创建结构索引的步骤 其中单个元组用于表示在一个文本和相同元素类型中具有相同出现位置的文本中的一组元素,单个元字符串数据用于表示组 在文本中具有相同的出现位置的文本中的字符串数据段,并且将上下文标识符分配给构成用于唯一地标识元数据的结构索引的树状结构的每个元模型; 生成结构化全文数据的步骤,该结构化全文数据由包含在要编目的每个文本的已分析文本数据中的所有字符串数据之间的关联关系的定义以及元字符串的上下文标识符组成 用于表示字符串数据片段的结构索引中的数据; 以及字符串索引更新步骤,包括提取部分字符串,生成结构化字符位置信息和更新字符串索引的子步骤。

    News clipping method and system
    23.
    发明授权
    News clipping method and system 失效
    新闻剪辑方法和系统

    公开(公告)号:US5970485A

    公开(公告)日:1999-10-19

    申请号:US891064

    申请日:1997-07-10

    IPC分类号: G06F3/14 G06F3/048 G06F17/30

    摘要: A method of fast clipping, despite of large number of users, can be achieved through analyzing query expressions, storing the number of query terms included in the query expressions in a term number count table, generating a finite automaton for matching the terms occurring in text data with all terms included in the query expressions, generating a user identifier table for storing the identifiers of users in association with the terms included in the query expressions, matching the terms by scanning the text data by the finite automaton, calculating for each user the occurrence count of terms occurring in the text data as substrings coincident with the terms included in the query expressions made to the user identifier table, storing the calculated occurrence count in the term occurrence count region of the table, comparing the calculated term occurrence count of the table with the number of terms included the query expressions, and when a match is found from the comparison, delivering the text data to the user.

    摘要翻译: 可以通过分析查询表达式,将包含在查询表达式中的查询项的数量存储在术语数量计数表中来实现快速裁剪的方法,尽管有大量用户,生成用于匹配文本中出现的术语的有限自动机 具有包括在查询表达式中的所有术语的数据,生成用于存储与查询表达式中包括的术语相关联的用户标识符的用户标识符表,通过有限自动机扫描文本数据来匹配术语,为每个用户计算 在文本数据中出现的术语的出现次数与作为对用户标识符表的查询表达式中包括的术语一致的子字符串存储,将计算的出现次数存储在表的术语出现计数区域中,将计算出的术语发生次数 表中包含查询表达式的术语数量,当从比较中找到匹配时,传递 g给用户的文本数据。

    Structured-text cataloging method, structured-text searching method, and portable medium used in the methods
    24.
    发明授权
    Structured-text cataloging method, structured-text searching method, and portable medium used in the methods 失效
    结构化文本编目方法,结构化文本搜索方法和方法中使用的便携式媒体

    公开(公告)号:US06745202B2

    公开(公告)日:2004-06-01

    申请号:US10303782

    申请日:2002-11-26

    IPC分类号: G06F1730

    摘要: A text cataloging method includes a step of cataloging already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged in a text database, a step of creating a structure index by sequentially superposing logical structures of texts to be cataloged, wherein a single metaelement is used for representing a group of elements in the texts having the same position of appearance in one of the texts and the same element type, a single piece of meta-character-string data is used for representing a group of pieces of character-string data in the texts having the same position of appearance in one of the texts, and a context identifier is assigned to each metanode composing a tree-like structure of the structure index for uniquely identifying the metanode; a step of generating structured-full-text data composed of definitions of associative relations between all pieces of character-string data included in already-analyzed-text data of each text to be cataloged, and context identifiers of pieces of meta-character-string data in the structure index used for representing the pieces of character-string data; and a character-string-index updating step, including the sub-steps of extracting partial character strings, generating structured-character-position information, and updating a character-string index.

    摘要翻译: 文本编目方法包括对从文本数据库中要编目的文本的逻辑结构的分析获得的已经分析的文本数据进行编目的步骤,通过将文本的逻辑结构顺序叠加来创建结构索引的步骤 其中单个元组用于表示在一个文本和相同元素类型中具有相同出现位置的文本中的一组元素,单个元字符串数据用于表示组 在文本中具有相同的出现位置的文本中的字符串数据段,并且将上下文标识符分配给构成用于唯一地标识元数据的结构索引的树状结构的每个元模型; 生成结构化全文数据的步骤,该结构化全文数据由包含在要编目的每个文本的已分析文本数据中的所有字符串数据之间的关联关系的定义以及元字符串的上下文标识符组成 用于表示字符串数据的结构索引中的数据; 以及字符串索引更新步骤,包括提取部分字符串,生成结构化字符位置信息和更新字符串索引的子步骤。

    Structured-text cataloging method, structured-text searching method, and portable medium used in the methods
    26.
    发明授权
    Structured-text cataloging method, structured-text searching method, and portable medium used in the methods 失效
    结构化文本编目方法,结构化文本搜索方法和方法中使用的便携式媒体

    公开(公告)号:US06389413B2

    公开(公告)日:2002-05-14

    申请号:US09814692

    申请日:2001-03-15

    IPC分类号: G06F1730

    摘要: A text cataloging method includes a step of cataloging already-analyzed-text data obtained from an analysis of a logical structure of a text to be cataloged in a text database, a step of creating a structure index by sequentially superposing logical structures of texts to be cataloged, wherein a single metaelement is used for representing a group of elements in the texts having the same position of appearance in one of the texts and the same element type, a single piece of meta-character-string data is used for representing a group of pieces of character-string data in the texts having the same position of appearance in one of the texts, and a context identifier is assigned to each metanode composing a tree-like structure of the structure index for uniquely identifying the metanode; a step of generating structured-full-text data composed of definitions of associative relations between all pieces of character-string data included in already-analyzed-text data of each text to be cataloged, and context identifiers of pieces of meta-character-string data in the structure index used for representing the pieces of character-string data; and a character-string-index updating step, including the sub-steps of extracting partial character strings, generating structured-character-position information, and updating a character-string index.

    摘要翻译: 文本编目方法包括对从文本数据库中要编目的文本的逻辑结构的分析获得的已经分析的文本数据进行编目的步骤,通过将文本的逻辑结构顺序叠加来创建结构索引的步骤 其中单个元组用于表示在一个文本和相同元素类型中具有相同出现位置的文本中的一组元素,单个元字符串数据用于表示组 在文本中具有相同的出现位置的文本中的字符串数据段,并且将上下文标识符分配给构成用于唯一地标识元数据的结构索引的树状结构的每个元模型; 生成结构化全文数据的步骤,该结构化全文数据由包含在要编目的每个文本的已分析文本数据中的所有字符串数据之间的关联关系的定义以及元字符串的上下文标识符组成 用于表示字符串数据的结构索引中的数据; 以及字符串索引更新步骤,包括提取部分字符串,生成结构化字符位置信息和更新字符串索引的子步骤。