专利检索 ap:("Tadataka Matsubayashi" OR "Katsumi Tada" OR "Yoshifumi Sato" OR "Yasuhiko Inaba" OR "Jugo Noda") AND inv:"Tadataka Matsubayashi" 第 1 页

1.

发明授权
Method of searching similar document, system for performing the same and program for processing the same 失效
标题翻译：搜索类似文档的方法，执行相同的系统和处理程序的方法

公开(公告)号：US07200587B2

公开(公告)日：2007-04-03

申请号：US10081203

申请日：2002-02-25

申请人： Tadataka Matsubayashi , Katsumi Tada , Yoshifumi Sato , Yasuhiko Inaba , Jugo Noda

发明人： Tadataka Matsubayashi , Katsumi Tada , Yoshifumi Sato , Yasuhiko Inaba , Jugo Noda

IPC分类号： G06F17/30 , G06F17/00

CPC分类号： G06F17/3069 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933

摘要： A similar document search method includes a step of extracting a characteristic word candidate as a candidate for a characteristic word from a seeds document including desired retrieval contents, a step of extracting as characteristic words of the seeds document, when the characteristic word candidate extracted by the extracting step is a compound characteristic word including a plurality of characteristic words, the compound characteristic word and constituent characteristic words included in the compound characteristic word from the characteristic word candidate, a step of calculating, according to the characteristic words extracted by the extracting step, similarity between the seeds document and a registration document, and a step of outputting as a retrieval result a result of the similarity calculated by the similarity calculating step.

摘要翻译： 类似的文档搜索方法包括从包括期望的检索内容的种子文档中提取特征词候选作为特征词的候选的步骤，当由所述特征词候选提取的特征词候选提取时，提取种子文档的特征词的步骤提取步骤是包括多个特征词的复合特征词，来自特征词候选的复合特征词中包括的复合特征词和构成特征词，根据由提取步骤提取的特征词计算的步骤，种子文档和登记文档之间的相似性，以及作为检索结果输出由相似度计算步骤计算出的相似度的结果的步骤。

2.

发明授权
Data display method and apparatus for use in text mining 失效
标题翻译：用于文本挖掘的数据显示方法和装置

公开(公告)号：US06738786B2

公开(公告)日：2004-05-18

申请号：US09874005

申请日：2001-06-06

申请人： Natsuko Sugaya , Katsumi Tada , Yoshifumi Sato , Tadataka Matsubayashi , Yasuhiko Inaba , Mikihiko Tokunaga

发明人： Natsuko Sugaya , Katsumi Tada , Yoshifumi Sato , Tadataka Matsubayashi , Yasuhiko Inaba , Mikihiko Tokunaga

IPC分类号： G06F1730

CPC分类号： G06F17/30616 , Y10S707/99945 , Y10S707/99948

摘要： In a text mining technique, if the system only extracts characteristic words and phrases frequently cooccurring with the respective components of an analysis axis as an analysis condition, similar words and phrases are extracted for any component. To clearly indicate existence of characteristic words and phrases which do not appear as cooccurrence words and phrases for other components of the analysis axis, it is desired to appropriately present distinguishable features between the components to the user. For this purpose, the frequency of appearances of a plurality of characteristic words and phrases in a document satisfying each analysis condition is calculated. As a result, multiple cooccurrence words and phrases and component-cooccurrence words and phrases are discriminatively displayed. It is therefore possible for the user to appropriately analyze the contents of a plurality of documents.

摘要翻译： 在文本挖掘技术中，如果系统只提取经常与分析轴的各个分量共同出现的特征词和短语作为分析条件，则为任何分量提取类似的词和短语。为了清楚地表示存在不是作为分析轴的其他部件的共同文字和短语的特征词和短语，希望适当地向用户呈现组件之间的可区分的特征。为此，计算满足各分析条件的文件中的多个特征词和短语的出现次数。结果，多个同时出现的单词和短语以及组合 - 共同文字和短语被歧视地显示出来。因此，用户可以适当地分析多个文档的内容。

3.

发明授权
Similar document retrieving method and system 有权
标题翻译：类似的文件检索方法和系统

公开(公告)号：US07231388B2

公开(公告)日：2007-06-12

申请号：US10206595

申请日：2002-07-29

申请人： Tadataka Matsubayashi , Katsumi Tada , Yoshifumi Sato , Yasuhiko Inaba , Shin′ ya Yamamoto

发明人： Tadataka Matsubayashi , Katsumi Tada , Yoshifumi Sato , Yasuhiko Inaba , Shin′ ya Yamamoto

IPC分类号： G06F10/30

CPC分类号： G06F17/3069 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935

摘要： Similar document retrieving method and system for retrieving similar documents from a document database storing plural documents written in different languages with high accuracy while suppressing retrieval noise even when difference is found in the number of registered documents in dependence on the species of description languages. Statistical information concerning the registration-subjected documents is collected on a language-by-language basis upon registration thereof. Upon retrieval of documents similar to a query document, weights of words extracted from the query document are taken into account and on a language-by-language basis by referencing the statistical information.

摘要翻译： 相似的文件检索方法和系统，用于从存储多种写入不同语言的多种文件的文件数据库中检索类似的文档，同时抑制检索噪声，即使在依赖于描述语言的种类的登记文件的数量上存在差异的情况下。有关登记受影响的文件的统计资料，在登记后将逐一收集。在检索与查询文档类似的文档时，通过参考统计信息考虑从查询文档中提取的单词的权重，并且逐个语言地考虑。

4.

发明授权
Method and system for retrieving a document and computer readable storage medium 有权
标题翻译：用于检索文档和计算机可读存储介质的方法和系统

公开(公告)号：US07054860B2

公开(公告)日：2006-05-30

申请号：US10678065

申请日：2003-10-06

申请人： Yasuhiko Inaba , Katsumi Tada , Natsuko Sugaya , Tadataka Matsubayashi , Akihiko Yamaguchi , Mikihiko Tokunaga

发明人： Yasuhiko Inaba , Katsumi Tada , Natsuko Sugaya , Tadataka Matsubayashi , Akihiko Yamaguchi , Mikihiko Tokunaga

IPC分类号： G06F7/00 , G06F17/30

CPC分类号： G06F17/30648 , Y10S707/99935 , Y10S707/99943 , Y10S707/99952

摘要： In document retrieval having the relevance feedback function to modify a searching profile for retrieval on the basis of a user's evaluation to evaluate a search result as pertinent or impertinent, recommencement of the relevance feedback returned to a desired time is permitted. An evaluation inputted by a user, a searching profile modified by the evaluation and a search result based on the searching profile are all saved while making the correspondence between them. When a request for restoration of searching profile is made, a searching profile corresponding to an evaluation designated by the user is restored.

摘要翻译： 在具有相关性反馈功能的文档检索中，基于用户的评价来修改用于检索的搜索简档以评估搜索结果作为相关或无关的，允许重新启动返回到期望时间的相关性反馈。由用户输入的评估，通过评估修改的搜索简档和基于搜索简档的搜索结果在保持它们之间的对应关系的同时被保存。当进行搜索简档的恢复请求时，恢复与由用户指定的评估对应的搜索简档。

5.

发明授权
Document retrieval method and document retrieval system 有权

公开(公告)号：US07039636B2

公开(公告)日：2006-05-02

申请号：US10456519

申请日：2003-06-09

申请人： Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Tadataka Matsubayashi , Yasuhiko Inaba , Yasushi Kawashimo

发明人： Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Tadataka Matsubayashi , Yasuhiko Inaba , Yasushi Kawashimo

IPC分类号： G06F17/30

CPC分类号： G06F17/30011 , Y10S707/99933 , Y10S707/99934

摘要： Word boundary identification operations such as morpheme analysis is performed on documents to be registered, and the top positions and the end positions of words are identified. Word boundary information is obtained based on these identification results. Search indexes are created for sub-strings of a predetermined length (n-grams) extracted from the document being registered. The search index includes document identification information as well as occurrence position information which indicates that the string is located at the n-th position from the beginning of the text data, and word boundary information for an n-gram in a document.

6.

发明授权
Text mining method and apparatus allowing a user to analyze contents of a document set from plural analysis axes 失效
标题翻译：允许用户从多个分析轴分析文档集的内容的文本挖掘方法和装置

公开(公告)号：US06757676B1

公开(公告)日：2004-06-29

申请号：US09649961

申请日：2000-08-29

申请人： Natsuko Sugaya , Katsumi Tada , Tadataka Matsubayashi , Akihiko Yamaguchi , Yasuhiko Inaba , Mikihiko Tokunaga

发明人： Natsuko Sugaya , Katsumi Tada , Tadataka Matsubayashi , Akihiko Yamaguchi , Yasuhiko Inaba , Mikihiko Tokunaga

IPC分类号： G06F1730

CPC分类号： G06F17/30616 , Y10S707/99935 , Y10S707/99945

摘要： A text mining method whereby documents (texts) can be analyzed from a wide variety of visual points. The text mining method includes: distinctive word and/or phrase extraction step of extracting words and/or phrases characteristically emerging in a processing subject document set obtained by taking out whole or a part of a set of documents registered beforehand; definition information setting step of setting definition information including a specified word or phrase or specified bibliography information; coincident word and/or phrase acquisition step of acquiring coincident words and/or phrases coincident in a predetermined range with a word or phrase or bibliography information included in said definition information from among words and/or phrases extracted at said distinctive word and/or phrase extraction step; and multiplex coincident word and/or phrase acquisition step of acquiring coincident words and/or phrases coincident in a predetermined range with an individual word or phrase or bibliography information acquired from each of a plurality of different definition information pieces.

摘要翻译： 一种文本挖掘方法，可以从各种视觉点分析文档（文本）。文本挖掘方法包括：提取通过取出预先登记的一组文档的全部或一部分而获得的处理对象文档集中特征出现的单词和/或短语的特征词和/或短语提取步骤; 定义信息设置步骤，设置包括指定的单词或短语或指定参考书目信息的定义信息; 一致的单词和/或短语获取步骤，用于在预定范围内与在所述特征词和/或短语中提取的单词和/或短语中包含的所述定义信息中包含的单词或短语或参考书目信息获取一致的单词和/或短语提取步骤以及将从预定范围重合的一致字和/或短语与从多个不同定义信息片段中的每一个获取的单个词或短语或参考书目信息进行多路复用的一致词和/或短语获取步骤。

7.

发明授权
Method of and an apparatus for retrieving and delivering documents and a recording media on which a program for retrieving and delivering documents are stored 失效
标题翻译：用于检索和传送文件的方法和装置以及在其上存储用于检索和传送文档的程序的记录介质

公开(公告)号：US06549898B1

公开(公告)日：2003-04-15

申请号：US09518689

申请日：2000-03-03

申请人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

发明人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

IPC分类号： G06F1730

CPC分类号： G06F17/30699 , G06F17/30702 , Y10S707/917 , Y10S707/99935 , Y10S707/99945

摘要： Retrieval conditions inputted from a plurality of users are registered. According to the retrieval conditions, a retrieval is conducted for a text inputted. As a result of the retrieval, similarity of the text is calculated for each retrieval condition. The text is delivered to users of which the retrieval condition satisfies the similarity.

摘要翻译： 登记从多个用户输入的检索条件。根据检索条件，对输入的文本进行检索。作为检索的结果，针对每个检索条件计算文本的相似度。文本被传递给其检索条件满足相似性的用户。

8.

发明授权
Method of and an apparatus for retrieving and delivering documents and a recording media on which a program for retrieving and delivering documents are stored 失效

公开(公告)号：US06665667B2

公开(公告)日：2003-12-16

申请号：US10232721

申请日：2002-09-03

申请人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

发明人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

IPC分类号： G06F1730

CPC分类号： G06F17/30699 , G06F17/30702 , Y10S707/917 , Y10S707/99935 , Y10S707/99945

摘要： Retrieval conditions inputted from a plurality of users are registered. According to the retrieval conditions, a retrieval is conducted for a text inputted. As a result of the retrieval, similarity of the text is calculated for each retrieval condition. The text is delivered to users of which the retrieval condition satisfies the similarity.

9.

发明授权
Method of and an apparatus for retrieving and delivering documents and a recording media on which a program for retrieving and delivering documents are stored 失效
标题翻译：用于检索和传送文件的方法和装置以及在其上存储用于检索和传送文档的程序的记录介质

公开(公告)号：US07333983B2

公开(公告)日：2008-02-19

申请号：US10718699

申请日：2003-11-24

申请人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

发明人： Yasuhiko Inaba , Tadataka Matsubayashi , Katsumi Tada , Takuya Okamoto , Natsuko Sugaya , Yousuke Ushiroji

IPC分类号： G06F17/30

CPC分类号： G06F17/30011 , Y10S707/99935 , Y10S707/99945

摘要： Retrieval conditions inputted from a plurality of users are registered. According to the retrieval conditions, a retrieval is conducted for a text inputted. As a result of the retrieval, similarity of the text is calculated for each retrieval condition. The text is delivered to users of which the retrieval condition satisfies the similarity.

摘要翻译： 登记从多个用户输入的检索条件。根据检索条件，对输入的文本进行检索。作为检索的结果，针对每个检索条件计算文本的相似度。文本被传递给其检索条件满足相似性的用户。

10.

发明授权
Document retrieval method and system and computer readable storage medium 失效
标题翻译：文件检索方法和系统以及计算机可读存储介质

公开(公告)号：US06865571B2

公开(公告)日：2005-03-08

申请号：US09952594

申请日：2001-09-13

申请人： Yasuhiko Inaba , Katsumi Tada , Natsuko Sugaya , Tadataka Matsubayashi , Akihiko Yamaguchi , Mikihiko Tokunaga

发明人： Yasuhiko Inaba , Katsumi Tada , Natsuko Sugaya , Tadataka Matsubayashi , Akihiko Yamaguchi , Mikihiko Tokunaga

IPC分类号： G06F17/30 , G06F7/00

CPC分类号： G06F17/30699 , Y10S707/99935

摘要： A document retrieval method using a computer program includes retrieving a first set of documents using a first query expression generated by the computer program. The first set of documents is provided to a user. An evaluation of the first set of documents is received from the user. The first query expression is changed to a second query expression generated by the computer program based on the evaluation.

摘要翻译： 使用计算机程序的文档检索方法包括使用由计算机程序生成的第一查询表达式来检索第一组文档。第一组文件被提供给用户。从用户接收对第一组文档的评估。基于评估，第一个查询表达式被更改为由计算机程序生成的第二个查询表达式。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类