DATA PROCESSING METHOD, DATA PROCESSING SYSTEM, AND PROGRAM
    1.
    发明公开
    DATA PROCESSING METHOD, DATA PROCESSING SYSTEM, AND PROGRAM 审中-公开
    DATENVERARBEITUNGSVERFAHREN,DATENVERARBEITUNGSSYSTEM UND PROGRAMM

    公开(公告)号:EP1429258A1

    公开(公告)日:2004-06-16

    申请号:EP02746128.4

    申请日:2002-07-19

    IPC分类号: G06F17/28

    摘要: [Object] Provided is a support system or a method for efficiently enabling generation of candidate synonyms, when a thesaurus usable in text mining is created.
    [Constitution] A candidate synonym acquisition device 130 acquires a set of candidate synonyms similar to an input word for each writer from data 110 for each writer, and acquires a set of candidate synonyms similar to the input word from a collective data 120. A generated candidate synonym set 140 is inputted to a candidate synonym determination device 150 to evaluate the candidate synonyms of the collective data 120. In the evaluation, the status of "absolute" is given to a word matching a word ranked first in the candidate synonyms for each writer and the status of "negative" is given to words matching words ranked second and lower therein.

    摘要翻译: ÄObjectÜ提供的是一种支持系统或一种方法,用于在创建可用于文本挖掘的同义词库时有效地启用候选同义词生成。 候选同义词获取装置130从每个写入器的数据110获取类似于每个写入器的输入字的一组候选同义词,并且从集合数据120获取类似于输入字的一组候选同义词。 生成的候选同义词集140被输入到候选同义词确定装置150,以评估集体数据120的候选同义词。在评估中,给出与候选同义词首先排列的词匹配的单词的“绝对”状态 对于每个作者,“负”的状态被给予匹配在其中第二和第二的词的匹配词。