-
公开(公告)号:US20110131213A1
公开(公告)日:2011-06-02
申请号:US12748681
申请日:2010-03-29
申请人: Yu-Chieh Wu , Pei-Sen Liu , Han-Shiang Chang , Sheng-Ho Chang , Hsin-Jung Huang
发明人: Yu-Chieh Wu , Pei-Sen Liu , Han-Shiang Chang , Sheng-Ho Chang , Hsin-Jung Huang
IPC分类号: G06F17/30
CPC分类号: G06F16/3344
摘要: This invention discloses a method for mining a comment term in a document. The method comprises, first, to build a document database and a keyword database, wherein the document database includes at least one digital document, the keyword database includes at least one keyword. Then, a language of the digital document is determined. The digital document is processed based on the language to form a first document. Next, word groups are gathered from the first document based on a gathering range and apart-of-speech, wherein each word group includes the keyword and a word with the part-of-speech.
摘要翻译: 本发明公开了一种在文档中挖掘评论项的方法。 该方法首先构建文档数据库和关键字数据库,其中文档数据库包括至少一个数字文档,该关键字数据库包括至少一个关键字。 然后,确定数字文档的语言。 基于该语言处理数字文档以形成第一个文档。 接下来,基于收集范围和分词语言从第一文档收集单词组,其中每个单词组包括关键词和具有词性的单词。