发明申请
- 专利标题: METHOD FOR KEYWORD EXTRACTION
- 专利标题(中): 关键词提取方法
-
申请号: US13641054申请日: 2010-04-14
-
公开(公告)号: US20130036076A1公开(公告)日: 2013-02-07
- 发明人: Sheng-Wen Yang , Yuhong Xiong , Wei Liu
- 申请人: Sheng-Wen Yang , Yuhong Xiong , Wei Liu
- 申请人地址: US TX Houston
- 专利权人: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
- 当前专利权人: HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.
- 当前专利权人地址: US TX Houston
- 国际申请: PCT/CN2010/071758 WO 20100414
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F15/18
摘要:
Presented is a method of extracting keywords. The method includes obtaining a corpus of documents, determining a first set of words that appear as keywords in a document present in the corpus of documents, determining a second set of words that appear in the corpus of documents but not necessarily appear as keywords in the document, and determining a final set of keywords for the document by combining the first set of words with the second set of words.
信息查询