Patent application
- Title: EXTRACTION AND GROUPING OF FEATURE WORDS
- Title (Chinese): 特征词提取与分组
- Application No.: US13049922
- Filing date: 2011-03-17
- Publication No.: US20120239668A1
- Publication date: 2012-09-20
- Inventors: CHIRANJIB BHATTACHARYYA, Himabindu Lakkaraju, Kaushik Nath, Sunil Arvindam
- Applicants: CHIRANJIB BHATTACHARYYA, Himabindu Lakkaraju, Kaushik Nath, Sunil Arvindam
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
Various embodiments of systems and methods for extraction and grouping of feature words are described herein. Feature words are obtained from a first corpus of text bodies comprising a plurality of reviews. A second corpus is created using a combination of the obtained feature words, verbs and adjectives from the first corpus. The second corpus comprises filtered reviews and each of the filtered reviews pertains to a review. Topics are preliminarily assigned for words in the filtered reviews of the second corpus. For each of the feature words in the second corpus, a topic count is determined for every preliminarily assigned topic. After determining the topic count, one or more of the topics are finally assigned to the feature words based on a topic count value. At least one topic is presented as a group of the feature words for which the at least one topic is assigned based on the topic count value.
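The counting and grouping steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the preliminary topic assignment has already been made by some topic model and is supplied as (word, topic) pairs per filtered review. The function name `group_feature_words`, the toy data, and the rule of finally assigning the single topic with the highest count (ties going to the smallest topic id) are all assumptions made for illustration; the abstract only states that the final assignment is based on a topic count value.

```python
from collections import Counter, defaultdict


def group_feature_words(assigned_reviews, feature_words):
    """Group feature words by their most frequent preliminary topic.

    assigned_reviews: list of filtered reviews, each a list of
        (word, topic_id) pairs (hypothetical input format).
    feature_words: set of words treated as feature words.
    """
    # topic_counts[word][topic] = how often `word` received `topic`
    topic_counts = defaultdict(Counter)
    for review in assigned_reviews:
        for word, topic in review:
            if word in feature_words:
                topic_counts[word][topic] += 1

    # Final assignment (assumed rule): the topic with the highest count,
    # ties broken by the smallest topic id.
    groups = defaultdict(list)
    for word in sorted(topic_counts):
        counts = topic_counts[word]
        best = max(counts.values())
        final_topic = min(t for t, c in counts.items() if c == best)
        groups[final_topic].append(word)
    return dict(groups)


# Toy second corpus: three filtered reviews with preliminary topics.
reviews = [
    [("battery", 0), ("lasts", 0), ("screen", 1), ("bright", 1)],
    [("battery", 0), ("charger", 0), ("screen", 1)],
    [("battery", 1), ("charger", 0), ("display", 1)],
]
features = {"battery", "charger", "screen", "display"}

groups = group_feature_words(reviews, features)
for topic, words in sorted(groups.items()):
    print(f"topic {topic}: {', '.join(words)}")
```

In this toy data, "battery" is assigned topic 0 twice and topic 1 once, so it ends up grouped under topic 0 together with "charger". A threshold on the count, or assigning several topics per word, would be equally compatible with the abstract's wording.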
Published/granted documents
- US08484228B2 Extraction and grouping of feature words (publication/grant date: 2013-07-09)