Granted invention patent
- Patent title: Modeling topics using statistical distributions
- Patent title (Chinese): 使用统计分布建模主题
- Application number: US12243267
- Filing date: 2008-10-01
- Publication number: US09317593B2
- Publication date: 2016-04-19
- Inventors: David L. Marvit, Jawahar Jain, Stergios Stergiou, Alex Gilman, B. Thomas Adler, John J. Sidorowich, Yannis Labrou
- Applicants: David L. Marvit, Jawahar Jain, Stergios Stergiou, Alex Gilman, B. Thomas Adler, John J. Sidorowich, Yannis Labrou
- Applicant address: JP Kawasaki-shi
- Assignee: Fujitsu Limited
- Current assignee: Fujitsu Limited
- Current assignee address: JP Kawasaki-shi
- Agent: Baker Botts L.L.P.
- Main classification: G06F17/30
- IPC classification: G06F17/30
Abstract:
In one embodiment, modeling topics includes accessing a corpus comprising documents that include words. Words of a document are selected as keywords of the document. The documents are clustered according to the keywords to yield clusters, where each cluster corresponds to a topic. A statistical distribution is generated for a cluster from words of the documents of the cluster. A topic is modeled using the statistical distribution generated for the cluster corresponding to the topic.
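The four steps named in the abstract (keyword selection, clustering by keywords, per-cluster distribution, topic model) can be sketched as follows. This is only an illustrative sketch, not the patented method: the abstract does not say how keywords are chosen or how documents are clustered, so the TF-IDF ranking, the greedy shared-keyword clustering, and the use of relative word frequency as the "statistical distribution" are all assumptions. Every function name and the parameter `k` are hypothetical.

```python
import math
from collections import Counter


def select_keywords(docs, k=2):
    """Pick the k highest-scoring words of each document as its keywords.

    Assumption: words are ranked by TF-IDF; the abstract names no criterion.
    """
    n = len(docs)
    # Document frequency: in how many documents each word occurs.
    df = Counter(w for d in docs for w in set(d))
    keywords = []
    for d in docs:
        tf = Counter(d)
        score = {w: tf[w] / len(d) * math.log(n / df[w]) for w in tf}
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(score, key=lambda w: (-score[w], w))
        keywords.append(set(ranked[:k]))
    return keywords


def cluster_by_keywords(keywords):
    """Greedy clustering (assumed): a document joins the first cluster
    that shares at least one keyword with it, else starts a new cluster."""
    clusters = []  # each entry: {"keys": set of keywords, "docs": [indices]}
    for i, keys in enumerate(keywords):
        for c in clusters:
            if c["keys"] & keys:
                c["keys"] |= keys
                c["docs"].append(i)
                break
        else:
            clusters.append({"keys": set(keys), "docs": [i]})
    return [c["docs"] for c in clusters]


def word_distribution(docs, members):
    """Relative frequency of each word over the documents of one cluster."""
    counts = Counter(w for i in members for w in docs[i])
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def model_topics(docs, k=2):
    """Return one word distribution per cluster, i.e. one per topic."""
    keywords = select_keywords(docs, k)
    clusters = cluster_by_keywords(keywords)
    return [word_distribution(docs, members) for members in clusters]


if __name__ == "__main__":
    corpus = [
        "cat dog cat pet".split(),
        "dog pet dog cat".split(),
        "stock market stock trade".split(),
        "market trade market stock".split(),
    ]
    for topic in model_topics(corpus):
        print(topic)
```

Each returned dictionary maps a word to its share of all word occurrences in the cluster, so its values sum to 1. A different clustering rule or a different distribution family could be substituted without changing the overall shape of the pipeline.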
Published/granted documents
- US20090094233A1 Modeling Topics Using Statistical Distributions, publication/grant date: 2009-04-09