发明授权
US09116985B2 Computer-implemented systems and methods for taxonomy development
有权
用于分类学开发的计算机实现的系统和方法
- 专利标题: Computer-implemented systems and methods for taxonomy development
- 专利标题(中): 用于分类学开发的计算机实现的系统和方法
-
申请号: US13327949申请日: 2011-12-16
-
公开(公告)号: US09116985B2公开(公告)日: 2015-08-25
- 发明人: Bruce Monroe Mills , John Courtney Haws , John Clare Brocklebank , Thomas Robert Lehman
- 申请人: Bruce Monroe Mills , John Courtney Haws , John Clare Brocklebank , Thomas Robert Lehman
- 申请人地址: US NC Cary
- 专利权人: SAS Institute Inc.
- 当前专利权人: SAS Institute Inc.
- 当前专利权人地址: US NC Cary
- 代理机构: Kilpatrick Townsend & Stockton LLP
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Systems and methods are provided for generating a set of classifiers. A location is determined for each instance of a topic term in a collection of documents. One or more topic term phrases are identified, and one or more sentiment terms within each topic term phrase. Candidate classifiers are identified by parsing words in the one or more topic term phrases, and a colocation matrix is generated. A seed row of the colocation associated with a particular attribute is identified, and distance metrics are determined by comparing each row of the colocation matrix to the seed row. A set of classifiers are generated for the particular attribute, where classifiers in the set of classifiers are selected using the distance metrics.