Computer-Implemented Systems and Methods for Taxonomy Development
    1.
    发明申请
    Computer-Implemented Systems and Methods for Taxonomy Development 有权
    计算机实施的分类学发展系统和方法

    公开(公告)号:US20130159348A1

    公开(公告)日:2013-06-20

    申请号:US13327949

    申请日:2011-12-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06F17/30705

    摘要: Systems and methods are provided for generating a set of classifiers. A location is determined for each instance of a topic term in a collection of documents. One or more topic term phrases are identified, and one or more sentiment terms within each topic term phrase. Candidate classifiers are identified by parsing words in the one or more topic term phrases, and a colocation matrix is generated. A seed row of the colocation associated with a particular attribute is identified, and distance metrics are determined by comparing each row of the colocation matrix to the seed row. A set of classifiers are generated for the particular attribute, where classifiers in the set of classifiers are selected using the distance metrics.

    摘要翻译: 提供了用于生成一组分类器的系统和方法。 确定文档集合中每个主题项的实例的位置。 识别一个或多个主题术语短语,以及每个主题术语短语内的一个或多个情绪术语。 候选分类器通过解析一个或多个主题术语短语中的单词来识别,并且生成了一个位置矩阵。 识别与特定属性相关联的托盘的种子行,并且通过将托管矩阵的每一行与种子行进行比较来确定距离度量。 为特定属性生成一组分类器,其中使用距离度量来选择分类器集合中的分类器。

    Computer-implemented systems and methods for taxonomy development
    2.
    发明授权
    Computer-implemented systems and methods for taxonomy development 有权
    用于分类学开发的计算机实现的系统和方法

    公开(公告)号:US09116985B2

    公开(公告)日:2015-08-25

    申请号:US13327949

    申请日:2011-12-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06F17/30705

    摘要: Systems and methods are provided for generating a set of classifiers. A location is determined for each instance of a topic term in a collection of documents. One or more topic term phrases are identified, and one or more sentiment terms within each topic term phrase. Candidate classifiers are identified by parsing words in the one or more topic term phrases, and a colocation matrix is generated. A seed row of the colocation associated with a particular attribute is identified, and distance metrics are determined by comparing each row of the colocation matrix to the seed row. A set of classifiers are generated for the particular attribute, where classifiers in the set of classifiers are selected using the distance metrics.

    摘要翻译: 提供了用于生成一组分类器的系统和方法。 确定文档集合中每个主题项的实例的位置。 识别一个或多个主题术语短语,以及每个主题术语短语内的一个或多个情绪术语。 候选分类器通过解析一个或多个主题术语短语中的单词来识别,并且生成了一个位置矩阵。 识别与特定属性相关联的托盘的种子行,并且通过将托管矩阵的每一行与种子行进行比较来确定距离度量。 为特定属性生成一组分类器,其中使用距离度量来选择分类器集合中的分类器。