Systems and methods for capturing and managing collective social intelligence information
    3.
    Patent application (status: under examination, published)

    Publication number: US20110099133A1

    Publication date: 2011-04-28

    Application number: US12801779

    Filing date: 2010-06-24

    IPC classification: G06F15/18 G06N5/02

    CPC classification: G06N20/00 G06F16/353

    Abstract: A method for capturing and managing training data collected online includes: receiving a first dataset from one or more online sources; sampling the first dataset and generating a second dataset, the second dataset including the data sampled from the first dataset; receiving an annotated second dataset with predefined labels; and dividing the annotated second dataset into a training dataset and a test dataset. The disclosed method further includes: configuring a machine learning based classifier based on the training dataset; predicting at least one data point based on the training dataset and calculating a confidence score; comparing the at least one predicted data point to the test dataset; sorting the at least one predicted data point based on its confidence score; and receiving corrected training data associated with the at least one predicted data point.
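
The steps listed in this abstract can be pictured with a minimal Python sketch. It is an illustration only, not the implementation disclosed in the application: the function names, the word-count classifier, and the definition of the confidence score as the winning label's share of a smoothed score are all assumptions made for the example.

```python
import random
from collections import Counter, defaultdict

def sample(first_dataset, k, seed=0):
    """Draw the second dataset as a random sample of the first (hypothetical helper)."""
    return random.Random(seed).sample(first_dataset, k)

def split(annotated, test_fraction=0.25):
    """Divide annotated (text, label) pairs into a training set and a test set."""
    n_test = round(len(annotated) * test_fraction)
    cut = len(annotated) - n_test
    return annotated[:cut], annotated[cut:]

def train(training):
    """Toy stand-in for the classifier: per-label word counts."""
    counts = defaultdict(Counter)
    for text, label in training:
        counts[label].update(text.lower().split())
    return counts

def predict(model, text):
    """Return (label, confidence); confidence is the winner's share of the smoothed scores."""
    words = text.lower().split()
    scores = {label: 1 + sum(c[w] for w in words) for label, c in model.items()}
    best = max(scores, key=scores.get)
    return best, scores[best] / sum(scores.values())

def review_queue(model, test):
    """Compare predictions with the test labels and sort least confident first."""
    rows = []
    for text, gold in test:
        label, conf = predict(model, text)
        rows.append({"text": text, "predicted": label, "gold": gold,
                     "confidence": conf, "correct": label == gold})
    return sorted(rows, key=lambda r: r["confidence"])

def apply_corrections(training, corrections):
    """Fold reviewer-corrected (text, label) pairs back into the training data."""
    return training + list(corrections)
```

Sorting in ascending order of confidence is one possible design choice: it puts the predictions most likely to be wrong at the front of the queue, so a reviewer's corrections can be passed to `apply_corrections` and the classifier retrained.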


    Systems and methods for organizing collective social intelligence information using an organic object data model
    4.
    Patent application (status: under examination, published)

    Publication number: US20110112995A1

    Publication date: 2011-05-12

    Application number: US12801777

    Filing date: 2010-06-24

    IPC classification: G06N5/02 G06F15/18

    CPC classification: G06N20/00 G06F16/353

    Abstract: A method for capturing and organizing intelligence data using an organic data model includes: receiving one or more webpages containing social intelligence data; segmenting content of the one or more webpages containing social intelligence data; identifying named entities in the segmented content of the one or more webpages; identifying topics in the segmented content of the one or more webpages; identifying opinions in the segmented content of the one or more webpages; integrating the identified named entities, topics, and opinions to construct an organic object data model; and storing organic object data associated with the constructed organic object data model in an organic object database.
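
The pipeline in this abstract (segment, identify entities, topics, and opinions, integrate, store) can be sketched in Python as follows. The abstract does not define the structure of an organic object, so the `OrganicObject` fields, the toy lexicons, the sentence-level segmentation, and the single-table SQLite store are all hypothetical choices made for illustration; a real system would presumably use trained extractors in place of the word lists.

```python
import json
import re
import sqlite3
from dataclasses import dataclass, field, asdict

# Hypothetical toy lexicons standing in for real extractors.
ENTITIES = {"acme", "globex"}
TOPICS = {"battery", "screen", "price"}
OPINIONS = {"great": 1, "love": 1, "poor": -1, "hate": -1}

@dataclass
class OrganicObject:
    """Assumed shape: one record per segment, holding what was found in it."""
    segment: str
    entities: list = field(default_factory=list)
    topics: list = field(default_factory=list)
    opinions: list = field(default_factory=list)

def segment(page_text):
    """Split page text into sentence-like segments."""
    return [s.strip() for s in re.split(r"[.!?]+", page_text) if s.strip()]

def build_object(seg):
    """Integrate the entities, topics, and opinions found in one segment."""
    words = re.findall(r"[a-z]+", seg.lower())
    return OrganicObject(
        segment=seg,
        entities=[w for w in words if w in ENTITIES],
        topics=[w for w in words if w in TOPICS],
        opinions=[{"term": w, "polarity": OPINIONS[w]} for w in words if w in OPINIONS],
    )

def store(conn, objects):
    """Persist each object as a JSON document in a single-table store."""
    conn.execute("CREATE TABLE IF NOT EXISTS organic_objects (doc TEXT)")
    conn.executemany(
        "INSERT INTO organic_objects (doc) VALUES (?)",
        [(json.dumps(asdict(o)),) for o in objects],
    )
    conn.commit()
```

For example, calling `store(sqlite3.connect(":memory:"), [build_object(s) for s in segment(page)])` on a page's text writes one JSON document per segment to an in-memory database.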
