TEXT CLASSIFICATION BY RANKING WITH CONVOLUTIONAL NEURAL NETWORKS

    Publication No.: US20170308790A1

    Publication date: 2017-10-26

    Application No.: US15134719

    Filing date: 2016-04-21

    IPC classification: G06N3/08 G06N99/00

    CPC classification: G06N3/084 G06N3/0454

    Abstract: According to an aspect, a method includes configuring a convolutional neural network (CNN) for classifying text, based on word embedding features, into a predefined set of classes identified by class labels. The predefined set of classes includes a class labeled none-of-the-above for text that does not fit into any of the other classes in the set. The CNN is trained on a set of training data. The training includes learning the parameters of a class distributed vector representation (DVR) for each of the predefined classes, and the learning includes minimizing a pair-wise ranking loss function over the set of training data. A class embedding matrix is generated from the class DVRs of the predefined set of classes, excluding a class embedding for the none-of-the-above class. Each column in the class embedding matrix corresponds to one of the predefined classes.
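
    The abstract names a pair-wise ranking loss and a class embedding matrix with no column for the none-of-the-above class, but gives no formula. The following is a minimal sketch of one common way such a loss is written, not the patented method itself. It assumes the CNN has already produced a fixed-size text vector. The function names, the logistic form of the loss, the margin and scale values, and the rule "predict none-of-the-above when no class score is positive" are all illustrative assumptions.

```python
import numpy as np


def class_scores(text_vec, class_emb):
    """Score each real class: one dot product per column of the class embedding matrix.

    class_emb has shape (dim, num_classes) and has no column for none-of-the-above.
    """
    return text_vec @ class_emb


def pairwise_ranking_loss(scores, gold, margin_pos=2.5, margin_neg=0.5, scale=2.0):
    """Pair-wise ranking loss for one example (assumed logistic form).

    gold is a column index, or None when the example is none-of-the-above.
    Margins and scale are illustrative hyperparameters, not values from the source.
    """
    loss = 0.0
    wrong = np.ones(scores.shape[0], dtype=bool)
    if gold is not None:
        # Push the correct class score above margin_pos.
        # np.logaddexp(0, x) computes log(1 + exp(x)) stably.
        loss += np.logaddexp(0.0, scale * (margin_pos - scores[gold]))
        wrong[gold] = False
    if wrong.any():
        # Push the highest-scoring wrong class below -margin_neg.
        loss += np.logaddexp(0.0, scale * (margin_neg + scores[wrong].max()))
    return float(loss)


def predict(scores):
    """Return the best class index, or None (none-of-the-above) if no score is positive."""
    best = int(np.argmax(scores))
    return best if scores[best] > 0 else None


# Toy example: 2-dimensional text vector, 3 real classes.
text_vec = np.array([1.0, 0.0])
class_emb = np.array([[2.0, -1.0, 0.5],
                      [0.0, 1.0, -1.0]])
scores = class_scores(text_vec, class_emb)
print(predict(scores), pairwise_ranking_loss(scores, gold=0))
```

    Because none-of-the-above has no embedding column, an example with that label contributes only the second term in this sketch: training pushes every real class score down instead of pulling a dedicated class vector up.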

    GEOSPATIAL EVENT EXTRACTION AND ANALYSIS THROUGH DATA SOURCES
    Invention application (status: under examination, published)

    Publication No.: US20160210310A1

    Publication date: 2016-07-21

    Application No.: US14598776

    Filing date: 2015-01-16

    IPC classification: G06F17/30

    CPC classification: G06F16/29 G06F16/2477

    Abstract: In an approach for extracting geospatial temporal facts and events, a processor receives a set of structured data and a set of unstructured data. A processor extracts a first set of temporal information and a first set of geospatial information from the set of unstructured data. A processor identifies a second set of temporal information and a second set of geospatial information from the set of structured data. A processor determines that the set of structured data and the set of unstructured data are related, based on at least the first set of temporal information, the second set of temporal information, the first set of geospatial information, and the second set of geospatial information. A processor groups the set of structured data and the set of unstructured data into a collective set of data. A processor stores the collective set of data.
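
    The abstract does not say how relatedness is decided. As a rough sketch under stated assumptions, the relate-and-group steps could look like the code below. It assumes the temporal and geospatial information has already been extracted into a timestamp and a latitude/longitude pair per record. The `Record` type, the function names, and the time-window and distance thresholds are hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from math import asin, cos, radians, sin, sqrt


@dataclass
class Record:
    source: str      # "structured" or "unstructured"
    when: datetime   # temporal information (assumed already extracted)
    lat: float       # geospatial information, degrees
    lon: float
    payload: str


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres on a sphere of radius 6371 km."""
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    a = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(a))


def related(a, b, max_gap=timedelta(hours=6), max_km=25.0):
    """Assumed rule: two records are related if close in both time and space."""
    close_in_time = abs(a.when - b.when) <= max_gap
    close_in_space = haversine_km(a.lat, a.lon, b.lat, b.lon) <= max_km
    return close_in_time and close_in_space


def group_related(structured, unstructured):
    """Build collective sets: each structured record plus the unstructured records related to it."""
    groups = []
    for s in structured:
        matches = [u for u in unstructured if related(s, u)]
        if matches:
            groups.append([s, *matches])
    return groups


# Toy data: one sensor row, one nearby report, one report about 111 km away.
s = Record("structured", datetime(2015, 1, 16, 12, 0), 40.0, -74.0, "sensor reading")
u_near = Record("unstructured", datetime(2015, 1, 16, 13, 0), 40.05, -74.0, "nearby report")
u_far = Record("unstructured", datetime(2015, 1, 16, 13, 0), 41.0, -74.0, "distant report")
groups = group_related([s], [u_near, u_far])
print(len(groups), [r.payload for r in groups[0]])
```

    Storing the collective set, the final step in the abstract, is left out because the abstract does not describe the storage.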
