-
公开(公告)号:US20100211609A1
公开(公告)日:2010-08-19
申请号:US12371806
申请日:2009-02-16
申请人: Wuzhen Xiong , Bing Tang , Jing Liu , Han Yang , Xiaolu Dai
发明人: Wuzhen Xiong , Bing Tang , Jing Liu , Han Yang , Xiaolu Dai
IPC分类号: G06F17/30
CPC分类号: G06F17/30684
摘要: A system to process unstructured data is provided. An example system to process unstructured data comprises a receiver to access a source of unstructured data, an entity extractor to extract entity instances from the source of unstructured data and organize the extracted entity instances into an entity instance table, a pattern generator to generate a pattern comprising a key entity and one or more non-key entities associated with the key entity based on the entity instance table, and a dataset generator to generate a two-dimensional table based on the pattern and the entity instance table.
摘要翻译: 提供了一种处理非结构化数据的系统。 用于处理非结构化数据的示例系统包括:接收器,用于访问非结构化数据源;实体提取器,用于从非结构化数据源提取实体实例,并将提取的实体实例组织到实体实例表中;模式生成器,用于生成模式 包括基于所述实体实例表与所述密钥实体相关联的密钥实体和一个或多个非密钥实体,以及基于所述模式和所述实体实例表生成二维表的数据集生成器。
-
公开(公告)号:US08719308B2
公开(公告)日:2014-05-06
申请号:US12371806
申请日:2009-02-16
申请人: Wuzhen Xiong , Bing Tang , Jing Liu , Han Yang , Xiaolu Dai
发明人: Wuzhen Xiong , Bing Tang , Jing Liu , Han Yang , Xiaolu Dai
CPC分类号: G06F17/30684
摘要: A system to process unstructured data is provided. An example system to process unstructured data comprises a receiver to access a source of unstructured data, an entity extractor to extract entity instances from the source of unstructured data and organize the extracted entity instances into an entity instance table, a pattern generator to generate a pattern comprising a key entity and one or more non-key entities associated with the key entity based on the entity instance table, and a dataset generator to generate a two-dimensional table based on the pattern and the entity instance table.
摘要翻译: 提供了一种处理非结构化数据的系统。 用于处理非结构化数据的示例系统包括:接收器,用于访问非结构化数据源;实体提取器,用于从非结构化数据源提取实体实例,并将提取的实体实例组织到实体实例表中;模式生成器,用于生成模式 包括基于所述实体实例表与所述密钥实体相关联的密钥实体和一个或多个非密钥实体,以及基于所述模式和所述实体实例表生成二维表的数据集生成器。
-