APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION
    3.
    发明申请
    APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION 有权
    用于知识图表稳定的装置和方法

    公开(公告)号:US20110137919A1

    公开(公告)日:2011-06-09

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    Apparatus and method for knowledge graph stabilization
    4.
    发明授权
    Apparatus and method for knowledge graph stabilization 有权
    用于知识图稳定的装置和方法

    公开(公告)号:US08407253B2

    公开(公告)日:2013-03-26

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD
    6.
    发明申请
    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD 审中-公开
    电子文件处理装置和方法

    公开(公告)号:US20100145952A1

    公开(公告)日:2010-06-10

    申请号:US12635042

    申请日:2009-12-10

    IPC分类号: G06F17/30

    CPC分类号: G06F16/35 G06F16/31 G06F16/93

    摘要: An electronic document processing apparatus includes: a document set storage unit storing hash tables including hash values of documents to be processed; a content extraction unit for extracting body contents from a newly input electronic document; and a sentence separation unit for separating sentences from the extracted body contents. The apparatus further includes a duplicate document determination unit for converting the separated sentences into unique hash values by a hash algorithm, determining each of the separated checking if there is a duplicate sentence depending on whether or not there is a collision between the converted hash values and the hash values in the hash tables of the document set storage unit, and determining if the electronic document is a duplicate document based on the ratio of duplicate sentences to all of the sentences in the electronic document.

    摘要翻译: 电子文档处理装置包括:文档集存储单元,存储包括要处理的文档的哈希值的散列表; 内容提取单元,用于从新输入的电子文档中提取身体内容; 以及用于从所提取的身体内容中分离句子的句子分离单元。 该装置还包括一个重复文件确定单元,用于通过散列算法将分离的句子转换成唯一的散列值,根据是否存在经转换的哈希值和 所述文档集存储单元的散列表中的散列值,以及基于所述电子文档中的所有句子的重复句子的比例来确定所述电子文档是否是重复的文档。

    PERSONALIZED SEARCH APPARATUS AND METHOD
    7.
    发明申请
    PERSONALIZED SEARCH APPARATUS AND METHOD 审中-公开
    个性化搜索设备和方法

    公开(公告)号:US20100145922A1

    公开(公告)日:2010-06-10

    申请号:US12628171

    申请日:2009-11-30

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535

    摘要: A personalized search apparatus includes: a model generating unit for generating a user favorites analysis model based on directory grouping information about directories stored in a user terminal and user behavior information; and a user favorites analysis model DB for storing the generated user favorites analysis model. Further, the personalized search apparatus includes a search engine for searching for a file relevant to an input query using an information search engine installed in the user terminal to generate search results; and a personalized search engine for re-ranking the search results generated by the search engine based on the user favorites analysis model to generate personalized search results.

    摘要翻译: 个性化搜索装置包括:模型生成单元,用于基于关于存储在用户终端中的目录的目录分组信息和用户行为信息来生成用户收藏分析模型; 以及用于存储生成的用户收藏分析模型的用户收藏分析模型DB。 此外,个性化搜索装置包括搜索引擎,用于使用安装在用户终端中的信息搜索引擎来搜索与输入查询相关的文件,以生成搜索结果; 以及个性化搜索引擎,用于基于用户收藏分析模型重新排列由搜索引擎生成的搜索结果,以生成个性化搜索结果。

    APPARATUS AND METHOD FOR SELECTING ONLINE ADVERTISEMENT BASED ON CONTENTS SENTIMENT AND INTENTION ANALYSIS
    10.
    发明申请
    APPARATUS AND METHOD FOR SELECTING ONLINE ADVERTISEMENT BASED ON CONTENTS SENTIMENT AND INTENTION ANALYSIS 审中-公开
    基于内容分析和意向分析选择在线广告的设备和方法

    公开(公告)号:US20100153210A1

    公开(公告)日:2010-06-17

    申请号:US12537542

    申请日:2009-08-07

    IPC分类号: G06Q30/00

    摘要: The invention provides an apparatus and method for selecting an online advertisement. An apparatus for selecting an online advertisement based on contents sentiment and intention analysis includes a context analysis unit for analyzing a context of contents, a context matching advertisement recommendation unit for selecting an advertisement matching with the context of the contents from an advertisement database (DB) based on the result of the analyzed context, an sentiment information analysis unit for analyzing an sentiment object and sentiment information variously described in the contents based on the result of the analyzed context, an intention recognition unit for recognizing a writing intention of the contents, and an advertisement selection unit for excluding the selected advertisement for the contents or selecting an alternative advertisement depending on the result of the analyzed context, the result of the analyzed sentiment object and sentiment information and the recognized writing intention.

    摘要翻译: 本发明提供一种用于选择在线广告的装置和方法。 一种用于基于内容情感和意图分析来选择在线广告的装置,包括用于分析内容上下文的上下文分析单元,用于从广告数据库(DB)中选择与内容的上下文匹配的广告的上下文匹配广告推荐单元, 基于所分析的上下文的结果,用于基于分析的上下文的结果分析内容中的情绪对象和情绪信息的情绪信息分析单元,识别内容的写意图的意图识别单元,以及 广告选择单元,用于根据所分析的上下文的结果,所分析的情绪对象的结果和情绪信息以及所识别的书写意图,排除所选内容的广告或选择替代广告。