APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION
    1.
    发明申请
    APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION 有权
    用于知识图表稳定的装置和方法

    公开(公告)号:US20110137919A1

    公开(公告)日:2011-06-09

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    Apparatus and method for knowledge graph stabilization
    2.
    发明授权
    Apparatus and method for knowledge graph stabilization 有权
    用于知识图稳定的装置和方法

    公开(公告)号:US08407253B2

    公开(公告)日:2013-03-26

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD
    3.
    发明申请
    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD 审中-公开
    电子文件处理装置和方法

    公开(公告)号:US20100145952A1

    公开(公告)日:2010-06-10

    申请号:US12635042

    申请日:2009-12-10

    IPC分类号: G06F17/30

    CPC分类号: G06F16/35 G06F16/31 G06F16/93

    摘要: An electronic document processing apparatus includes: a document set storage unit storing hash tables including hash values of documents to be processed; a content extraction unit for extracting body contents from a newly input electronic document; and a sentence separation unit for separating sentences from the extracted body contents. The apparatus further includes a duplicate document determination unit for converting the separated sentences into unique hash values by a hash algorithm, determining each of the separated checking if there is a duplicate sentence depending on whether or not there is a collision between the converted hash values and the hash values in the hash tables of the document set storage unit, and determining if the electronic document is a duplicate document based on the ratio of duplicate sentences to all of the sentences in the electronic document.

    摘要翻译: 电子文档处理装置包括:文档集存储单元,存储包括要处理的文档的哈希值的散列表; 内容提取单元,用于从新输入的电子文档中提取身体内容; 以及用于从所提取的身体内容中分离句子的句子分离单元。 该装置还包括一个重复文件确定单元,用于通过散列算法将分离的句子转换成唯一的散列值,根据是否存在经转换的哈希值和 所述文档集存储单元的散列表中的散列值,以及基于所述电子文档中的所有句子的重复句子的比例来确定所述电子文档是否是重复的文档。

    PERSONALIZED SEARCH APPARATUS AND METHOD
    4.
    发明申请
    PERSONALIZED SEARCH APPARATUS AND METHOD 审中-公开
    个性化搜索设备和方法

    公开(公告)号:US20100145922A1

    公开(公告)日:2010-06-10

    申请号:US12628171

    申请日:2009-11-30

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535

    摘要: A personalized search apparatus includes: a model generating unit for generating a user favorites analysis model based on directory grouping information about directories stored in a user terminal and user behavior information; and a user favorites analysis model DB for storing the generated user favorites analysis model. Further, the personalized search apparatus includes a search engine for searching for a file relevant to an input query using an information search engine installed in the user terminal to generate search results; and a personalized search engine for re-ranking the search results generated by the search engine based on the user favorites analysis model to generate personalized search results.

    摘要翻译: 个性化搜索装置包括:模型生成单元,用于基于关于存储在用户终端中的目录的目录分组信息和用户行为信息来生成用户收藏分析模型; 以及用于存储生成的用户收藏分析模型的用户收藏分析模型DB。 此外,个性化搜索装置包括搜索引擎,用于使用安装在用户终端中的信息搜索引擎来搜索与输入查询相关的文件,以生成搜索结果; 以及个性化搜索引擎,用于基于用户收藏分析模型重新排列由搜索引擎生成的搜索结果,以生成个性化搜索结果。