APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION
    1.
    发明申请
    APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION 有权
    用于知识图表稳定的装置和方法

    公开(公告)号:US20110137919A1

    公开(公告)日:2011-06-09

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    Apparatus and method for knowledge graph stabilization
    2.
    发明授权
    Apparatus and method for knowledge graph stabilization 有权
    用于知识图稳定的装置和方法

    公开(公告)号:US08407253B2

    公开(公告)日:2013-03-26

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD
    3.
    发明申请
    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD 审中-公开
    电子文件处理装置和方法

    公开(公告)号:US20100145952A1

    公开(公告)日:2010-06-10

    申请号:US12635042

    申请日:2009-12-10

    IPC分类号: G06F17/30

    CPC分类号: G06F16/35 G06F16/31 G06F16/93

    摘要: An electronic document processing apparatus includes: a document set storage unit storing hash tables including hash values of documents to be processed; a content extraction unit for extracting body contents from a newly input electronic document; and a sentence separation unit for separating sentences from the extracted body contents. The apparatus further includes a duplicate document determination unit for converting the separated sentences into unique hash values by a hash algorithm, determining each of the separated checking if there is a duplicate sentence depending on whether or not there is a collision between the converted hash values and the hash values in the hash tables of the document set storage unit, and determining if the electronic document is a duplicate document based on the ratio of duplicate sentences to all of the sentences in the electronic document.

    摘要翻译: 电子文档处理装置包括:文档集存储单元,存储包括要处理的文档的哈希值的散列表; 内容提取单元,用于从新输入的电子文档中提取身体内容; 以及用于从所提取的身体内容中分离句子的句子分离单元。 该装置还包括一个重复文件确定单元,用于通过散列算法将分离的句子转换成唯一的散列值,根据是否存在经转换的哈希值和 所述文档集存储单元的散列表中的散列值,以及基于所述电子文档中的所有句子的重复句子的比例来确定所述电子文档是否是重复的文档。

    PERSONALIZED SEARCH APPARATUS AND METHOD
    4.
    发明申请
    PERSONALIZED SEARCH APPARATUS AND METHOD 审中-公开
    个性化搜索设备和方法

    公开(公告)号:US20100145922A1

    公开(公告)日:2010-06-10

    申请号:US12628171

    申请日:2009-11-30

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535

    摘要: A personalized search apparatus includes: a model generating unit for generating a user favorites analysis model based on directory grouping information about directories stored in a user terminal and user behavior information; and a user favorites analysis model DB for storing the generated user favorites analysis model. Further, the personalized search apparatus includes a search engine for searching for a file relevant to an input query using an information search engine installed in the user terminal to generate search results; and a personalized search engine for re-ranking the search results generated by the search engine based on the user favorites analysis model to generate personalized search results.

    摘要翻译: 个性化搜索装置包括:模型生成单元,用于基于关于存储在用户终端中的目录的目录分组信息和用户行为信息来生成用户收藏分析模型; 以及用于存储生成的用户收藏分析模型的用户收藏分析模型DB。 此外,个性化搜索装置包括搜索引擎,用于使用安装在用户终端中的信息搜索引擎来搜索与输入查询相关的文件,以生成搜索结果; 以及个性化搜索引擎,用于基于用户收藏分析模型重新排列由搜索引擎生成的搜索结果,以生成个性化搜索结果。

    APPARATUS AND METHOD FOR SELECTING ONLINE ADVERTISEMENT BASED ON CONTENTS SENTIMENT AND INTENTION ANALYSIS
    8.
    发明申请
    APPARATUS AND METHOD FOR SELECTING ONLINE ADVERTISEMENT BASED ON CONTENTS SENTIMENT AND INTENTION ANALYSIS 审中-公开
    基于内容分析和意向分析选择在线广告的设备和方法

    公开(公告)号:US20100153210A1

    公开(公告)日:2010-06-17

    申请号:US12537542

    申请日:2009-08-07

    IPC分类号: G06Q30/00

    摘要: The invention provides an apparatus and method for selecting an online advertisement. An apparatus for selecting an online advertisement based on contents sentiment and intention analysis includes a context analysis unit for analyzing a context of contents, a context matching advertisement recommendation unit for selecting an advertisement matching with the context of the contents from an advertisement database (DB) based on the result of the analyzed context, an sentiment information analysis unit for analyzing an sentiment object and sentiment information variously described in the contents based on the result of the analyzed context, an intention recognition unit for recognizing a writing intention of the contents, and an advertisement selection unit for excluding the selected advertisement for the contents or selecting an alternative advertisement depending on the result of the analyzed context, the result of the analyzed sentiment object and sentiment information and the recognized writing intention.

    摘要翻译: 本发明提供一种用于选择在线广告的装置和方法。 一种用于基于内容情感和意图分析来选择在线广告的装置,包括用于分析内容上下文的上下文分析单元,用于从广告数据库(DB)中选择与内容的上下文匹配的广告的上下文匹配广告推荐单元, 基于所分析的上下文的结果,用于基于分析的上下文的结果分析内容中的情绪对象和情绪信息的情绪信息分析单元,识别内容的写意图的意图识别单元,以及 广告选择单元,用于根据所分析的上下文的结果,所分析的情绪对象的结果和情绪信息以及所识别的书写意图,排除所选内容的广告或选择替代广告。

    TOPIC MAP BASED INDEXING AND SEARCHING APPARATUS
    9.
    发明申请
    TOPIC MAP BASED INDEXING AND SEARCHING APPARATUS 有权
    基于主题地图的索引和搜索设备

    公开(公告)号:US20100153094A1

    公开(公告)日:2010-06-17

    申请号:US12484651

    申请日:2009-06-15

    IPC分类号: G06F17/27 G06F17/30

    CPC分类号: G06F17/30654

    摘要: A topic map based indexing apparatus analyzes community Q/A lists to acquire Q/A analysis information, removes redundant answers depending on the Q/A analysis information, removes insignificant answers based on the degree of reliability, ranks answer lists, and extracts the highest ranking answer as a best answer, to thereby store, in a community Q/A topic map, index information containing the community Q/A lists and the Q/A analysis information. A topic map based searching apparatus analyzes a user question to acquire question analysis information, searches similar questions from community Q/A lists belonging to a specific topic node of a pre-stored community Q/A topic map, ranks the searched similar questions depending on the question analysis information, removes redundant answers among answers to the ranked similar questions, ranks the answers, and extracts the highest ranking answer as a best answer.

    摘要翻译: 基于主题地图的索引设备分析社区Q / A列表以获取Q / A分析信息,根据Q / A分析信息删除冗余答案,根据可靠性程度删除不重要的答案,排列答案列表,并提取最高 排名答案作为最佳答案,从而在社区Q / A主题地图中存储包含社区Q / A列表和Q / A分析信息的索引信息。 基于主题图的搜索装置分析用户问题以获取问题分析信息,从属于预存的社区Q / A主题图的特定主题节点的社区Q / A列表中搜索类似的问题,根据 问题分析信息,删除排名相似的问题的答案之间的冗余答案,排列答案,并提取最高排名答案作为最佳答案。

    METHOD AND APPARATUS FOR SOCIAL TAGGING USING PROPERTY FIELD OF ONTOLOGY OBJECT
    10.
    发明申请
    METHOD AND APPARATUS FOR SOCIAL TAGGING USING PROPERTY FIELD OF ONTOLOGY OBJECT 审中-公开
    使用属性的社会标签的方法和装置本体对象的领域

    公开(公告)号:US20100023549A1

    公开(公告)日:2010-01-28

    申请号:US12417232

    申请日:2009-04-02

    IPC分类号: G06F17/30

    CPC分类号: G06F16/48

    摘要: A method for social tagging using a property field of an ontology object includes: selecting an object in an ontology database storing therein objects in forms of classes; selecting a property field in a class corresponding to the selected object; and adding a social tag by storing user's input as a value of the selected property field. Classes stored in the ontology database may have property fields defined when instances of the classes are created, and specific values may be stored as values of the property fields also when the instances are created. The property fields defined when the instances are created are classified into data type property fields and object type property field, and the selected property field is a data type property field.

    摘要翻译: 使用本体对象的属性字段进行社会标记的方法包括:以存储在对象中的对象的类的形式选择本体数据库中的对象; 选择与所选对象相对应的类中的属性字段; 并通过将用户的输入存储为所选属性字段的值来添加社交标签。 存储在本体数据库中的类可以在创建类的实例时定义属性字段,并且在实例创建时也可以将特定值存储为属性字段的值。 创建实例时定义的属性字段分为数据类型属性字段和对象类型属性字段,所选属性字段是数据类型属性字段。