Apparatus and method for knowledge graph stabilization
    1.
    发明授权
    Apparatus and method for knowledge graph stabilization 有权
    用于知识图稳定的装置和方法

    公开(公告)号:US08407253B2

    公开(公告)日:2013-03-26

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    Apparatus for question answering based on answer trustworthiness and method thereof
    2.
    发明授权
    Apparatus for question answering based on answer trustworthiness and method thereof 有权
    基于答案可信度的问答设备及其方法

    公开(公告)号:US08380713B2

    公开(公告)日:2013-02-19

    申请号:US12814220

    申请日:2010-06-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30654

    摘要: Provides is an apparatus for question answering based on answer trustworthiness including: an answer indexer that indexes documents of which document trustworthiness satisfying a threshold value among documents included in a document collection and stores it in a knowledge Bases; an answer candidate extractor that extracts answer candidate documents for a user's question from the knowledge Bases; an answer source trustworthiness measurement unit; an answer extraction strategy trustworthiness measurement unit; and a trustworthiness integrator that generates an answer candidate trustworthiness list by ranking the answer candidate documents on the basis of the document trustworthiness, the source trustworthiness, and the extraction strategy trustworthiness of the answer candidate documents.

    摘要翻译: 提供是一种基于应答可信赖性的问答设备,包括:答案索引器,其对包含在文档集合中的文档中的哪个文档信任度满足阈值的文档进行索引,并将其存储在知识库中; 答案候选者提取器,从知识基础中提取用户问题的答案候选文件; 答案源可信度测量单位; 答案提取策略可信度测量单位; 以及可信度积分器,其通过基于答复候选文档的文档可信度,源可信度和提取策略的可信度来对答案候选文档进行排名来生成答复候选可信度列表。

    APPARATUS AND METHOD FOR A QUERY EXPRESS
    3.
    发明申请
    APPARATUS AND METHOD FOR A QUERY EXPRESS 有权
    查询和显示方法

    公开(公告)号:US20110179056A1

    公开(公告)日:2011-07-21

    申请号:US12671909

    申请日:2007-12-27

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30023 G06F17/30926

    摘要: Disclosed is an apparatus and method for expressing a query for searching multimedia data. The apparatus and method of the present invention expresses diverse query types in MPEG-7 query formats and uses field types to re use a designated region. The apparatus for expressing a query inputted from a user for multimedia data search includes: an input means for receiving a query for multimedia data search from a user; and a query expression means for expressing the input query in a field type, wherein the field type includes at least one among identifier information indicating identification (ID) information of a field presenting a search condition included in the input query; type information indicating data type information of the field; and reference information indicating identifier information of another field for reference. The present invention is applied to MPEG-7 query formats.

    摘要翻译: 公开了一种用于表达用于搜索多媒体数据的查询的装置和方法。 本发明的装置和方法以MPEG-7查询格式表示不同的查询类型,并且使用字段类型来重新使用指定的区域。 用于表示从用户输入的用于多媒体数据搜索的查询的装置包括:输入装置,用于从用户接收用于多媒体数据搜索的查询; 以及用于以场类型表达输入查询的查询表达装置,其中,所述字段类型包括表示所述输入查询中包括的搜索条件的字段的标识(ID)信息的标识符信息中的至少一个; 指示该字段的数据类型信息的类型信息; 以及指示用于参考的另一场的标识符信息的参考信息。 本发明应用于MPEG-7查询格式。

    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD
    4.
    发明申请
    ELECTRONIC DOCUMENT PROCESSING APPARATUS AND METHOD 审中-公开
    电子文件处理装置和方法

    公开(公告)号:US20100145952A1

    公开(公告)日:2010-06-10

    申请号:US12635042

    申请日:2009-12-10

    IPC分类号: G06F17/30

    CPC分类号: G06F16/35 G06F16/31 G06F16/93

    摘要: An electronic document processing apparatus includes: a document set storage unit storing hash tables including hash values of documents to be processed; a content extraction unit for extracting body contents from a newly input electronic document; and a sentence separation unit for separating sentences from the extracted body contents. The apparatus further includes a duplicate document determination unit for converting the separated sentences into unique hash values by a hash algorithm, determining each of the separated checking if there is a duplicate sentence depending on whether or not there is a collision between the converted hash values and the hash values in the hash tables of the document set storage unit, and determining if the electronic document is a duplicate document based on the ratio of duplicate sentences to all of the sentences in the electronic document.

    摘要翻译: 电子文档处理装置包括:文档集存储单元,存储包括要处理的文档的哈希值的散列表; 内容提取单元,用于从新输入的电子文档中提取身体内容; 以及用于从所提取的身体内容中分离句子的句子分离单元。 该装置还包括一个重复文件确定单元,用于通过散列算法将分离的句子转换成唯一的散列值,根据是否存在经转换的哈希值和 所述文档集存储单元的散列表中的散列值,以及基于所述电子文档中的所有句子的重复句子的比例来确定所述电子文档是否是重复的文档。

    PERSONALIZED SEARCH APPARATUS AND METHOD
    5.
    发明申请
    PERSONALIZED SEARCH APPARATUS AND METHOD 审中-公开
    个性化搜索设备和方法

    公开(公告)号:US20100145922A1

    公开(公告)日:2010-06-10

    申请号:US12628171

    申请日:2009-11-30

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535

    摘要: A personalized search apparatus includes: a model generating unit for generating a user favorites analysis model based on directory grouping information about directories stored in a user terminal and user behavior information; and a user favorites analysis model DB for storing the generated user favorites analysis model. Further, the personalized search apparatus includes a search engine for searching for a file relevant to an input query using an information search engine installed in the user terminal to generate search results; and a personalized search engine for re-ranking the search results generated by the search engine based on the user favorites analysis model to generate personalized search results.

    摘要翻译: 个性化搜索装置包括:模型生成单元,用于基于关于存储在用户终端中的目录的目录分组信息和用户行为信息来生成用户收藏分析模型; 以及用于存储生成的用户收藏分析模型的用户收藏分析模型DB。 此外,个性化搜索装置包括搜索引擎,用于使用安装在用户终端中的信息搜索引擎来搜索与输入查询相关的文件,以生成搜索结果; 以及个性化搜索引擎,用于基于用户收藏分析模型重新排列由搜索引擎生成的搜索结果,以生成个性化搜索结果。

    METHOD AND APPARATUS FOR SOCIAL TAGGING USING PROPERTY FIELD OF ONTOLOGY OBJECT
    6.
    发明申请
    METHOD AND APPARATUS FOR SOCIAL TAGGING USING PROPERTY FIELD OF ONTOLOGY OBJECT 审中-公开
    使用属性的社会标签的方法和装置本体对象的领域

    公开(公告)号:US20100023549A1

    公开(公告)日:2010-01-28

    申请号:US12417232

    申请日:2009-04-02

    IPC分类号: G06F17/30

    CPC分类号: G06F16/48

    摘要: A method for social tagging using a property field of an ontology object includes: selecting an object in an ontology database storing therein objects in forms of classes; selecting a property field in a class corresponding to the selected object; and adding a social tag by storing user's input as a value of the selected property field. Classes stored in the ontology database may have property fields defined when instances of the classes are created, and specific values may be stored as values of the property fields also when the instances are created. The property fields defined when the instances are created are classified into data type property fields and object type property field, and the selected property field is a data type property field.

    摘要翻译: 使用本体对象的属性字段进行社会标记的方法包括:以存储在对象中的对象的类的形式选择本体数据库中的对象; 选择与所选对象相对应的类中的属性字段; 并通过将用户的输入存储为所选属性字段的值来添加社交标签。 存储在本体数据库中的类可以在创建类的实例时定义属性字段,并且在实例创建时也可以将特定值存储为属性字段的值。 创建实例时定义的属性字段分为数据类型属性字段和对象类型属性字段,所选属性字段是数据类型属性字段。

    Apparatus and method for searching multimedia data based on metadata
    7.
    发明申请
    Apparatus and method for searching multimedia data based on metadata 失效
    基于元数据搜索多媒体数据的装置和方法

    公开(公告)号:US20070233673A1

    公开(公告)日:2007-10-04

    申请号:US11701955

    申请日:2007-02-02

    IPC分类号: G06F17/30

    摘要: Provided are an apparatus and method for searching multimedia data based on metadata. The apparatus for searching multimedia data includes: a mapping information storing unit for storing and managing mapping information between a Moving Picture Experts Group 7 (MPEG-7) query attribute and an MPEG-7 metadata property; and a query attribute mapping unit for acquiring the MPEG-7 metadata property to be mapped with the MPEG-7 query attribute according to a user query based on the mapping information.

    摘要翻译: 提供了一种基于元数据来搜索多媒体数据的装置和方法。 用于搜索多媒体数据的装置包括:映射信息存储单元,用于在运动图像专家组7(MPEG-7)查询属性和MPEG-7元数据属性之间存储和管理映射信息; 以及查询属性映射单元,用于根据基于映射信息的用户查询获取要与MPEG-7查询属性映射的MPEG-7元数据属性。

    Method and apparatus for retrieving multimedia contents
    8.
    发明授权
    Method and apparatus for retrieving multimedia contents 有权
    用于检索多媒体内容的方法和装置

    公开(公告)号:US08577919B2

    公开(公告)日:2013-11-05

    申请号:US12597158

    申请日:2008-04-23

    IPC分类号: G06F17/30

    摘要: Disclosed is an apparatus and method for retrieving multimedia contents represented in a Moving Picture Experts Group (MPEG) 7 by transforming a user query into an MPEG-7 query format. The method for retrieving multimedia contents includes: representing a user query by using an indicator indicating a specific region of a Moving Picture Experts Group 7 (MPEG-7) document and a reference for referring to the indicator; analyzing a meaning of the user query represented by using the indicator and the reference to thereby produce an analysis result; and retrieving multimedia contents according to the analysis result. The present research can satisfy more than two retrieval conditions within the same structure in an MPEG-7 query format and it can also clearly represent that two different MPEG-7 documents are referred to. Since the meaning of a user query is analyzed accurately during retrieval process, it is possible to precisely retrieve multimedia contents.

    摘要翻译: 公开了通过将用户查询变换为MPEG-7查询格式来检索运动图像专家组(MPEG)7中所表示的多媒体内容的装置和方法。 用于检索多媒体内容的方法包括:通过使用指示运动图像专家组7(MPEG-7)文档的特定区域的指示符和用于参考该指示符的参考来表示用户查询; 分析通过使用指示符和参考表示的用户查询的含义,从而产生分析结果; 并根据分析结果检索多媒体内容。 本研究可以以MPEG-7查询格式在同一结构内满足两个以上的检索条件,并且还可以清楚地表示两个不同的MPEG-7文档。 由于在检索过程中对用户查询的含义进行了准确的分析,因此可以精确地检索多媒体内容。

    APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION
    9.
    发明申请
    APPARATUS AND METHOD FOR KNOWLEDGE GRAPH STABILIZATION 有权
    用于知识图表稳定的装置和方法

    公开(公告)号:US20110137919A1

    公开(公告)日:2011-06-09

    申请号:US12877063

    申请日:2010-09-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30958

    摘要: A method for stabilizing a knowledge graph includes: generating a knowledge graph in which same entities in a semantic relation list between entities provided as an input are represented as a single node based on names and types of the entities; computing, on the knowledge graph, semantic similarities between all potential entity pairs of same entity types by comparing, for each potential entity pair, a type of relation associated with an entity in the entity pair and an opponent entity to the entity; and selecting, based on the semantic similarities, a representative entity from each of semantically similar entity pairs on the knowledge graph and integrating an opponent entity to the representative entity into the representative entity. The method further includes computing relation weighted values between the entities by using a graph analysis and statistic information, and adding the weighted values to the knowledge graph.

    摘要翻译: 一种用于稳定知识图的方法包括:生成知识图,其中根据实体的名称和类型将提供作为输入的实体之间的语义关系列表中的相同实体表示为单个节点; 在知识图上计算相同实体类型的所有潜在实体对之间的语义相似性,通过对于每个潜在实体对,将与实体对中的实体相关联的关系的类型与对该实体的对象实体进行比较; 并且基于语义相似性,从知识图上的每个语义上相似的实体对中选择代表性实体,并将对手实体整合到代表性实体中。 该方法还包括通过使用图分析和统计信息来计算实体之间的关系加权值,并将加权值加到知识图中。

    Apparatus and method for constructing learning data
    10.
    发明授权
    Apparatus and method for constructing learning data 有权
    用于构建学习数据的装置和方法

    公开(公告)号:US07725408B2

    公开(公告)日:2010-05-25

    申请号:US11633190

    申请日:2006-12-04

    IPC分类号: G06F15/18

    CPC分类号: G06F17/2818

    摘要: An apparatus and method for efficiently constructing learning data required in statistical methodology used in information retrieval, information extraction, translation, natural language processing, etc. are provided. The method includes the steps of: generating learning models by performing machine learning with respect to learning data; attaching tags to a raw corpus automatically by using the generated learning models to thereby generate learning data candidates; calculating confidence scores of the generated learning data candidates, and then selecting a learning data candidate using the confidence scores; and allowing a user to correct an error in the selected learning data candidate through an interface and adding the error-corrected learning data candidate to the learning data, thereby adding new learning models incrementally.

    摘要翻译: 提供了一种用于有效地构建用于信息检索,信息提取,翻译,自然语言处理等的统计方法中所需的学习数据的装置和方法。 该方法包括以下步骤:通过相对于学习数据执行机器学习来产生学习模型; 通过使用生成的学习模型自动附加标签到原始语料库,从而生成学习数据候选; 计算生成的学习数据候选的置信度分数,然后使用置信分数选择学习数据候选; 并且允许用户通过接口校正所选择的学习数据候选中的错误,并将纠错学习数据候选者添加到学习数据中,从而逐渐增加新的学习模型。