Natural Language Hypernym Weighting For Word Sense Disambiguation
    3.
    发明申请
    Natural Language Hypernym Weighting For Word Sense Disambiguation 有权
    自然语言Hypernym加权词义消歧

    公开(公告)号:US20090089047A1

    公开(公告)日:2009-04-02

    申请号:US12201015

    申请日:2008-08-29

    IPC分类号: G06F17/27 G06F17/30

    摘要: Technologies are described herein for probabilistically assigning weights to word senses and hypernyms of a word. The weights can be used in natural language processing applications such as information indexing and querying. A word hypernym weight (WHW) score can be determined by summing word sense probabilities of word senses from which the hypernym is inherited. WHW scores can be used to prune away hypernyms prior to indexing, to rank query results, and for other functions related to information indexing and querying. A semantic search technique can use WHW scores to retrieve an entry related to a word from an index in response to matching an indexed hypernym of the word with a query term applied to the index. More refined and accurate query results may be provided based on reduced user inputs.

    摘要翻译: 技术在这里被描述为概率地将权重分配给单词的单词感觉和高词。 权重可用于自然语言处理应用程序,如信息索引和查询。 单词超音速重量(WHW)分数可以通过求和超音速遗传的单词感觉的词义概率来确定。 WHW分数可以用于在索引之前修剪高分辨率,对查询结果进行排序,以及与信息索引和查询相关的其他功能。 语义搜索技术可以使用WHW分数来从索引中检索与索引相关的条目,以响应于将索引的单词的超文本与应用于索引的查询项匹配。 可以基于减少的用户输入来提供更精确和准确的查询结果。

    Natural language hypernym weighting for word sense disambiguation
    4.
    发明授权
    Natural language hypernym weighting for word sense disambiguation 有权
    自然语言hypernym加权词义消歧

    公开(公告)号:US08463593B2

    公开(公告)日:2013-06-11

    申请号:US12201015

    申请日:2008-08-29

    IPC分类号: G06F17/27 G06F17/30

    摘要: Technologies are described herein for probabilistically assigning weights to word senses and hypernyms of a word. The weights can be used in natural language processing applications such as information indexing and querying. A word hypernym weight (WHW) score can be determined by summing word sense probabilities of word senses from which the hypernym is inherited. WHW scores can be used to prune away hypernyms prior to indexing, to rank query results, and for other functions related to information indexing and querying. A semantic search technique can use WHW scores to retrieve an entry related to a word from an index in response to matching an indexed hypernym of the word with a query term applied to the index. More refined and accurate query results may be provided based on reduced user inputs.

    摘要翻译: 技术在这里被描述为概率地将权重分配给单词的单词感觉和高词。 权重可用于自然语言处理应用程序,如信息索引和查询。 单词超音速重量(WHW)分数可以通过求和超音速遗传的单词感觉的词义概率来确定。 WHW分数可以用于在索引之前修剪高分辨率,对查询结果进行排序,以及与信息索引和查询相关的其他功能。 语义搜索技术可以使用WHW分数来从索引中检索与索引相关的条目,以响应于将索引的单词的超文本与应用于索引的查询项匹配。 可以基于减少的用户输入来提供更精确和准确的查询结果。

    Efficiently representing word sense probabilities
    5.
    发明授权
    Efficiently representing word sense probabilities 有权
    有效地表示单词感觉概率

    公开(公告)号:US08280721B2

    公开(公告)日:2012-10-02

    申请号:US12200999

    申请日:2008-08-29

    IPC分类号: G06F17/27

    CPC分类号: G06F17/2755

    摘要: Word sense probabilities are compressed for storage in a semantic index. Each word sense for a word is mapped to one of a number of “buckets” by assigning a bucket score to the word sense. A scoring function is utilized to assign the bucket scores that maximizes the entropy of the assigned bucket scores. Once the bucket scores have been assigned to the word senses, the bucket scores are stored in the semantic index. The bucket scores stored in the semantic index may be utilized to prune one or more of the word senses prior to construction of the semantic index. The bucket scores may also be utilized to prune and rank the word senses at the time a query is performed using the semantic index.

    摘要翻译: 字义概率被压缩以存储在语义索引中。 通过将桶分数分配给词语,将单词的每个单词感觉映射到多个水桶中的一个。 使用评分函数来分配使分配的桶分数的熵最大化的桶分数。 一旦桶分数被分配到单词感觉,桶分数被存储在语义索引中。 存储在语义索引中的桶分数可以用于在构建语义索引之前修剪一个或多个单词感觉。 桶分数也可用于在使用语义索引执行查询的时候对单词感觉进行修剪和排序。

    Efficiently Representing Word Sense Probabilities
    6.
    发明申请
    Efficiently Representing Word Sense Probabilities 有权
    有效地代表词义概率

    公开(公告)号:US20090094019A1

    公开(公告)日:2009-04-09

    申请号:US12200999

    申请日:2008-08-29

    IPC分类号: G06F17/27

    CPC分类号: G06F17/2755

    摘要: Word sense probabilities are compressed for storage in a semantic index. Each word sense for a word is mapped to one of a number of “buckets” by assigning a bucket score to the word sense. A scoring function is utilized to assign the bucket scores that maximizes the entropy of the assigned bucket scores. Once the bucket scores have been assigned to the word senses, the bucket scores are stored in the semantic index. The bucket scores stored in the semantic index may be utilized to prune one or more of the word senses prior to construction of the semantic index. The bucket scores may also be utilized to prune and rank the word senses at the time a query is performed using the semantic index.

    摘要翻译: 字义概率被压缩以存储在语义索引中。 通过将一个桶分数分配给单词感觉,将单词的每个单词感觉映射到多个“桶”中的一个。 使用评分函数来分配使分配的桶分数的熵最大化的桶分数。 一旦桶分数被分配到单词感觉,桶分数被存储在语义索引中。 存储在语义索引中的桶分数可以用于在构建语义索引之前修剪一个或多个单词感觉。 桶分数也可用于在使用语义索引执行查询的时候对单词感觉进行修剪和排序。