Rapid automatic keyword extraction for information retrieval and analysis
    1.
    发明授权
    Rapid automatic keyword extraction for information retrieval and analysis 有权
    快速自动关键词提取,用于信息检索和分析

    公开(公告)号:US08131735B2

    公开(公告)日:2012-03-06

    申请号:US12555916

    申请日:2009-09-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30616

    摘要: Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.

    摘要翻译: 快速自动关键词提取的信息检索和分析方法和系统。 实施例可以包括通过分隔符,停止词或两者来解析单个文档中的单词以识别候选关键字。 然后根据共同发生程度,共同发生频率或两者的函数计算候选关键字中每个单词的单词分数。 基于候选关键字中的单词的分数的函数,针对每个候选关键字计算关键词分数。 然后至少部分地基于具有最高关键词分数的候选关键词,将候选关键词的一部分提取为关键字。

    Rapid Automatic Keyword Extraction for Information Retrieval and Analysis
    4.
    发明申请
    Rapid Automatic Keyword Extraction for Information Retrieval and Analysis 有权
    快速自动关键词提取信息检索与分析

    公开(公告)号:US20110060747A1

    公开(公告)日:2011-03-10

    申请号:US12555916

    申请日:2009-09-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30616

    摘要: Methods and systems for rapid automatic keyword extraction for information retrieval and analysis. Embodiments can include parsing words in an individual document by delimiters, stop words, or both in order to identify candidate keywords. Word scores for each word within the candidate keywords are then calculated based on a function of co-occurrence degree, co-occurrence frequency, or both. Based on a function of the word scores for words within the candidate keyword, a keyword score is calculated for each of the candidate keywords. A portion of the candidate keywords are then extracted as keywords based, at least in part, on the candidate keywords having the highest keyword scores.

    摘要翻译: 快速自动关键词提取的信息检索和分析方法和系统。 实施例可以包括通过分隔符,停止词或两者来解析单个文档中的单词以识别候选关键字。 然后根据共同发生程度,共同发生频率或两者的函数计算候选关键字中每个单词的单词分数。 基于候选关键字中的单词的分数的函数,针对每个候选关键字计算关键词分数。 然后至少部分地基于具有最高关键词分数的候选关键词,将候选关键词的一部分提取为关键字。