Method for searching in a plurality of data sets and search engine
    1.
    Granted invention patent
    Method for searching in a plurality of data sets and search engine (status: in force)

    Publication No.: US09087119B2

    Publication date: 2015-07-21

    Application No.: US13818180

    Filing date: 2011-08-17

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06F17/3064

    Abstract: The invention relates to a computer-implemented method for searching in a plurality of data sets. In the method, a search query is received and a partial quantity having terms similar to or identical to the search term is derived from a reference quantity. A similarity measure to the search term and the probability of occurrence of the term are then determined for each term. Furthermore, a weighted distribution depending on the term is applied to the terms, and a modified probability is determined for each term. The data sets are then evaluated with respect to their relevance to the search query, and at least one partial data set quantity is output as a function of its relevance value. The invention further relates to a search engine for performing said method.
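
The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify the similarity measure, the weighting, or the relevance score, so the choices below are assumptions made for illustration only. Similarity is taken from Python's `difflib.SequenceMatcher`, the weighting is the squared similarity, the reference quantity is the word-frequency table of the data sets themselves, and the names `modified_probabilities`, `rank_data_sets`, `min_similarity`, and `top_k` are hypothetical.

```python
from collections import Counter
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; an assumed stand-in for the patent's measure."""
    return SequenceMatcher(None, a, b).ratio()


def modified_probabilities(search_term, reference_counts, min_similarity=0.7):
    """Derive the subset of similar terms and a modified probability for each."""
    total = sum(reference_counts.values())
    weighted = {}
    for term, count in reference_counts.items():
        sim = similarity(search_term, term)
        if sim < min_similarity:
            continue  # term is not in the "partial quantity"
        probability = count / total  # probability of occurrence in the reference set
        # Assumed weighting: squared similarity favours near-exact matches.
        weighted[term] = probability * sim ** 2
    norm = sum(weighted.values())
    return {t: w / norm for t, w in weighted.items()} if norm else {}


def rank_data_sets(search_term, data_sets, min_similarity=0.7, top_k=3):
    """Score each data set by the modified probabilities of the terms it contains."""
    tokenized = {name: Counter(text.lower().split()) for name, text in data_sets.items()}
    reference_counts = Counter()
    for counts in tokenized.values():
        reference_counts.update(counts)
    probs = modified_probabilities(search_term.lower(), reference_counts, min_similarity)
    scores = {}
    for name, counts in tokenized.items():
        score = sum(p * counts[t] for t, p in probs.items())
        if score > 0:
            scores[name] = score
    # Output the most relevant subset of data sets, best first.
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


if __name__ == "__main__":
    docs = {
        "a": "search engine ranking",
        "b": "serch engine typo",
        "c": "cooking recipes",
    }
    print(rank_data_sets("search", docs))
```

In this sketch a misspelled term such as "serch" still contributes to relevance, but with a lower modified probability than the exact term.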

    METHOD FOR SEARCHING IN A PLURALITY OF DATA SETS AND SEARCH ENGINE
    2.
    Invention patent application
    METHOD FOR SEARCHING IN A PLURALITY OF DATA SETS AND SEARCH ENGINE (status: in force)

    Publication No.: US20130151499A1

    Publication date: 2013-06-13

    Application No.: US13818180

    Filing date: 2011-08-17

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06F17/3064

    Abstract: The invention relates to a computer-implemented method for searching in a plurality of data sets. In the method, a search query is received and a partial quantity having terms similar to or identical to the search term is derived from a reference quantity. A similarity measure to the search term and the probability of occurrence of the term are then determined for each term. Furthermore, a weighted distribution depending on the term is applied to the terms, and a modified probability is determined for each term. The data sets are then evaluated with respect to their relevance to the search query, and at least one partial data set quantity is output as a function of its relevance value. The invention further relates to a search engine for performing said method.
