INFORMATION RETRIEVAL SYSTEM AND METHOD USING A BAYESIAN ALGORITHM BASED ON PROBABILISTIC SIMILARITY SCORES
    1.
    发明申请
    INFORMATION RETRIEVAL SYSTEM AND METHOD USING A BAYESIAN ALGORITHM BASED ON PROBABILISTIC SIMILARITY SCORES 审中-公开
    信息检索系统和使用基于概率相似性的贝叶斯算法的方法

    公开(公告)号:US20100223258A1

    公开(公告)日:2010-09-02

    申请号:US12095637

    申请日:2006-12-01

    IPC分类号: G06F17/30

    摘要: An algorithm is provided which uses a model-based concept of a cluster and scores items using a score representative of the probability that a given item has been generated from the same distribution as one or more query items. The items are represented by a feature vector xi comprising a plurality of digitally represented features xij the method including: receiving an input identifying the query items; for each of the other items computing a score which is a function of a conditional probability of the feature vectors xij of the query items being generated from the generating distribution formula (I) given that the respective other item is generated from the generating distribution formula (I) and returning a score for each of the other items, a list of some or all of the other items, sorted by their respective score, or a list of n other items which have the highest score.

    摘要翻译: 提供了一种算法,其使用群集的基于模型的概念,并且使用表示从与一个或多个查询项目相同的分布生成给定项目的概率的分数来评分项目。 这些项目由包括多个数字表示的特征的特征向量xi表示,该方法包括:接收标识查询项目的输入; 对于每个其他项目,计算作为从生成分配公式(I)生成的查询项目的特征向量xij的条件概率的函数的分数,假设从生成分配公式生成了相应的其他项目 I)并返回每个其他项目的分数,按其各自分数排序的一些或所有其他项目的列表,或者具有最高分数的n个其他项目的列表。