RECOMMENDATIONS UTILIZING META-DATA BASED PAIR-WISE LIFT PREDICTIONS
    21.
    发明申请
    RECOMMENDATIONS UTILIZING META-DATA BASED PAIR-WISE LIFT PREDICTIONS 审中-公开
    推荐使用基于元数据的配对提升预测

    公开(公告)号:US20080097821A1

    公开(公告)日:2008-04-24

    申请号:US11552467

    申请日:2006-10-24

    IPC分类号: G07G1/00

    CPC分类号: G06Q30/02 G06Q30/0201

    摘要: The subject disclosure pertains to systems and methods for facilitating generation of item recommendations based at least in part upon pair-wise lift. Pair-wise lift is a measure of correlation between a pair of items and is generally calculated based upon past usage data. If usage data is insufficient or unavailable, pair-wise lift for a pair of items can be estimated based upon metadata associated with the items. In other aspects, pair-wise lift can be used to generate an explanation for recommended items. An explanation for an item recommendation can be based upon common metadata features associated with the item pair. The relative impact each metadata feature has on predicted pair-wise lift can be evaluated to determine the common feature(s) most likely to have caused the item to be recommended.

    摘要翻译: 本发明涉及用于至少部分地基于成对提升来促进产生项目建议的系统和方法。 成对升降是一对物品之间的相关性的度量,并且通常基于过去的使用数据计算。 如果使用数据不足或不可用,则可以基于与项目相关联的元数据来估计一对项目的成对提升。 在其他方面,可以使用成对提升来产生推荐项目的说明。 项目建议的解释可以基于与项目对相关联的公共元数据特征。 可以评估每个元数据特征对预测的成对提升的相对影响,以确定最可能导致该项目被推荐的共同特征。

    Handwriting recognition with mixtures of bayesian networks
    25.
    发明授权
    Handwriting recognition with mixtures of bayesian networks 有权
    手写识别与贝叶斯网络混合

    公开(公告)号:US07200267B1

    公开(公告)日:2007-04-03

    申请号:US11324444

    申请日:2005-12-30

    IPC分类号: G06K9/00 G06K9/62

    CPC分类号: G06K9/00422 G06K9/6296

    摘要: The invention performs handwriting recognition using mixtures of Bayesian networks. A mixture of Bayesian networks (MBN) consists of plural hypothesis-specific Bayesian networks (HSBNs) having possibly hidden and observed variables. A common external hidden variable is associated with the MBN, but is not included in any of the HSBNs. Each HSBN models the world under the hypothesis that the common external hidden variable is in a corresponding one of its states. The MBNs encode the probabilities of observing the sets of visual observations corresponding to a handwritten character. Each of the HSBNs encodes the probabilities of observing the sets of visual observations corresponding to a handwritten character and given a hidden common variable being in a particular state.

    摘要翻译: 本发明使用贝叶斯网络的混合来执行手写识别。 贝叶斯网络(MBN)的混合由多个具有隐藏和观察变量的假设特定贝叶斯网络(HSBN)组成。 常见的外部隐藏变量与MBN相关联,但不包括在任何HSBN中。 每个HSBN在假设下共同的外部隐藏变量处于相应的一个状态的模型中模拟世界。 MBN编码观察对应于手写字符的视觉观察组的概率。 每个HSBN编码观察对应于手写字符的视觉观察组的概率,并给出处于特定状态的隐藏的公共变量。

    System and method for approximating probabilities using a decision tree
    26.
    发明授权
    System and method for approximating probabilities using a decision tree 有权
    使用决策树近似概率的系统和方法

    公开(公告)号:US06718315B1

    公开(公告)日:2004-04-06

    申请号:US09740067

    申请日:2000-12-18

    IPC分类号: G06F1518

    CPC分类号: G06N99/005

    摘要: Disclosed is a system for approximating conditional probabilities using an annotated decision tree where predictor values that did not exist in training data for the system are tracked, stored, and referenced to determine if statistical aggregation should be invoked. Further disclosed is a system for storing statistics for deriving a non-leaf probability corresponding to predictor values, and a system for aggregating such statistics to approximate conditional probabilities.

    摘要翻译: 公开了一种使用注释决策树近似条件概率的系统,其中跟踪,存储和引用系统的训练数据中不存在的预测值,以确定是否应调用统计聚合。 进一步披露的是用于存储用于导出与预测值相对应的非叶概率的统计的系统,以及用于将这种统计量聚合以近似条件概率的系统。

    Goal-oriented clustering
    27.
    发明授权
    Goal-oriented clustering 有权
    面向目标的聚类

    公开(公告)号:US06694301B1

    公开(公告)日:2004-02-17

    申请号:US09540255

    申请日:2000-03-31

    IPC分类号: G06N502

    摘要: Clustering for purposes of data visualization and making predictions is disclosed. Embodiments of the invention are operable on a number of variables that have a predetermined representation. The variables include input-only variables, output-only variables, and both input-and-output variables. Embodiments of the invention generate a model that has a bottleneck architecture. The model includes a top layer of nodes of at least the input-only variables, one or more middle layer of hidden nodes, and a bottom layer of nodes of the output-only and the input-and-output variables. At least one cluster is determined from this model. The model can be a probabilistic neural network and/or a Bayesian network.

    摘要翻译: 公开了用于数据可视化和进行预测的聚类。 本发明的实施例可以对具有预定表示的多个变量进行操作。 变量包括仅输入变量,仅输出变量,以及输入和输出变量。 本发明的实施例生成具有瓶颈架构的模型。 该模型包括至少仅输入变量,一个或多个中间层隐藏节点的顶层,以及仅输出和输入和输出变量的节点的底层。 从该模型确定至少一个群集。 该模型可以是概率神经网络和/或贝叶斯网络。

    Collaborative filtering with mixtures of bayesian networks
    28.
    发明授权
    Collaborative filtering with mixtures of bayesian networks 有权
    使用贝叶斯网络混合进行协同过滤

    公开(公告)号:US06496816B1

    公开(公告)日:2002-12-17

    申请号:US09220199

    申请日:1998-12-23

    IPC分类号: G06N302

    摘要: One aspect of the invention is the construction of mixtures of Bayesian networks. Another aspect of the invention is the use of such mixtures of Bayesian networks to perform inferencing. A mixture of Bayesian networks (MBN) consists of plural hypothesis-specific Bayesian networks (HSBNs) having possibly hidden and observed variables. A common external hidden variable is associated with the MBN, but is not included in any of the HSBNs. The number of HSBNs in the MBN corresponds to the number of states of the common external hidden variable, and each HSBN is based upon the hypothesis that the common external hidden variable is in a corresponding one of those states. In one mode of the invention, the MBN having the highest MBN score is selected for use in performing inferencing. In another mode of the invention, some or all of the MBNs are retained as a collection of MBNs which perform inferencing in parallel, their outputs being weighted in accordance with the corresponding MBN scores and the MBN collection output being the weighted sum of all the MBN outputs. In one application of the invention, collaborative filtering may be performed by defining the observed variables to be choices made among a sample of users and the hidden variables to be the preferences of those users.

    摘要翻译: 本发明的一个方面是构建贝叶斯网络的混合物。 本发明的另一方面是使用贝叶斯网络的这种混合来执行推理。 贝叶斯网络(MBN)的混合由多个具有隐藏和观察变量的假设特定贝叶斯网络(HSBN)组成。 常见的外部隐藏变量与MBN相关联,但不包括在任何HSBN中。 MBN中的HSBN的数量对应于公共外部隐藏变量的状态数,并且每个HSBN基于公共外部隐藏变量在这些状态中的相应一个状态中的假设。 在本发明的一种模式中,选择具有最高MBN分数的MBN用于执行推定。 在本发明的另一模式中,一些或所有MBN被保留为并行执行推论的MBN的集合,其输出根据相应的MBN分数加权,并且MBN收集输出是所有MBN的加权和 输出。 在本发明的一个应用中,可以通过将观察到的变量定义为在用户样本中作出的选择和作为这些用户的偏好的隐藏变量来执行协同过滤。

    Architecture for automated data analysis
    29.
    发明授权
    Architecture for automated data analysis 有权
    自动数据分析架构

    公开(公告)号:US06330563B1

    公开(公告)日:2001-12-11

    申请号:US09298717

    申请日:1999-04-23

    IPC分类号: G06F1730

    摘要: An architecture for automated data analysis. In one embodiment, a computerized system comprising an automated problem formulation layer, a first learning engine, and a second learning engine. The automated problem formulation layer receives a data set. The data set has a plurality of records, where each record has a value for each of a plurality of raw transactional variables. The layer abstracts the raw transactional variables into cooked transactional variables. The first learning engine generates a model for the cooked transactional variables, while the second learning engine generates a model for the raw transactional variables.

    摘要翻译: 用于自动数据分析的架构。 在一个实施例中,包括自动化问题制定层,第一学习引擎和第二学习引擎的计算机化系统。 自动化问题制定层接收数据集。 数据集具有多个记录,其中每个记录具有多个原始事务变量中的每一个的值。 该层将原始事务变量抽象为熟的事务变量。 第一个学习引擎为煮熟的事务变量生成模型,而第二个学习引擎生成原始事务变量的模型。

    Belief networks with decision graphs
    30.
    发明授权
    Belief networks with decision graphs 失效
    信念网络与决策图

    公开(公告)号:US6154736A

    公开(公告)日:2000-11-28

    申请号:US902759

    申请日:1997-07-30

    CPC分类号: G06N7/005 G06N5/04

    摘要: An improved belief network is provided for assisting users in making decisions. The improved belief network utilizes a decision graph in each of its nodes to store the probabilities for that node. A decision graph is a much more flexible and efficient data structure for storing probabilities than either a tree or a table, because a decision graph can reflect any equivalence relationships between the probabilities and because leaf nodes having equivalent probabilities need not be duplicated. Additionally, by being able to reflect an equivalency relationship, multiple paths (or combinations of the parent values) refer to the same probability, which yields a more accurate probability.

    摘要翻译: 提供了一个改进的信念网络,用于帮助用户作出决策。 改进的信念网络利用其每个节点中的决策图来存储该节点的概率。 因为决策图可以反映概率之间的任何等价关系,并且因为具有等效概率的叶节点不需要重复,所以决策图是一种比树或表存储概率更灵活和高效的数据结构。 另外,通过能够反映等价关系,多个路径(或父值的组合)指的是相同的概率,这产生更准确的概率。