Data mining structure
    1.
    Patent application
    Data mining structure (status: under examination, published)

    Publication No.: US20050021489A1

    Publication date: 2005-01-27

    Application No.: US10624278

    Filing date: 2003-07-22

    IPC classification: G06F7/00

    Abstract: A mining structure is created which contains processed data from a data set. This data may be used to train one or more models. In one embodiment, in addition to selecting the data from the data set to be used by a model, processing parameters are set when the mining structure is created. For example, the discretization of a continuous variable into buckets, the number of buckets, and/or the sub-range corresponding to each bucket may be set at that time. Processing the mining structure causes data from the data set to be processed and stored in the mining structure. After processing, the mining structure can be used by one or more models.
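The bucket discretization described in this abstract can be illustrated with a minimal sketch. It assumes equal-width buckets and a plain dictionary standing in for the mining structure; the names `equal_width_edges`, `bucket_index`, and `structure` are hypothetical and do not come from the patent.

```python
from bisect import bisect_right

def equal_width_edges(values, n_buckets):
    """Return the interior bucket boundaries for equal-width discretization."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_buckets
    return [lo + i * width for i in range(1, n_buckets)]

def bucket_index(value, edges):
    """Map a continuous value to the index of the bucket that contains it."""
    return bisect_right(edges, value)

# Hypothetical continuous column from the data set.
ages = [18, 25, 31, 47, 52, 66]

# The bucket count is fixed once, when the structure is created ...
edges = equal_width_edges(ages, 3)

# ... and the processed (discretized) data is stored for reuse by any model.
structure = {"edges": edges, "buckets": [bucket_index(a, edges) for a in ages]}
```

Because the boundaries are stored with the processed data, every model trained from `structure` sees the same discretization.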

    Drill-through queries from data mining model content
    2.
    Patent application
    Drill-through queries from data mining model content (status: lapsed)

    Publication No.: US20050021482A1

    Publication date: 2005-01-27

    Application No.: US10611119

    Filing date: 2003-06-30

    Abstract: A drill-through feature provides universal drill-through from a trained mining model to the model's source data. So that a user or application can obtain model content information on a given node of a model, a universal function is provided: the user specifies the node for a model and data set, and the cases underlying that node for that model and data set are returned. Where only a sampling of the cases represented in the node is requested, a sampling of the underlying cases may be provided.
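A minimal sketch of the drill-through idea follows. It assumes each model node can be represented as a predicate over cases; the function `drill_through`, the node names, and the sample data are hypothetical illustrations, not the interface claimed in the patent.

```python
import random

def drill_through(node_predicates, node_id, cases, sample_size=None, seed=0):
    """Return the cases underlying a node, or a random sample of them."""
    matches = [case for case in cases if node_predicates[node_id](case)]
    if sample_size is not None and sample_size < len(matches):
        # Only a sampling of the underlying cases was requested.
        return random.Random(seed).sample(matches, sample_size)
    return matches

# Hypothetical nodes of a trained model, expressed as predicates.
node_predicates = {
    "young": lambda case: case["age"] < 30,
    "older": lambda case: case["age"] >= 30,
}

# Hypothetical source data set.
cases = [
    {"id": 1, "age": 22},
    {"id": 2, "age": 41},
    {"id": 3, "age": 27},
    {"id": 4, "age": 35},
]

young_cases = drill_through(node_predicates, "young", cases)
one_older_case = drill_through(node_predicates, "older", cases, sample_size=1)
```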

    Drill-through queries from data mining model content
    3.
    Granted patent
    Drill-through queries from data mining model content (status: lapsed)

    Publication No.: US07188090B2

    Publication date: 2007-03-06

    Application No.: US10611119

    Filing date: 2003-06-30

    IPC classification: G06F17/00 G06F17/20

    Abstract: A drill-through feature provides universal drill-through from a trained mining model to the model's source data. So that a user or application can obtain model content information on a given node of a model, a universal function is provided: the user specifies the node for a model and data set, and the cases underlying that node for that model and data set are returned. Where only a sampling of the cases represented in the node is requested, a sampling of the underlying cases may be provided.

    Systems and methods for mining model accuracy display for multiple state prediction
    4.
    Granted patent
    Systems and methods for mining model accuracy display for multiple state prediction (status: in force)

    Publication No.: US07379843B2

    Publication date: 2008-05-27

    Application No.: US10932583

    Filing date: 2004-09-01

    IPC classification: G06F15/00

    CPC classification: G06N7/00

    Abstract: Systems and methods are provided for producing a mining model accuracy display that depicts the model's accuracy at predicting a state for a multiple-state variable. For each case, the model predicts a state and provides an associated probability. Points are graphed such that one coordinate of each point corresponds to a number N of cases and the other corresponds to the number of correct predictions among the N cases with the highest probabilities.
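The points on such an accuracy display can be computed as in the sketch below. It assumes each test case is a tuple of predicted state, predicted probability, and actual state; the function name `accuracy_points` and the sample data are hypothetical.

```python
def accuracy_points(predictions):
    """Return (N, correct predictions among the top N cases by probability)."""
    # Rank cases from most to least confident prediction.
    ranked = sorted(predictions, key=lambda p: p[1], reverse=True)
    points, correct = [], 0
    for n, (predicted, _probability, actual) in enumerate(ranked, start=1):
        correct += predicted == actual  # True counts as 1
        points.append((n, correct))
    return points

# Hypothetical test cases: (predicted state, probability, actual state).
test_cases = [
    ("A", 0.9, "A"),
    ("B", 0.6, "C"),
    ("C", 0.8, "C"),
    ("A", 0.4, "A"),
]

points = accuracy_points(test_cases)
```

Plotting `points` with N on one axis gives a curve that tracks the diagonal for as long as the most confident predictions are correct.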

    System and method for mining model accuracy display
    5.
    Patent application
    System and method for mining model accuracy display (status: under examination, published)

    Publication No.: US20070010966A1

    Publication date: 2007-01-11

    Application No.: US11519317

    Filing date: 2006-09-11

    IPC classification: G06F17/18 G06F19/00

    CPC classification: G06F17/18 G06F16/2465

    Abstract: Systems and methods are provided for producing displays of the accuracy of data mining or statistical models that produce associative predictions. For all cases in a testing data set, the model makes predictions and provides associated probabilities. The cases are sorted by the probability that their predictions are accurate, and the accuracy of the model is graphed over various subsets containing the highest-probability cases as evaluated by the model. Where a number of probabilities are presented for the predictions in a basket of predictions, those probabilities are combined to yield a probability score for the entire basket. Additionally, the accuracy of a model over different basket sizes may be graphed. The accuracy graph may also be produced for any model that makes predictions, by sorting cases by the probability of an accurate prediction and graphing the accuracy of the model over various subsets of the data containing the highest-probability cases.
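The basket scoring and subset accuracy described here might look like the sketch below. The abstract does not say how the probabilities in a basket are combined, so the arithmetic mean used here is purely an assumption; `basket_score`, `accuracy_by_top_fraction`, and the sample baskets are likewise hypothetical.

```python
def basket_score(basket):
    """Combine a basket's prediction probabilities (assumed: arithmetic mean)."""
    return sum(probability for probability, _ in basket) / len(basket)

def accuracy_by_top_fraction(baskets, fractions=(0.25, 0.5, 1.0)):
    """Accuracy over the highest-scoring fraction of baskets, per fraction."""
    ranked = sorted(baskets, key=basket_score, reverse=True)
    results = []
    for fraction in fractions:
        top = ranked[: max(1, int(fraction * len(ranked)))]
        total = sum(len(basket) for basket in top)
        correct = sum(is_correct for basket in top for _, is_correct in basket)
        results.append((fraction, correct / total))
    return results

# Hypothetical baskets: each prediction is (probability, was it correct?).
baskets = [
    [(0.9, True), (0.7, True)],
    [(0.6, True), (0.2, False)],
    [(0.3, False), (0.1, False)],
    [(0.8, True), (0.4, False)],
]

curve = accuracy_by_top_fraction(baskets)
```

A product of probabilities or another combination rule could be substituted in `basket_score` without changing the rest of the sketch.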

    Systems and methods for mining model accuracy display for multiple state prediction
    6.
    Patent application
    Systems and methods for mining model accuracy display for multiple state prediction (status: in force)

    Publication No.: US20050027478A1

    Publication date: 2005-02-03

    Application No.: US10932583

    Filing date: 2004-09-01

    CPC classification: G06N7/00

    Abstract: Systems and methods are provided for producing a mining model accuracy display that depicts the model's accuracy at predicting a state for a multiple-state variable. For each case, the model predicts a state and provides an associated probability. Points are graphed such that one coordinate of each point corresponds to a number N of cases and the other corresponds to the number of correct predictions among the N cases with the highest probabilities.

    System and method for mining model accuracy display
    8.
    Granted patent
    System and method for mining model accuracy display (status: in force)

    Publication No.: US07124054B2

    Publication date: 2006-10-17

    Application No.: US10186052

    Filing date: 2002-06-28

    IPC classification: G06E1/00

    CPC classification: G06F17/18 G06F17/30539

    Abstract: Systems and methods are provided for producing displays of the accuracy of data mining or statistical models that produce associative predictions. For all cases in a testing data set, the model makes predictions and provides associated probabilities. The cases are sorted by the probability that their predictions are accurate, and the accuracy of the model is graphed over various subsets containing the highest-probability cases as evaluated by the model. Where a number of probabilities are presented for the predictions in a basket of predictions, those probabilities are combined to yield a probability score for the entire basket. Additionally, the accuracy of a model over different basket sizes may be graphed. The accuracy graph may also be produced for any model that makes predictions, by sorting cases by the probability of an accurate prediction and graphing the accuracy of the model over various subsets of the data containing the highest-probability cases.

    Systems and methods for mining model accuracy display for multiple state prediction

    Publication No.: US06810357B2

    Publication date: 2004-10-26

    Application No.: US10185049

    Filing date: 2002-06-28

    IPC classification: G06F15/00

    CPC classification: G06N7/00

    Abstract: Systems and methods are provided for producing a mining model accuracy display that depicts the model's accuracy at predicting a state for a multiple-state variable. For each case, the model predicts a state and provides an associated probability. Points are graphed such that one coordinate of each point corresponds to a number N of cases and the other corresponds to the number of correct predictions among the N cases with the highest probabilities.

    System and method for visualization of categories
    10.
    Patent application
    System and method for visualization of categories (status: in force)

    Publication No.: US20050108196A1

    Publication date: 2005-05-19

    Application No.: US10955738

    Filing date: 2004-09-30

    IPC classification: G06T11/20 G06F17/30

    Abstract: Distribution displays for categories are provided which illuminate the distribution of continuous attributes over all cases in a category, and which provide a histogram of the population of the different states of categorical attributes. An array of such displays by attribute (in one dimension) and category (in another dimension) may be provided. Category diagram displays are also provided for visualizing the different categories and their distributions, populations, and similarities. These are conveyed through the shading of nodes and edges, which represent categories and the relationships between pairs of categories, and through the proximity of nodes.
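The data behind the attribute-by-category array of histograms can be sketched as follows. It assumes each case is a dictionary with a `category` key and categorical attribute values; the function `state_histograms` and the sample cases are hypothetical and only illustrate the counting step, not the display itself.

```python
from collections import Counter, defaultdict

def state_histograms(cases, attributes):
    """Count attribute states for every (category, attribute) cell of the array."""
    grid = defaultdict(Counter)
    for case in cases:
        for attribute in attributes:
            grid[(case["category"], attribute)][case[attribute]] += 1
    return grid

# Hypothetical cases already assigned to categories.
cases = [
    {"category": 1, "color": "red", "size": "S"},
    {"category": 1, "color": "red", "size": "L"},
    {"category": 2, "color": "blue", "size": "L"},
]

grid = state_histograms(cases, ["color", "size"])
```

Each cell of `grid` holds the state counts that one histogram in the array would draw.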