Computer system and computerized method for partitioning data for parallel processing

Granted invention patent

US06415286B1 Computer system and computerized method for partitioning data for parallel processing (lapsed)



Patent title: Computer system and computerized method for partitioning data for parallel processing
Application number: US09281984

Filing date: 1999-03-29
Publication number: US06415286B1

Publication date: 2002-07-02
Inventors: Anthony Passera, John R. Thorp, Michael J. Beckerle, Edward S. Zyszkowski
Applicants: Anthony Passera, John R. Thorp, Michael J. Beckerle, Edward S. Zyszkowski
Main classification: G06F15163
IPC classification: G06F15163


Abstract:

A computer system splits a data space to partition data between processors or processes. The data space may be split, using a decision tree, into sub-regions which need not be orthogonal to the axes defined by the data space's parameters. The decision tree can have neural networks in each of its non-terminal nodes that are trained on, and are used to partition, training data. Each terminal, or leaf, node can have a hidden-layer neural network trained on the training data that reaches that terminal node. The training of the non-terminal nodes' neural networks can be performed on one processor, and the training of the leaf nodes' neural networks can be run on separate processors. Different target values can be used for the training of the networks of different non-terminal nodes. The non-terminal node networks may be hidden-layer neural networks. Each non-terminal node may automatically send a desired ratio of the training records it receives to each of its child nodes, so that the leaf node networks each receive approximately the same number of training records. The system may automatically configure the tree to have a number of leaf nodes equal to the number of separate processors available to train leaf node networks. After the non-terminal and leaf node networks have been trained, the records of a large database can be passed through the tree for classification or for estimation of certain parameter values.
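
The ratio-based partitioning described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: a random linear projection stands in for each trained non-terminal neural network (an assumption made only to keep the example short), and all names (`build_tree`, `route`, `leaves`, `dot`) are hypothetical. The sketch shows two ideas from the abstract: split boundaries that are not aligned with the data axes, and a tree whose leaf count matches an assumed number of processors, with each node sending a desired ratio of its records to each child.

```python
import random

def dot(w, x):
    """Linear score; a stand-in for a trained non-terminal network's output."""
    return sum(wi * xi for wi, xi in zip(w, x))

def build_tree(records, n_leaves, dim, rng):
    """Recursively split `records` so each leaf gets roughly the same share.

    Assumes len(records) >= n_leaves so that no child ends up empty.
    """
    if n_leaves == 1:
        return {"leaf": True, "records": records}
    left_leaves = n_leaves // 2
    ratio = left_leaves / n_leaves  # desired fraction sent to the left child
    # Assumption: a random direction replaces the trained network. Because it
    # mixes all coordinates, the split boundary is not axis-aligned.
    weights = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    ranked = sorted(records, key=lambda r: dot(weights, r))
    cut = round(ratio * len(ranked))
    threshold = dot(weights, ranked[cut])  # score of the first right-hand record
    return {
        "leaf": False,
        "weights": weights,
        "threshold": threshold,
        "left": build_tree(ranked[:cut], left_leaves, dim, rng),
        "right": build_tree(ranked[cut:], n_leaves - left_leaves, dim, rng),
    }

def leaves(node):
    """List the leaf nodes from left to right."""
    if node["leaf"]:
        return [node]
    return leaves(node["left"]) + leaves(node["right"])

def route(node, record):
    """Return the left-to-right index of the leaf that a record reaches."""
    index = 0
    while not node["leaf"]:
        if dot(node["weights"], record) < node["threshold"]:
            node = node["left"]
        else:
            index += len(leaves(node["left"]))
            node = node["right"]
    return index

rng = random.Random(0)
data = [[rng.random() for _ in range(3)] for _ in range(1000)]

# One leaf per (assumed) available processor.
tree = build_tree(data, n_leaves=4, dim=3, rng=rng)
train_counts = [len(leaf["records"]) for leaf in leaves(tree)]
print(train_counts)  # [250, 250, 250, 250]
```

In a fuller version each leaf's records would be handed to a separate process to train that leaf's network, and `route` would then send new database records to the leaf whose network should classify them.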

