发明授权
- 专利标题: Leveraging unlabeled data with a probabilistic graphical model
- 专利标题(中): 利用概率图形模型利用未标记的数据
-
申请号: US11170989申请日: 2005-06-30
-
公开(公告)号: US07937264B2公开(公告)日: 2011-05-03
- 发明人: Christopher J. C. Burges , John C. Platt
- 申请人: Christopher J. C. Burges , John C. Platt
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理机构: Lee & Hayes, PLLC
- 主分类号: G06F17/27
- IPC分类号: G06F17/27
摘要:
A general probabilistic formulation referred to as ‘Conditional Harmonic Mixing’ is provided, in which links between classification nodes are directed, a conditional probability matrix is associated with each link, and where the numbers of classes can vary from node to node. A posterior class probability at each node is updated by minimizing a divergence between its distribution and that predicted by its neighbors. For arbitrary graphs, as long as each unlabeled point is reachable from at least one training point, a solution generally always exists, is unique, and can be found by solving a sparse linear system iteratively. In one aspect, an automated data classification system is provided. The system includes a data set having at least one labeled category node in the data set. A semi-supervised learning component employs directed arcs to determine the label of at least one other unlabeled category node in the data set.
公开/授权文献
信息查询