发明申请
- 专利标题: GENERATING A PREDICTIVE MODEL FROM MULTIPLE DATA SOURCES
- 专利标题(中): 从多个数据源生成预测模型
-
申请号: US13048536申请日: 2011-03-15
-
公开(公告)号: US20120239613A1公开(公告)日: 2012-09-20
- 发明人: Marius I. Danciu , Fan Li , Michael McRoberts , Jing-Yun Shyr , Damir Spisic , Jing Xu
- 申请人: Marius I. Danciu , Fan Li , Michael McRoberts , Jing-Yun Shyr , Damir Spisic , Jing Xu
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 主分类号: G06F7/00
- IPC分类号: G06F7/00 ; G06F17/00 ; G06F17/30
摘要:
Techniques are disclosed for generating an ensemble model from multiple data sources. In one embodiment, the ensemble model is generated using a global validation sample, a global holdout sample and base models generated from the multiple data sources. An accuracy value may be determined for each base model, on the basis of the global validation dataset. The ensemble model may be generated from a subset of the base models, where the subset is selected on the basis of the determined accuracy values.
公开/授权文献
- US08990149B2 Generating a predictive model from multiple data sources 公开/授权日:2015-03-24
信息查询