- 专利标题: KNOWLEDGE DISTILLATION USING DEEP CLUSTERING
-
申请号: US17116117申请日: 2020-12-09
-
公开(公告)号: US20220180206A1公开(公告)日: 2022-06-09
- 发明人: Takashi Fukuda
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 主分类号: G06N3/08
- IPC分类号: G06N3/08 ; G06F16/28
摘要:
Methods and systems for training a neural network include clustering a full set of training data samples into specialized training clusters. Specialized teacher neural networks are trained using respective specialized training clusters of the specialized training clusters. Soft labels are generated for the full set of training data samples using the specialized teacher neural networks. A student model is trained using the full set of training data samples, the specialized training clusters, and the soft labels.
信息查询