- 专利标题: Multi-task machine learning architectures and training procedures
-
申请号: US16443440申请日: 2019-06-17
-
公开(公告)号: US12008459B2公开(公告)日: 2024-06-11
- 发明人: Weizhu Chen , Pengcheng He , Xiaodong Liu , Jianfeng Gao
- 申请人: Microsoft Technology Licensing, LLC
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人地址: US WA Redmond
- 代理机构: Rainier Patents, P.S.
- 主分类号: G06N3/088
- IPC分类号: G06N3/088 ; G06F40/20 ; G06N3/045 ; G06N3/047
摘要:
This document relates to architectures and training procedures for multi-task machine learning models, such as neural networks. One example method involves providing a multi-task machine learning model having one or more shared layers and two or more task-specific layers. The method can also involve performing a pretraining stage on the one or more shared layers using one or more unsupervised prediction tasks. The method can also involve performing a tuning stage on the one or more shared layers and the two or more task-specific layers using respective task-specific objectives.
公开/授权文献
信息查询