MULTI-TASK MACHINE LEARNING ARCHITECTURES AND TRAINING PROCEDURES

    公开(公告)号:US20200334520A1

    公开(公告)日:2020-10-22

    申请号:US16443440

    申请日:2019-06-17

    IPC分类号: G06N3/04 G06N3/08 G06F17/27

    摘要: This document relates to architectures and training procedures for multi-task machine learning models, such as neural networks. One example method involves providing a multi-task machine learning model having one or more shared layers and two or more task-specific layers. The method can also involve performing a pretraining stage on the one or more shared layers using one or more unsupervised prediction tasks. The method can also involve performing a tuning stage on the one or more shared layers and the two or more task-specific layers using respective task-specific objectives