发明授权
- 专利标题: Systems and methods for distilled BERT-based training model for text classification
-
申请号: US16877339申请日: 2020-05-18
-
公开(公告)号: US11922303B2公开(公告)日: 2024-03-05
- 发明人: Wenhao Liu , Ka Chun Au , Shashank Harinath , Bryan McCann , Govardana Sachithanandam Ramachandran , Alexis Roos , Caiming Xiong
- 申请人: Salesforce.com, Inc.
- 申请人地址: US CA San Francisco
- 专利权人: Salesforce, Inc.
- 当前专利权人: Salesforce, Inc.
- 当前专利权人地址: US CA San Francisco
- 代理机构: Haynes and Boone LLP
- 主分类号: G06N3/00
- IPC分类号: G06N3/00 ; G06F40/40 ; G06N3/045 ; G06N3/08
摘要:
Embodiments described herein provides a training mechanism that transfers the knowledge from a trained BERT model into a much smaller model to approximate the behavior of BERT. Specifically, the BERT model may be treated as a teacher model, and a much smaller student model may be trained using the same inputs to the teacher model and the output from the teacher model. In this way, the student model can be trained within a much shorter time than the BERT teacher model, but with comparable performance with BERT.
公开/授权文献
信息查询
IPC分类:
G | 物理 |
G06 | 计算;推算或计数 |
G06N | 基于特定计算模型的计算机系统 |
G06N3/00 | 基于生物学模型的计算机系统 |