Systems and methods for distilled BERT-based training model for text classification

Granted invention patent

US11922303B2 Systems and methods for distilled BERT-based training model for text classification (in force)

Patent title: Systems and methods for distilled BERT-based training model for text classification
Application number: US16877339

Filing date: 2020-05-18
Publication number: US11922303B2

Publication date: 2024-03-05
Inventors: Wenhao Liu, Ka Chun Au, Shashank Harinath, Bryan McCann, Govardana Sachithanandam Ramachandran, Alexis Roos, Caiming Xiong
Applicant: Salesforce.com, Inc.
Applicant address: US CA San Francisco
Patentee: Salesforce, Inc.
Current patentee: Salesforce, Inc.
Current patentee address: US CA San Francisco
Agency: Haynes and Boone LLP
Main classification: G06N3/00
IPC classification: G06N3/00; G06F40/40; G06N3/045; G06N3/08

Abstract:

Embodiments described herein provide a training mechanism that transfers the knowledge from a trained BERT model into a much smaller model that approximates the behavior of BERT. Specifically, the BERT model may be treated as a teacher model, and a much smaller student model may be trained using the same inputs as the teacher model together with the output from the teacher model. In this way, the student model can be trained in a much shorter time than the BERT teacher model while achieving performance comparable to that of BERT.
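
The teacher-student idea in the abstract can be illustrated with a small sketch. This is not taken from the patent text: it assumes a common formulation of knowledge distillation in which the student is penalized by the cross-entropy between its temperature-softened output distribution and the teacher's. The function names, the temperature value, and the example logits are all hypothetical.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Cross-entropy of the student's softened distribution against the teacher's.

    Assumption: a soft-target loss is used; the abstract only says the student
    is trained on the teacher's inputs and outputs.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(p_teacher, p_student))

# Hypothetical logits for one input text over three classes.
teacher_logits = [4.0, 1.0, -2.0]   # output of the (large) teacher model
close_student = [3.5, 1.2, -1.8]    # student that roughly agrees with the teacher
far_student = [-2.0, 1.0, 4.0]      # student that disagrees with the teacher

loss_close = distillation_loss(close_student, teacher_logits)
loss_far = distillation_loss(far_student, teacher_logits)
```

Under these assumptions, a student whose outputs track the teacher's receives a lower loss than one that disagrees, so minimizing this loss over the teacher's inputs pushes the student toward the teacher's behavior.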

Published/granted documents

US20210150340A1 Systems and Methods for Distilled BERT-Based Training Model for Text Classification Publication/grant date: 2021-05-20

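IPC classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computing arrangements based on specific computational models
G06N3/00	Computing arrangements based on biological models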