- 专利标题: SYSTEMS AND METHODS FOR CROSS-LINGUAL TRANSFER LEARNING
-
申请号: US18309330申请日: 2023-04-28
-
公开(公告)号: US20240330603A1公开(公告)日: 2024-10-03
- 发明人: Lifu Tu , Yingbo Zhou , Caiming Xiong , Jin Qu
- 申请人: Salesforce, Inc.
- 申请人地址: US CA San Francisco
- 专利权人: Salesforce, Inc.
- 当前专利权人: Salesforce, Inc.
- 当前专利权人地址: US CA San Francisco
- 主分类号: G06F40/47
- IPC分类号: G06F40/47 ; G06F40/284 ; G06F40/30 ; G06F40/51
摘要:
Embodiments described herein provide a method of training a language model by tuning a prompt. The method comprises masking tokens of first and second conversational texts which have the same semantic meaning but in different languages (e.g., a translation). The masked texts are input to a language model with a prepended soft prompt. The language model generates respective predicted outputs. A loss objective is computed including a masked language model loss. The prompt is updated based on the computed loss objective via backpropagation while keeping the language model frozen.
信息查询