Efficient Training of Embedding Models Using Negative Cache

Invention Publication

US20230153700A1 Efficient Training of Embedding Models Using Negative Cache 审中-公开

Please log in to see more content

Patent Title: Efficient Training of Embedding Models Using Negative Cache
Application No.: US17983130

Application Date: 2022-11-08
Publication No.: US20230153700A1

Publication Date: 2023-05-18
Inventor: Erik Michael Lindgren , Sashank Jakkam Reddi , Ruiqi Guo , Sanjiv Kumar
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G06N20/20
IPC: G06N20/20 ; G06F12/0875 ; G06F12/0891

Efficient Training of Embedding Models Using Negative Cache

Abstract:

Provided are systems and methods which more efficiency train embedding models through the use of a cache of item embeddings for candidate items over a number of training iterations. The cached item embeddings can be “stale” embeddings that were generated by a previous version of the model at a previous training iteration. Specifically, at each iteration, the (potentially stale) item embeddings included in the cache can be used when generating similarity scores that are the basis for sampling a number of items to use as negatives in the current training iteration. For example, a Gumbel-Max sampling approach can be used to sample negative items that will enable an approximation of a true gradient. New embeddings can be generated for the sampled negative items and can be used to train the model at the current iteration.

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习
G06N20/20	.•集成学习