Publication number: US20240013769A1
Publication date: 2024-01-11
Application number: US18038631
Application date: 2021-11-22
Applicant: DeepMind Technologies Limited
Inventor: Ian Michael Gemp, Yoram Bachrach, Roma Patel, Christopher James Dyer
IPC: G10L13/047, G10L13/08
CPC classification number: G10L13/047, G10L13/08
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting an input vocabulary for a machine learning model using power indices. One of the methods includes computing a respective score for each of a plurality of text tokens in an initial vocabulary and then selecting the text tokens in the input vocabulary based on the respective scores.
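
As a rough illustration of the score-then-select idea in the abstract, the sketch below estimates a score per token and keeps the highest-scoring ones. It rests on assumptions that are not in the abstract: it uses the Shapley value (one well-known index from cooperative game theory) estimated by sampling random permutations, and it uses a toy additive value function. The abstract does not say which power index is used or how a set of tokens is valued, so the names `shapley_scores`, `select_vocabulary`, `toy_value`, and `TOY_WEIGHTS` are hypothetical and this should not be read as the claimed method.

```python
import random
from typing import Callable, Dict, FrozenSet, List, Sequence


def shapley_scores(
    tokens: Sequence[str],
    value_fn: Callable[[FrozenSet[str]], float],
    num_permutations: int = 200,
    seed: int = 0,
) -> Dict[str, float]:
    """Monte Carlo estimate of each token's Shapley value under value_fn.

    Assumption: value_fn maps a set of tokens to a scalar "worth"; the
    abstract does not define such a function.
    """
    rng = random.Random(seed)
    totals = {t: 0.0 for t in tokens}
    order = list(tokens)
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition = set()
        prev = value_fn(frozenset(coalition))
        for tok in order:
            # Marginal contribution of tok when it joins the tokens before it.
            coalition.add(tok)
            cur = value_fn(frozenset(coalition))
            totals[tok] += cur - prev
            prev = cur
    return {t: s / num_permutations for t, s in totals.items()}


def select_vocabulary(scores: Dict[str, float], k: int) -> List[str]:
    """Keep the k highest-scoring tokens (ties broken alphabetically)."""
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


# Hypothetical toy value function: each token adds a fixed weight.
TOY_WEIGHTS = {"the": 0.1, "cat": 1.0, "sat": 0.8, "zzz": 0.0}


def toy_value(coalition: FrozenSet[str]) -> float:
    return sum(TOY_WEIGHTS[t] for t in coalition)


scores = shapley_scores(list(TOY_WEIGHTS), toy_value)
vocab = select_vocabulary(scores, k=2)
print(vocab)
```

With an additive value function like this one, each token's estimated score simply matches its weight, so the example only demonstrates the mechanics. In practice the value function would presumably depend on how a model performs with a given subset of tokens, which is where a power index becomes informative.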