-
公开(公告)号:US20250094709A1
公开(公告)日:2025-03-20
申请号:US18827108
申请日:2024-09-06
Applicant: Samsung Electronics Co., Ltd.
Inventor: Shikhar TULI , Chi-Heng Lin , Yen-Chang Hsu , Yilin Shen , Hongxia Jin
IPC: G06F40/284
Abstract: A method for performing multi-token prediction by an apparatus includes receiving, from an artificial intelligence (AI) assistance device, a request for an output token sequence that is subsequent to an input token sequence indicated by the request, predicting, by a trained machine learning model, a plurality of candidate output tokens, estimating joint probability distributions of one or more combinations of the plurality of candidate output tokens, calculating joint probabilities of the one or more combinations by masking the joint probability distributions with a co-occurrence weighted mask, determining, based on the joint probabilities, whether to reduce the number of candidate output tokens included in each combination of the one or more combinations, identifying, based on the joint probabilities, a combination of the one or more combinations as the output token sequence, and outputting, to the AI assistance device, a response to the request, the response comprising the output token sequence.