-
公开(公告)号:US20250013432A1
公开(公告)日:2025-01-09
申请号:US18218448
申请日:2023-07-05
Applicant: Google LLC
Inventor: Vinayak Anand Gokhale , Matthew Leever Hedlund , Rahul Nagarajan , Naveen Muralimanohar , Shriram Nagarajan
Abstract: Aspects of the disclosed technology include techniques and mechanisms for using a custom scratchpad memory for partial dot product reductions. The custom scratchpad memory may be a special purpose memory that is dedicated to receiving and storing partial dot products determined by matrix multiplier units. Each partial dot product may correspond to tiles of a resultant matrix, where the resultant matrix is the product of matrix multiplication that can use a first matrix representing a user query as a left-hand side operand and a second matrix representing a trained model containing data that may be used to respond to the user query as a right-hand side operand. The custom scratchpad memory may append the tiles determined by the matrix multiplication, where the appended tiles may create the resultant matrix. Custom scratchpad memory may write the resultant matrix to general purpose memory, where it may be used to respond to the user query.