PROGRAMMABLE COMPUTE ENGINE HAVING TRANSPOSE OPERATIONS
Abstract:
A technique for executing transpose and compute operations may include retrieving a set of machine instructions from an instruction buffer of a data processor. The instruction buffer has multiple entries, and each entry stores one machine instruction. A machine instruction from the set is executed to transpose a submatrix of an input tensor and to perform computations on column elements of the submatrix. The transpose operation and the computational operations are thus combined into a single machine instruction.
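The behavior described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the claimed hardware: the names `TransposeComputeInstr`, `execute`, and `instruction_buffer` are hypothetical, the "computation" is assumed to be an elementwise function applied to each column element, and the input tensor is assumed to be two-dimensional.

```python
from dataclasses import dataclass
from typing import Callable, List

Matrix = List[List[float]]


@dataclass(frozen=True)
class TransposeComputeInstr:
    """Hypothetical fused instruction: transpose a submatrix and apply `op`."""
    row: int      # first row of the submatrix in the input tensor
    col: int      # first column of the submatrix in the input tensor
    height: int   # number of rows in the submatrix
    width: int    # number of columns in the submatrix
    op: Callable[[float], float]  # assumed elementwise computation


def execute(instr: TransposeComputeInstr, tensor: Matrix) -> Matrix:
    """Run one instruction: each submatrix column becomes one output row."""
    out: Matrix = []
    for c in range(instr.col, instr.col + instr.width):
        # Walk down column c, applying the computation to each element;
        # emitting the column as a row performs the transpose in the same pass.
        out.append([instr.op(tensor[r][c])
                    for r in range(instr.row, instr.row + instr.height)])
    return out


# Model of the instruction buffer: a list of entries, one instruction each.
instruction_buffer = [
    TransposeComputeInstr(row=0, col=1, height=2, width=2, op=lambda x: x * 2.0),
]

tensor = [
    [1.0, 2.0, 3.0],
    [4.0, 5.0, 6.0],
]

# Retrieve each instruction from the buffer and execute it.
results = [execute(instr, tensor) for instr in instruction_buffer]
print(results[0])
```

Here the selected submatrix is `[[2, 3], [5, 6]]`; the single instruction reads it column by column, so no separate transpose step or intermediate transposed copy is needed before the computation.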