SYSTEMS AND METHODS FOR PERFORMING INSTRUCTIONS TO TRANSFORM MATRICES INTO ROW-INTERLEAVED FORMAT

    公开(公告)号:US20220357950A1

    公开(公告)日:2022-11-10

    申请号:US17865849

    申请日:2022-07-15

    Abstract: Disclosed embodiments relate to systems and methods for performing instructions to transform matrices into a row-interleaved format. In one example, a processor includes fetch and decode circuitry to fetch and decode an instruction having fields to specify an opcode and locations of source and destination matrices, wherein the opcode indicates that the processor is to transform the specified source matrix into the specified destination matrix having the row-interleaved format; and execution circuitry to respond to the decoded instruction by transforming the specified source matrix into the specified RowInt-formatted destination matrix by interleaving J elements of each J-element sub-column of the specified source matrix in either row-major or column-major order into a K-wide submatrix of the specified destination matrix, the K-wide submatrix having K columns and enough rows to hold the J elements.

    EFFICIENT MULTIPLY AND ACCUMULATE INSTRUCTION WHEN AN OPERAND IS EQUAL TO OR NEAR A POWER OF TWO

    公开(公告)号:US20220197595A1

    公开(公告)日:2022-06-23

    申请号:US17129636

    申请日:2020-12-21

    Abstract: Techniques and apparatuses for performing a near multiply and accumulate instruction are described. An apparatus includes decoder circuitry to decode an instruction, the instruction to include a field for an identifier of a first source operand, a field for an identifier of a second source operand, and a field for an identifier of a third source operand. The apparatus also includes execution circuitry to execute the decoded instruction to perform a multiplication of a pair of data elements from the first and second source operands to produce a product data element via a shift operation when at least one data element in the pair of data elements is equal to a power of two or near a power of two or via multiplication of the pair of data elements when the pair of data elements is neither equal to a power of two or near a power of two.

    SYSTEMS, APPARATUSES, AND METHODS FOR DUAL COMPLEX MULTIPLY ADD OF SIGNED WORDS

    公开(公告)号:US20220107804A1

    公开(公告)日:2022-04-07

    申请号:US17509917

    申请日:2021-10-25

    Abstract: Embodiments of systems, apparatuses, and methods for dual complex number multiplication and addition in a processor are described. For example, execution circuitry executes a decoded instruction to multiplex data values from positions in source operands to a multiplier, the source operands including pairs complex numbers, calculate a real part of a product of each pair of complex numbers, add the real part of the product of a first pair of complex numbers to the real part of the product of a second pair of complex numbers to calculate a first real result, and add the real part of the product of a third pair of complex numbers to the real part of the product of a fourth pair of complex numbers to calculate a second real result, and store the results to corresponding positions in the destination operand.

Patent Agency Ranking