DUAL DATA STREAMS SHARING DUAL CACHE ACCESS PORTS

    公开(公告)号:US20240403055A1

    公开(公告)日:2024-12-05

    申请号:US18800249

    申请日:2024-08-12

    Abstract: A streaming engine employed in a digital data processor specifies fixed first and second read only data streams. Corresponding stream address generator produces address of data elements of the two streams. Corresponding steam head registers stores data elements next to be supplied to functional units for use as operands. The two streams share two memory ports. A toggling preference of stream to port ensures fair allocation. The arbiters permit one stream to borrow the other's interface when the other interface is idle. Thus one stream may issue two memory requests, one from each memory port, if the other stream is idle. This spreads the bandwidth demand for each stream across both interfaces, ensuring neither interface becomes a bottleneck.

    Pipeline protection for CPUs with save and restore of intermediate results

    公开(公告)号:US11789742B2

    公开(公告)日:2023-10-17

    申请号:US17688260

    申请日:2022-03-07

    CPC classification number: G06F9/3867 G06F9/3838

    Abstract: Techniques related to executing a plurality of instructions by a processor comprising a method for executing a plurality of instructions by a processor. The method comprises detecting a pipeline hazard based on one or more instructions provided for execution by an instruction execution pipeline, beginning execution of an instruction, of the one or more instructions on the instruction execution pipeline, stalling a portion of the instruction execution pipeline based on the detected pipeline hazard, storing a register state associated with the execution of the instruction based on the stalling, determining that the pipeline hazard has been resolved, and restoring the register state to the instruction execution pipeline based on the determination.

    Vector bit transpose
    38.
    发明授权

    公开(公告)号:US11604648B2

    公开(公告)日:2023-03-14

    申请号:US17353908

    申请日:2021-06-22

    Abstract: A method to transpose source data in a processor in response to a vector bit transpose instruction includes specifying, in respective fields of the vector bit transpose instruction, a source register containing the source data and a destination register to store transposed data. The method also includes executing the vector bit transpose instruction by interpreting N×N bits of the source data as a two-dimensional array having N rows and N columns, creating transposed source data by transposing the bits by reversing a row index and a column index for each bit, and storing the transposed source data in the destination register.

    Streaming engine with compressed encoding for loop circular buffer sizes

    公开(公告)号:US11449429B2

    公开(公告)日:2022-09-20

    申请号:US17234874

    申请日:2021-04-20

    Inventor: Joseph Zbiciak

    Abstract: A streaming engine employed in a digital data processor specifies a fixed read only data stream defined by plural nested loops. An address generator produces address of data elements for the nested loops. A steam head register stores data elements next to be supplied to functional units for use as operands. A stream template register specifies a circular address mode for the loop, first and second block size numbers and a circular address block size selection. For a first circular address block size selection the block size corresponds to the first block size number. For a first circular address block size selection the block size corresponds to the first block size number. For a second circular address block size selection the block size corresponds to a sum of the first block size number and the second block size number.

    Processing device with vector transformation execution

    公开(公告)号:US11327761B2

    公开(公告)日:2022-05-10

    申请号:US16881327

    申请日:2020-05-22

    Abstract: An integrated circuit, comprising an instruction pipeline that includes instruction fetch phase circuitry, instruction decode phase circuitry, and instruction execution circuitry. The instruction execution circuitry includes transformation circuitry for receiving an interleaved dual vector operand as an input and for outputting a first natural order vector including a first set of data values from the interleaved dual vector operand and a second natural order vector including a second set of data values from the interleaved dual vector operand.

Patent Agency Ranking