Interruptible and restartable matrix multiplication instructions, processors, methods, and systems

    公开(公告)号:US11048508B2

    公开(公告)日:2021-06-29

    申请号:US16398200

    申请日:2019-04-29

    Abstract: A processor of an aspect includes a decode unit to decode a matrix multiplication instruction. The matrix multiplication instruction is to indicate a first memory location of a first source matrix, is to indicate a second memory location of a second source matrix, and is to indicate a third memory location where a result matrix is to be stored. The processor also includes an execution unit coupled with the decode unit. The execution unit, in response to the matrix multiplication instruction, is to multiply a portion of the first and second source matrices prior to an interruption, and store a completion progress indicator in response to the interruption. The completion progress indicator to indicate an amount of progress in multiplying the first and second source matrices, and storing corresponding result data to the third memory location, that is to have been completed prior to the interruption.

    METHODS, APPARATUS, INSTRUCTIONS AND LOGIC TO PROVIDE VECTOR PACKED HISTOGRAM FUNCTIONALITY
    8.
    发明申请
    METHODS, APPARATUS, INSTRUCTIONS AND LOGIC TO PROVIDE VECTOR PACKED HISTOGRAM FUNCTIONALITY 有权
    方法,装置,说明和逻辑提供矢量包装组织功能

    公开(公告)号:US20160378716A1

    公开(公告)日:2016-12-29

    申请号:US14752054

    申请日:2015-06-26

    Abstract: Instructions and logic provide SIMD vector packed histogram functionality. Some processor embodiments include first and second registers storing, in each of a plurality of data fields of a register lane portion, corresponding elements of a first and of a second data type, respectively. A decode stage decodes an instruction for SIMD vector packed histograms. One or more execution units, compare each element of the first data type, in the first register lane portion, with a range specified by the instruction. For any elements of the first register portion in said range, corresponding elements of the second data type, from the second register portion, are added into one of a plurality data fields of a destination register lane portion, selected according to the value of its corresponding element of the first data type, to generate packed weighted histograms for each destination register lane portion.

    Abstract translation: 指令和逻辑提供SIMD矢量压缩直方图功能。 一些处理器实施例包括分别在寄存器通道部分的多个数据字段的每一个中分别存储第一和第二数据类型的对应元件的第一和第二寄存器。 解码级对SIMD矢量压缩直方图的指令进行解码。 一个或多个执行单元将第一注册通道部分中的第一数据类型的每个元素与指令指定的范围进行比较。 对于所述范围中的第一寄存器部分的任何元件,来自第二寄存器部分的第二数据类型的对应元件被添加到目的地寄存器通道部分的多个数据字段中的一个,根据其相应的值 元素,以产生每个目的地寄存器通道部分的压缩的直方图。

    Providing extended cache replacement state information
    9.
    发明授权
    Providing extended cache replacement state information 有权
    提供扩展缓存替换状态信息

    公开(公告)号:US09170955B2

    公开(公告)日:2015-10-27

    申请号:US13685991

    申请日:2012-11-27

    CPC classification number: G06F12/126 G06F12/123 Y02D10/13

    Abstract: In an embodiment, a processor includes a decode logic to receive and decode a first memory access instruction to store data in a cache memory with a replacement state indicator of a first level, and to send the decoded first memory access instruction to a control logic. In turn, the control logic is to store the data in a first way of a first set of the cache memory and to store the replacement state indicator of the first level in a metadata field of the first way responsive to the decoded first memory access instruction. Other embodiments are described and claimed.

    Abstract translation: 在一个实施例中,处理器包括解码逻辑,用于接收和解码第一存储器访问指令以将数据存储在具有第一级的替换状态指示符的高速缓冲存储器中,并将解码的第一存储器访问指令发送到控制逻辑。 反过来,控制逻辑是以第一组高速缓冲存储器的第一种方式存储数据,并且响应于解码的第一存储器访问指令将第一级的替换状态指示符存储在第一方式的元数据字段中 。 描述和要求保护其他实施例。

Patent Agency Ranking