Systems, apparatuses, and methods for arithmetic recurrence

    公开(公告)号:US10120680B2

    公开(公告)日:2018-11-06

    申请号:US15396184

    申请日:2016-12-30

    IPC分类号: G06F9/30

    摘要: Embodiments of systems, apparatuses, and methods for broadcast arithmetic in a processor are described. For example, execution circuitry executes a decoded instruction to broadcast a data value from a least significant packed data element position of a first packed data source operand to a plurality of arithmetic circuits and for each packed data element position of a second packed data source operand, other than a least significant packed data element position, perform the arithmetic operation defined by the instruction on a data value from that packed data element position of the second packed data source operand and all data values from packed data element positions of the second packed data source operand that are of lesser position significance to the broadcast data value from the least significant packed data element position of the first packed data source operand, and stores a result of each arithmetic operation into a packed data element position of the packed data destination operand that corresponds to a most significant packed data element position of the second packed data source operand.

    Systems, Apparatuses, and Methods for Lane-Based Strided Gather

    公开(公告)号:US20170192784A1

    公开(公告)日:2017-07-06

    申请号:US14984233

    申请日:2015-12-30

    IPC分类号: G06F9/30

    摘要: Embodiments of systems, apparatuses, and methods for lane-based strided gather are disclosed. In an embodiment, an apparatus includes a decoder to decode an instruction, wherein the instruction to include fields for indices of addresses to memory, and a packed data destination register operand; and execution circuitry to execute the decoded instruction to extract data elements of a defined number of types from memory using the indices of the instruction, and for each type, store the extracted data elements in one or more lanes of a packed data destination register dedicated to that type, wherein relative data elements between types are strided data elements apart.

    Systems, apparatuses, and methods for performing delta decoding on packed data elements
    8.
    发明授权
    Systems, apparatuses, and methods for performing delta decoding on packed data elements 有权
    用于对压缩数据元素执行增量解码的系统,装置和方法

    公开(公告)号:US09557998B2

    公开(公告)日:2017-01-31

    申请号:US13997662

    申请日:2011-12-28

    IPC分类号: G06F9/30 H04N19/42

    摘要: Systems, apparatuses, and methods for performing delta decoding on packed data elements of a source and storing the results in packed data elements of a destination using a single packed delta decode instruction are described. A processor may include a decoder to decode an instruction, and execution unit to execute the decoded instruction to calculate for each packed data element position of a source operand, other than a first packed data element position, a value that comprises a packed data element of that packed data element position and all packed data elements of packed data element positions that are of lesser significance, store a first packed data element from the first packed data element position of the source operand into a corresponding first packed data element position of a destination operand, and for each calculated value, store the value into a corresponding packed data element position of the destination operand.

    摘要翻译: 描述了用于对源的压缩数据元素执行增量解码并使用单个压缩增量解码指令将结果存储在目的地的打包数据元素中的系统,装置和方法。 处理器可以包括用于对指令进行解码的解码器,以及执行单元,用于执行解码指令,以计算除第一打包数据元素位置以外的源操作数的每个压缩数据元素位置,该值包括 打包数据元素位置和打包数据元素位置的所有压缩数据元素都不太重要,将来自源操作数的第一打包数据元素位置的第一打包数据元素存储到目的地操作数的对应的第一打包数据元素位置 ,并且对于每个计算值,将该值存储到目的地操作数的对应的打包数据元素位置。