APPARATUS AND METHOD FOR SHIFTING QUADWORDS AND EXTRACTING PACKED WORDS

    公开(公告)号:US20190102177A1

    公开(公告)日:2019-04-04

    申请号:US15721382

    申请日:2017-09-29

    Abstract: An apparatus and method for performing left-shifting operations on packed quadword data. For example, one embodiment of a processor comprises: a decoder to decode a left-shift instruction to generate a decoded left-shift instruction; a first source register to store a plurality of packed quadwords data elements; execution circuitry to execute the decoded left-shift instruction, the execution circuitry comprising shift circuitry to left-shift at least first and second packed quadword data elements from first and second packed quadword data element locations, respectively, in the first source register by an amount specified in an immediate value or in a control value in a second source register, to generate first and second left-shifted quadwords; the execution circuitry to cause selection of 16 most significant bits of the first and second left-shifted quadwords to be written to 16 least significant bit regions of first and second quadword data element locations, respectively, of a destination register; and the destination register to store the specified set of the 16 most significant bits of the first and second left-shifted quadwords.

    APPARATUS AND METHOD FOR VECTOR BROADCAST AND XORAND LOGICAL INSTRUCTION
    52.
    发明申请
    APPARATUS AND METHOD FOR VECTOR BROADCAST AND XORAND LOGICAL INSTRUCTION 审中-公开
    矢量广播和XORAND逻辑指导的装置和方法

    公开(公告)号:US20160179523A1

    公开(公告)日:2016-06-23

    申请号:US14582171

    申请日:2014-12-23

    CPC classification number: G06F9/30029 G06F9/30018 G06F9/30036

    Abstract: An apparatus and method are described for performing a vector broadcast and XORAND logical instruction. For example, one embodiment of a processor comprises: fetch logic to fetch an instruction from memory indicating a destination packed data operand, a first source packed data operand, a second source packed data operand, and an immediate operand, and execution logic to determine a bit in the second source packed data operand based a position corresponding to the immediate value, perform a bitwise AND between the first source packed data operand and the determined bit to generate an intermediate result, perform a bitwise XOR between the destination packed data operand and the intermediate result to generate a final result, and store the final result in a storage location indicated by the destination packed data operand.

    Abstract translation: 描述了用于执行向量广播和XORAND逻辑指令的装置和方法。 例如,处理器的一个实施例包括:提取逻辑,用于从存储器取出指令,指示目的地打包数据操作数,第一源打包数据操作数,第二源打包数据操作数和立即操作数,以及执行逻辑,以确定 在第二源打包数据操作数中基于对应于立即数的位置的位,在第一源打包数据操作数和确定的位之间执行按位AND,以产生中间结果,在目的地打包数据操作数与 中间结果以产生最终结果,并将最终结果存储在由目的地打包数据操作数指示的存储位置中。

    VECTOR INSTRUCTION TO COMPUTE COORDIANTE OF NEXT POINT IN A Z-ORDER CURVE
    53.
    发明申请
    VECTOR INSTRUCTION TO COMPUTE COORDIANTE OF NEXT POINT IN A Z-ORDER CURVE 审中-公开
    在Z-ORDER曲线中计算下一个点的向量的向量

    公开(公告)号:US20160139921A1

    公开(公告)日:2016-05-19

    申请号:US14542457

    申请日:2014-11-14

    Abstract: In one embodiment, a processor includes machine level instructions to compute a next point in a Z-order curve of a specified dimension for a specified coordinate. A processor decode unit is configured to decode an instruction having a source and immediate operands including a first z-curve index, the specified dimension and the specified coordinate. A processor execution unit is configured to execute the decoded instruction to compute the coordinate of the next point by incrementing the coordinate value associated with the specified coordinate to generate a second z-curve index including the incremented coordinate.

    Abstract translation: 在一个实施例中,处理器包括用于计算指定坐标的指定维度的Z次曲线中的下一个点的机器级指令。 处理器解码单元被配置为对具有包括第一z-曲线索引,指定尺寸和指定坐标的源和立即操作数的指令进行解码。 处理器执行单元被配置为通过增加与指定坐标相关联的坐标值来执行解码指令以计算下一点的坐标,以生成包括递增坐标的第二z-曲线索引。

    INSTRUCTIONS FOR VECTOR MULTIPLICATION OF UNSIGNED WORDS WITH ROUNDING

    公开(公告)号:US20220318009A1

    公开(公告)日:2022-10-06

    申请号:US17573556

    申请日:2022-01-11

    Abstract: Disclosed embodiments relate to executing a vector multiplication instruction. In one example, a processor includes fetch circuitry to fetch the vector multiplication instruction having fields for an opcode, first and second source identifiers, and a destination identifier, decode circuitry to decode the fetched instruction, execution circuitry to, on each of a plurality of corresponding pairs of fixed-sized elements of the identified first and second sources, execute the decoded instruction to generate a double-sized product of each pair of fixed-sized elements, the double-sized product being represented by at least twice a number of bits of the fixed size, and generate an unsigned fixed-sized result by rounding the most significant fixed-sized portion of the double-sized product to fit into the identified destination.

Patent Agency Ranking