Instruction and logic to provide SIMD secure hashing round slice functionality
    291.
    发明授权
    Instruction and logic to provide SIMD secure hashing round slice functionality 有权
    提供SIMD安全散列圆切片功能的指令和逻辑

    公开(公告)号:US08924741B2

    公开(公告)日:2014-12-30

    申请号:US13731004

    申请日:2012-12-29

    Abstract: Instructions and logic provide SIMD secure hashing round slice functionality. Some embodiments include a processor comprising: a decode stage to decode an instruction for a SIMD secure hashing algorithm round slice, the instruction specifying a source data operand set, a message-plus-constant operand set, a round-slice portion of the secure hashing algorithm round, and a rotator set portion of rotate settings. Processor execution units, are responsive to the decoded instruction, to perform a secure hashing round-slice set of round iterations upon the source data operand set, applying the message-plus-constant operand set and the rotator set, and store a result of the instruction in a SIMD destination register. One embodiment of the instruction specifies a hash round type as one of four MD5 round types. Other embodiments may specify a hash round type by an immediate operand as one of three SHA-1 round types or as a SHA-2 round type.

    Abstract translation: 说明和逻辑提供SIMD安全散列圆切片功能。 一些实施例包括处理器,包括:解码级,用于解码用于SIMD安全散列算法圆切片的指令,指定源数据操作数集合的指令,消息加常数操作数集合,安全散列的圆切片部分 圆周运算,旋转设定部分旋转设定。 处理器执行单元响应于解码的指令,在源数据操作数集合上执行循环迭代的安全散列圆切片集合,应用消息加常数操作数集合和旋转器集合,并且存储 SIMD目的寄存器中的指令。 该指令的一个实施例将哈希循环类型指定为四个MD5循环类型之一。 其他实施例可以通过立即操作数来指定散列循环类型,作为三种SHA-1轮型之一或SHA-2轮型。

    APPARATUS AND METHOD OF EXECUTION UNIT FOR CALCULATING MULTIPLE ROUNDS OF A SKEIN HASHING ALGORITHM
    292.
    发明申请
    APPARATUS AND METHOD OF EXECUTION UNIT FOR CALCULATING MULTIPLE ROUNDS OF A SKEIN HASHING ALGORITHM 有权
    用于计算多重环绕滑移算法的执行单元的装置和方法

    公开(公告)号:US20140122839A1

    公开(公告)日:2014-05-01

    申请号:US13997186

    申请日:2011-12-22

    Abstract: An apparatus is described that includes an execution unit within an instruction pipeline. The execution unit has multiple stages of a circuit that includes a) and b) as follows. a) a first logic circuitry section having multiple mix logic sections each having: i) a first input to receive a first quad word and a second input to receive a second quad word; ii) an adder having a pair of inputs that are respectively coupled to the first and second inputs; iii) a rotator having a respective input coupled to the second input; iv) an XOR gate having a first input coupled to an output of the adder and a second input coupled to an output of the rotator. b) permute logic circuitry having inputs coupled to the respective adder and XOR gate outputs of the multiple mix logic sections.

    Abstract translation: 描述了包括指令流水线内的执行单元的装置。 执行单元具有包括a)和b)的电路的多个级,如下所述。 a)具有多个混合逻辑部分的第一逻辑电路部分,每个混合逻辑部分具有:i)用于接收第一四进制字的第一输入和用于接收第二四字的第二输入; ii)具有分别耦合到第一和第二输入的一对输入的加法器; iii)具有耦合到第二输入的相应输入的旋转器; iv)具有耦合到加法器的输出的第一输入和耦合到转子的输出的第二输入的异或门。 b)具有耦合到多个混合逻辑部分的相应加法器和异或门输出的输入的置换逻辑电路。

    Compressed cache memory with decompress on fault

    公开(公告)号:US12130738B2

    公开(公告)日:2024-10-29

    申请号:US17130632

    申请日:2020-12-22

    CPC classification number: G06F12/0802 H03M7/60 G06F2212/401 G06F2212/60

    Abstract: An embodiment of an integrated circuit may comprise, coupled to a core, a hardware decompression accelerator, a compressed cache, a processor and communicatively coupled to the hardware decompression accelerator and the compressed cache, and memory and communicatively coupled to the processor, wherein the memory stores microcode instructions which when executed by the processor causes the processor to store a first address to a decompression work descriptor, retrieve a second address where a compressed page is stored in the compressed cache from the decompression work descriptor at the first address in response to an indication of a page fault, and send instructions to the hardware decompression accelerator to decompress the compressed page at the second address. Other embodiments are disclosed and claimed.

    MULTIPLE OPERATION FUSED ADDITION AND SUBTRACTION INSTRUCTION SET

    公开(公告)号:US20230297389A1

    公开(公告)日:2023-09-21

    申请号:US17695533

    申请日:2022-03-15

    CPC classification number: G06F9/3893 G06F9/3001

    Abstract: An embodiment of an apparatus comprises decode circuitry to decode a single instruction, the single instruction to include respective fields for one or more source operands, one or more destination operands, and an opcode, the opcode to indicate execution circuitry is to perform a fused addition and subtraction operation, and execution circuitry to execute the decoded instruction according to the opcode to retrieve data from one or more locations indicated by the one or more source operands, to perform the fused addition and subtraction operation indicated by the opcode on three or more arguments indicated by the retrieved data to produce one or more results. Other embodiments are disclosed and claimed.

    Apparatuses, methods, and systems for hashing instructions

    公开(公告)号:US11681530B2

    公开(公告)日:2023-06-20

    申请号:US17688728

    申请日:2022-03-07

    Abstract: Systems, methods, and apparatuses relating to performing hashing operations on packed data elements are described. In one embodiment, a processor includes a decode circuit to decode a single instruction into a decoded single instruction, the single instruction including at least one first field that identifies eight 32-bit state elements A, B, C, D, E, F, G, and H for a round according to a SM3 hashing standard and at least one second field that identifies an input message; and an execution circuit to execute the decoded single instruction to: rotate state element C left by 9 bits to form a rotated state element C, rotate state element D left by 9 bits to form a rotated state element D, rotate state element G left by 19 bits to form a rotated state element G, rotate state element H left by 19 bits to form a rotated state element H, perform two rounds according to the SM3 hashing standard on the input message and state element A, state element B, rotated state element C, rotated state element D, state element E, state element F, rotated state element G, and rotated state element H to generate an updated state element A, an updated state element B, an updated state element E, and an updated state element F, and store the updated state element A, the updated state element B, the updated state element E, and the updated state element F into a location specified by the single instruction.

    CIRCUITRY AND METHODS FOR ACCELERATING STREAMING DATA-TRANSFORMATION OPERATIONS

    公开(公告)号:US20230100586A1

    公开(公告)日:2023-03-30

    申请号:US17484840

    申请日:2021-09-24

    Abstract: Systems, methods, and apparatuses for accelerating streaming data-transformation operations are described. In one example, a system on a chip (SoC) includes a hardware processor core comprising a decoder circuit to decode an instruction comprising an opcode into a decoded instruction, the opcode to indicate an execution circuit is to generate a single descriptor and cause the single descriptor to be sent to an accelerator circuit coupled to the hardware processor core, and the execution circuit to execute the decoded instruction according to the opcode; and the accelerator circuit comprising a work dispatcher circuit and one or more work execution circuits to, in response to the single descriptor sent from the hardware processor core: when a field of the single descriptor is a first value, cause a single job to be sent by the work dispatcher circuit to a single work execution circuit of the one or more work execution circuits to perform an operation indicated in the single descriptor to generate an output, and when the field of the single descriptor is a second different value, cause a plurality of jobs to be sent by the work dispatcher circuit to the one or more work execution circuits to perform the operation indicated in the single descriptor to generate the output as a single stream.

    VERIFYING COMPRESSED STREAM FUSED WITH COPY OR TRANSFORM OPERATIONS

    公开(公告)号:US20230075667A1

    公开(公告)日:2023-03-09

    申请号:US17470089

    申请日:2021-09-09

    Abstract: Methods and apparatus relating to verifying a compressed stream fused with copy or transform operation(s) are described. In an embodiment, compression logic circuitry compresses input data and stores the compressed data in a temporary buffer. The compression logic circuitry determines a first checksum value corresponding to the compressed data stored in the temporary buffer. Decompression logic circuitry performs a decompress-verify operation and a copy operation. The decompress-verify operation decompresses the compressed data stored in the temporary buffer to determine a second checksum value corresponding to the decompressed data from the temporary buffer. The copy operation transfers the compressed data from the temporary buffer to a destination buffer in response to a match between the first checksum value and the second checksum value. Other embodiments are also disclosed and claimed.

Patent Agency Ranking