APPARATUSES, METHODS, AND SYSTEMS FOR INSTRUCTIONS TO MULTIPLY FLOATING-POINT VALUES OF ABOUT ZERO

    公开(公告)号:US20210182068A1

    公开(公告)日:2021-06-17

    申请号:US16714667

    申请日:2019-12-13

    Abstract: Systems, methods, and apparatuses relating to instructions to multiply floating-point values of about zero are described. In one embodiment, a hardware processor includes a decoder to decode a single instruction into a decoded single instruction, the single instruction having a first field that identifies a first floating-point number, a second field that identifies a second floating-point number, and a third field that indicates an about zero threshold; and an execution circuit to execute the decoded single instruction to: cause a first comparison of an exponent of the first floating-point number to the about zero threshold, cause a second comparison of an exponent of the second floating-point number to the about zero threshold, provide as a resultant of the single instruction a value of zero when the first comparison indicates the exponent of the first floating-point number does not exceed the about zero threshold, provide as the resultant of the single instruction the value of zero when the second comparison indicates the exponent of the second floating-point number does not exceed the about zero threshold, and provide as the resultant of the single instruction a product of a multiplication of the first floating-point number and the second floating-point number when the first comparison indicates the exponent of the first floating-point number exceeds the about zero threshold and the second comparison indicates the exponent of the second floating-point number exceeds the about zero threshold.

    APPARATUS AND METHOD OF IMPROVED EXTRACT INSTRUCTIONS

    公开(公告)号:US20180081689A1

    公开(公告)日:2018-03-22

    申请号:US15809818

    申请日:2017-11-10

    Abstract: An apparatus is described that includes instruction execution circuitry to execute first, second, third, and fourth instructions, the first and second instructions select a first group of input vector elements from one of multiple first non-overlapping sections of respective first and second input vectors. Each of the multiple first non-overlapping sections have a same bit width as the first group. Both the third and fourth instructions select a second group of input vector elements from one of multiple second non-overlapping sections of respective third and fourth input vectors. The second group has a second bit width that is larger than the first bit width. Each of multiple second non-overlapping sections have a same bit width as the second group. The apparatus includes masking layer circuitry to mask the first and second groups at a first granularity and second granularity.

    INSTRUCTION FOR ELEMENT OFFSET CALCULATION IN A MULTI-DIMENSIONAL ARRAY
    16.
    发明申请
    INSTRUCTION FOR ELEMENT OFFSET CALCULATION IN A MULTI-DIMENSIONAL ARRAY 审中-公开
    元素偏差计算在多维阵列中的指导

    公开(公告)号:US20170075691A1

    公开(公告)日:2017-03-16

    申请号:US15363785

    申请日:2016-11-29

    Abstract: An apparatus is described having functional unit logic circuitry. The functional unit logic circuitry has a first register to store a first input vector operand having an element for each dimension of a multi-dimensional data structure. Each element of the first vector operand specifying the size of its respective dimension. The functional unit has a second register to store a second input vector operand specifying coordinates of a particular segment of the multi-dimensional structure. The functional unit also has logic circuitry to calculate an address offset for the particular segment relative to an address of an origin segment of the multi-dimensional structure.

    Abstract translation: 描述了具有功能单元逻辑电路的装置。 功能单元逻辑电路具有第一寄存器以存储具有用于多维数据结构的每个维度的元素的第一输入向量操作数。 第一个向量操作数的每个元素指定其相应维度的大小。 功能单元具有第二寄存器,用于存储指定多维结构的特定段的坐标的第二输入向量操作数。 功能单元还具有逻辑电路,用于相对于多维结构的原点片段的地址计算特定片段的地址偏移。

    METHOD AND APPARATUS FOR PERFORMING A VECTOR BIT REVERSAL AND CROSSING
    17.
    发明申请
    METHOD AND APPARATUS FOR PERFORMING A VECTOR BIT REVERSAL AND CROSSING 有权
    用于执行向量位反转和交叉的方法和装置

    公开(公告)号:US20160179529A1

    公开(公告)日:2016-06-23

    申请号:US14581738

    申请日:2014-12-23

    CPC classification number: G06F9/30036 G06F9/30018 G06F9/30032

    Abstract: An apparatus and method for performing a vector bit reversal and crossing. For example, one embodiment of a processor comprises: a first source vector register to store a first plurality of source bit groups, wherein a size for the bit groups is to be specified in an immediate of an instruction; a second source vector to store a second plurality of source bit groups; vector bit reversal and crossing logic to determine a bit group size from the immediate and to responsively reverse positions of contiguous bit groups within the first source vector register to generate a set of reversed bit groups, wherein the vector bit reversal and crossing logic is to additionally interleave the set of reversed bit groups with the second plurality of bit groups; and a destination vector register to store the reversed bit groups interleaved with the first plurality of bit groups.

    Abstract translation: 用于执行向量位反转和交叉的装置和方法。 例如,处理器的一个实施例包括:第一源向量寄存器,用于存储第一多个源位组,其中用于位组的大小将在指令的立即指定中; 用于存储第二多个源比特组的第二源向量; 矢量位反转和交叉逻辑,以从第一源向量寄存器内的连续位组的立即和响应地反向位置确定位组大小,以产生一组反向位组,其中向量位反转和交叉逻辑额外地 将所述一组反转位组与所述第二多个位组进行交织; 以及目的地向量寄存器,用于存储与第一多个比特组交织的反向比特组。

    METHOD AND APPARATUS FOR VECTOR INDEX LOAD AND STORE
    18.
    发明申请
    METHOD AND APPARATUS FOR VECTOR INDEX LOAD AND STORE 有权
    矢量索引装载和存储的方法和装置

    公开(公告)号:US20160179526A1

    公开(公告)日:2016-06-23

    申请号:US14581289

    申请日:2014-12-23

    Abstract: An apparatus and method for performing vector index loads and stores. For example, one embodiment of a processor comprises: a vector index register to store a plurality of index values; a mask register to store a plurality of mask bits; a vector register to store a plurality of vector data elements loaded from memory; and vector index load logic to identify an index stored in the vector index register to be used for a load operation using an immediate value and to responsively combine the index with a base memory address to determine a memory address for the load operation, the vector index load logic to load vector data elements from the memory address to the vector register in accordance with the plurality of mask bits.

    Abstract translation: 用于执行向量索引加载和存储的装置和方法。 例如,处理器的一个实施例包括:矢量索引寄存器,用于存储多个索引值; 掩模寄存器,用于存储多个掩码位; 向量寄存器,用于存储从存储器加载的多个向量数据元素; 以及矢量索引负载逻辑,以识别存储在矢量索引寄存器中的索引,以用于使用立即值的加载操作,并且响应地将索引与基本存储器地址组合以确定用于加载操作的存储器地址,向量索引 负载逻辑,以根据多个掩码位将矢量数据元素从存储器地址加载到向量寄存器。

    METHOD AND APPARATUS FOR EXPANDING A MASK TO A VECTOR OF MASK VALUES
    19.
    发明申请
    METHOD AND APPARATUS FOR EXPANDING A MASK TO A VECTOR OF MASK VALUES 审中-公开
    将掩模扩展到掩蔽值矢量的方法和装置

    公开(公告)号:US20160179521A1

    公开(公告)日:2016-06-23

    申请号:US14581578

    申请日:2014-12-23

    CPC classification number: G06F9/30018 G06F9/30032 G06F9/30036 G06F9/30072

    Abstract: An apparatus and method for performing a mask expand. For example, one embodiment of a processor comprises: a source mask register to store a plurality of mask values; mask expand logic to identify a first mask bit in the source mask register to be expanded using an index value and to determine a number of bit positions within a destination mask register into which the first mask bit is to be expanded using a second value, the mask expand logic to responsively copy the first mask bit to each of the determined bit positions within the destination mask register.

    Abstract translation: 一种用于执行掩模扩展的装置和方法。 例如,处理器的一个实施例包括:源掩码寄存器,用于存储多个掩码值; 掩码扩展逻辑,以使用索引值来识别要被扩展的源掩码寄存器中的第一掩码位,并且使用第二值确定目标掩码寄存器中要扩展第一掩码位的位位数, 掩码扩展逻辑以将第一掩码位响应地复制到目的掩码寄存器中的每个确定的位位置。

Patent Agency Ranking