Apparatus and method for redundant zero micro-operation removal
    61.
    发明申请
    Apparatus and method for redundant zero micro-operation removal 失效
    用于冗余零微操作去除的装置和方法

    公开(公告)号:US20050027964A1

    公开(公告)日:2005-02-03

    申请号:US10631628

    申请日:2003-07-30

    摘要: A method and apparatus for redundant zero micro-operation removal. In one embodiment, the method includes the identification of a predetermined macro-instruction. Once identified, a value associated with a source register operand of the identified macro-instruction is determined. Once determined, the identified macro-instruction is decoded into a first macro operation and a second micro-operation if the determined value is not set. Otherwise, the identified macro-instruction is decoded into a single micro-operation if the determined value is set. Accordingly, the method described prevents the generation of redundant micro-operations that use valuable resources, such as allocation slots, as well as execution units within the processor core.

    摘要翻译: 一种用于冗余零微操作移除的方法和装置。 在一个实施例中,该方法包括对预定宏指令的识别。 一旦确定,确定与所识别的宏指令的源寄存器操作数相关联的值。 一旦确定,如果未设置确定的值,则将所识别的宏指令解码为第一宏操作和第二微操作。 否则,如果确定的值被设置,则所识别的宏指令被解码为单个微操作。 因此,所描述的方法防止了使用诸如分配时隙的有价值的资源以及处理器核心内的执行单元的冗余微操作的产生。

    Fused multiply add operations using bit masks
    62.
    发明授权
    Fused multiply add operations using bit masks 有权
    融合乘法使用位掩码添加操作

    公开(公告)号:US09542154B2

    公开(公告)日:2017-01-10

    申请号:US13926175

    申请日:2013-06-25

    IPC分类号: G06F7/483 G06F7/544 G06F7/76

    摘要: Systems and methods of performing a fused multiply add (FMA) operations are provided. In one embodiment, the length of the adder used by the FMA operation is less than 3*N, where N is the number of bits in the mantissa term of a floating point number. A mask may be used to perform the addition portion of the FMA operation using the adder. A second mask may be used to denormalize the result of the addition portion of the FMA operation if an underflow occurs.

    摘要翻译: 提供了执行融合乘法(FMA)操作的系统和方法。 在一个实施例中,由FMA操作使用的加法器的长度小于3 * N,其中N是浮点数的尾数项中的位数。 可以使用掩码来使用加法器来执行FMA操作的相加部分。 如果发生下溢,则可以使用第二掩模来对FMA操作的添加部分的结果进行非规范化。

    Efficient parallel floating point exception handling in a processor
    69.
    发明授权
    Efficient parallel floating point exception handling in a processor 有权
    处理器中的高效并行浮点异常处理

    公开(公告)号:US08103858B2

    公开(公告)日:2012-01-24

    申请号:US12217084

    申请日:2008-06-30

    IPC分类号: G06F9/00

    摘要: Methods and apparatus are disclosed for handling floating point exceptions in a processor that executes single-instruction multiple-data (SIMD) instructions. In one embodiment a numerical exception is identified for a SIMD floating point operation and SIMD micro-operations are initiated to generate two packed partial results of a packed result for the SIMD floating point operation. A SIMD denormalization micro-operation is initiated to combine the two packed partial results and to denormalize one or more elements of the combined packed partial results to generate a packed result for the SIMD floating point operation having one or more denormal elements. Flags are set and stored with packed partial results to identify denormal elements. In one embodiment a SIMD normalization micro-operation is initiated to generate a normalized pseudo internal floating point representation prior to the SIMD floating point operation when it uses multiplication.

    摘要翻译: 公开了用于处理执行单指令多数据(SIMD)指令的处理器中的浮点异常的方法和装置。 在一个实施例中,识别用于SIMD浮点运算的数字异常,并启动SIMD微操作以产生用于SIMD浮点运算的打包结果的两个打包部分结果。 启动SIMD非规范化微操作以组合两个打包的部分结果并且对组合的打包部分结果的一个或多个元素进行非规范化,以生成具有一个或多个异常元素的SIMD浮点运算的打包结果。 标志被设置和存储与打包部分结果以识别异常元素。 在一个实施例中,当SIMD标准化微操作在使用乘法时在SIMD浮点运算之前产生归一化的伪内部浮点表示。

    GENERATING AND PERFORMING DEPENDENCY CONTROLLED FLOW COMPRISING MULTIPLE MICRO-OPERATIONS (uops)
    70.
    发明申请
    GENERATING AND PERFORMING DEPENDENCY CONTROLLED FLOW COMPRISING MULTIPLE MICRO-OPERATIONS (uops) 审中-公开
    生成和执行包含多个微操作的依赖性控制流(uop)

    公开(公告)号:US20090327657A1

    公开(公告)日:2009-12-31

    申请号:US12146390

    申请日:2008-06-25

    IPC分类号: G06F9/22

    摘要: A processor to perform an out-of-order (OOO) processing in which a reservation station (RS) may generate and process a dependency controlled flow comprising multiple micro-operations (uops) with specific clock based dispatch scheme. The RS may either combine two or more uops into a single RS entry or make a direct connection between two or more RS entries. The RS may allow more than two source values to be associated with a single RS by combining sources from the two or more uops. One or more execution units may be provisioned to perform the function defined by the uops. The execution units may receive more than two sources at a given time point and produce two or more results on different ports.

    摘要翻译: 执行无序(OOO)处理的处理器,其中保留站(RS)可以生成并处理包括具有特定的基于时钟的调度方案的多个微操作(uop)的依赖性控制流。 RS可以将两个或更多个uops组合成单个RS条目,或者在两个或更多个RS条目之间建立直接连接。 RS可以通过组合来自两个或更多个uops的源来允许多于两个源值与单个RS相关联。 可以提供一个或多个执行单元来执行由uops定义的功能。 执行单元可以在给定的时间点接收多于两个的源,并且在不同端口上产生两个或更多个结果。