PROCESSING DENORMAL NUMBERS IN FMA HARDWARE
    12.
    发明申请

    公开(公告)号:US20180095749A1

    公开(公告)日:2018-04-05

    申请号:US15283606

    申请日:2016-10-03

    发明人: THOMAS ELMER

    摘要: A microprocessor includes FMA execution logic that determines whether to accumulate an accumulator operand C to the partial products of multiplier and multiplicand operands A and B in the partial product adder or in a second accumulation stage. The logic calculates an exponent delta of Aexp+Bexp−Cexp and determines the number of leading zeroes in C, if C is denormal. The microprocessor accumulates C with the partial products of A and B when the accumulation of C to the product of A and B could result in mass cancellation, when ExpDelta is greater than or equal to −K (where K is related to a width of a datapath in the partial product adder), and when a C is denormal and its number of leading zeroes plus K exceeds −ExpDelta. The strategic use of resources in the partial product adder and second accumulation stage reduces latency.

    LOW ENERGY CONSUMPTION MANTISSA MULTIPLICATION FOR FLOATING POINT MULTIPLY-ADD OPERATIONS

    公开(公告)号:US20180095728A1

    公开(公告)日:2018-04-05

    申请号:US15283295

    申请日:2016-10-01

    申请人: Intel Corporation

    IPC分类号: G06F7/544 G06F7/487

    CPC分类号: G06F7/5443 G06F7/4876

    摘要: A floating point multiply-add unit having inputs coupled to receive a floating point multiplier data element, a floating point multiplicand data element, and a floating point addend data element. The multiply-add unit including a mantissa multiplier to multiply a mantissa of the multiplier data element and a mantissa of the multiplicand data element to calculate a mantissa product. The mantissa multiplier including a most significant bit portion to calculate most significant bits of the mantissa product, and a least significant bit portion to calculate least significant bits of the mantissa product. The mantissa multiplier has a plurality of different possible sizes of the least significant bit portion. Energy consumption reduction logic to selectively reduce energy consumption of the least significant bit portion, but not the most significant bit portion, to cause the least significant bit portion to not calculate the least significant bits of the mantissa product.

    Multiply-Accumulate Circuits
    14.
    发明申请

    公开(公告)号:US20180095722A1

    公开(公告)日:2018-04-05

    申请号:US15282021

    申请日:2016-09-30

    IPC分类号: G06F7/523 G11C13/00

    摘要: In some examples, a method may be performed by a multiply-accumulate circuit. As part of the method a row driver of the multiply-accumulate circuit may drive a row value line based on an input vector bit of an input vector received by the row driver. The row driver may also drive a row line that controls a corresponding memristor according to the input vector bit. The corresponding memristor may store a weight value bit of a weight value to apply to the input vector for a multiply-accumulate operation. The method may further include a sense amplifier generating an output voltage based on a current output from the corresponding memristor and counter circuitry adjusting a counter value that represents a running total of the multiply-accumulate operation based on the row value line, the output voltage generated by the sense amplifier, or a combination of both.

    Circuit for Performing a Multiply-and-Accumulate Operation

    公开(公告)号:US20180088908A1

    公开(公告)日:2018-03-29

    申请号:US15275037

    申请日:2016-09-23

    IPC分类号: G06F7/544

    CPC分类号: G06F7/5443

    摘要: A circuit includes a multiplier, an adder, a first result register and a second result register coupled to outputs of the multiplier and the adder, respectively. The circuit further includes: a first selection unit configured to selectively provide, to the multiplier and in response to a first control signal, a first value from a first plurality of values; and a second selection unit configured to selectively provide, to the multiplier and in response to a second control signal, a second value from a second plurality of values. The circuit also includes: a third selection unit configured to selectively provide, to the adder and in response to a third control signal, a third value from a third plurality of values; and a fourth selection unit configured to selectively provide, to the adder and in response to a fourth control signal, a fourth value from a fourth plurality of values.

    Microarchitecture for floating point fused multiply-add with exponent scaling

    公开(公告)号:US09841948B2

    公开(公告)日:2017-12-12

    申请号:US14824547

    申请日:2015-08-12

    发明人: Liang-Kai Wang

    IPC分类号: G06F7/483 G06F7/544

    CPC分类号: G06F7/483 G06F7/5443

    摘要: Systems and methods for implementing a floating point fused multiply and accumulate with scaling (FMASc) operation. A floating point unit receives input multiplier, multiplicand, addend, and scaling factor operands. A multiplier block is configured to multiply mantissas of the multiplier and multiplicand to generate an intermediate product. Alignment logic is configured to pre-align the addend with the intermediate product based on the scaling factor and exponents of the addend, multiplier, and multiplicand, and accumulation logic is configured to add or subtract a mantissa of the pre-aligned addend with the intermediate product to obtain a result of the floating point unit. Normalization and rounding are performed on the result, avoiding rounding during intermediate stages.