Enhanced low precision binary floating-point formatting

    公开(公告)号:US11775257B2

    公开(公告)日:2023-10-03

    申请号:US16840847

    申请日:2020-04-06

    IPC分类号: G06F7/499

    CPC分类号: G06F7/49915 G06F7/49968

    摘要: Techniques for operating on and calculating binary floating-point numbers using an enhanced floating-point number format are presented. The enhanced format can comprise a single sign bit, six bits for the exponent, and nine bits for the fraction. Using six bits for the exponent can provide an enhanced exponent range that facilitates desirably fast convergence of computing-intensive algorithms and low error rates for computing-intensive applications. The enhanced format can employ a specified definition for the lowest binade that enables the lowest binade to be used for zero and normal numbers; and a specified definition for the highest binade that enables it to be structured to have one data point used for a merged Not-a-Number (NaN)/infinity symbol and remaining data points used for finite numbers. The signs of zero and merged NaN/infinity can be “don't care” terms. The enhanced format employs only one rounding mode, which is for rounding toward nearest up.

    Low latency floating-point division operations

    公开(公告)号:US11314482B2

    公开(公告)日:2022-04-26

    申请号:US16684081

    申请日:2019-11-14

    IPC分类号: G06F7/487 G06F7/499

    摘要: Methods and systems for division operation are described. A processor can initialize an estimated quotient between the dividend and the divisor separately from a floating-point unit (FPU) pipeline. The processor can implement the FPU pipeline to execute a refinement process that can include at least a first iteration of operations and a second iteration of operations. The refinement process can include, in the first iteration of operations, generating a first unnormalized floating-point value using the initialized estimated quotient. The refinement process can include, in the second iteration of operations, generating a second unnormalized floating-point value using the first unnormalized floating-point value. The processor can determine a final quotient based on the second unnormalized floating-point value.

    DYNAMIC FUSION BASED ON OPERAND SIZE
    4.
    发明申请

    公开(公告)号:US20190179639A1

    公开(公告)日:2019-06-13

    申请号:US15834403

    申请日:2017-12-07

    IPC分类号: G06F9/30 G06F9/38

    摘要: Aspects of the invention include receiving, by a processor, a plurality of instructions at an instruction pipeline. The processor can further determine an operand bit field size for each of the received plurality of instructions. The processor can further compare the operand bit field size of at least a subset of the received instructions to a predetermined threshold. The processor can further fuse at least two of the received instructions that have an operand bit field size that meets the predetermined threshold. The processor can further perform an execution stage within the instruction pipeline to execute the received instructions, including the fused instructions.

    Half-precision floating-point arrays at low overhead

    公开(公告)号:US11281745B2

    公开(公告)日:2022-03-22

    申请号:US16542447

    申请日:2019-08-16

    IPC分类号: G06F17/16 G06F7/544

    摘要: Methods and systems of matrix multiplication are described. In an example, a processor can multiply a first entry of a first vector of a first data array with a second vector of a second data array to generate a third vector of a third data array. The processor can store the third vector of the third data array in the second register file. The processor can multiply a second entry of the first vector with the second vector to generate a fourth vector of the third data array. The processor can store the fourth vector of the third data array in the second register file. The processor can combine vectors of the third data array that are stored in the second register file to produce the third data array.

    Instruction initialization in a dataflow architecture

    公开(公告)号:US11223703B2

    公开(公告)日:2022-01-11

    申请号:US16358356

    申请日:2019-03-19

    IPC分类号: H04L29/08 H04L29/06

    摘要: Various embodiments are provided for implementing instruction initialization in a dataflow architecture in a computing environment. A data packet may be transmitted from a selected node to one or more of a plurality of nodes using one or more existing data paths in an initialization network. A determination operation is performed to determine whether one or more of a plurality of nodes is a target node intended for the data packet. Those of the plurality of nodes determined to be a target node initialize one or more components of the target node using the data packet. The data packet may be forwarded by each of the one or more of a plurality of nodes to a subsequent node in the initialization network.

    Enhanced low precision binary floating-point formatting

    公开(公告)号:US10656913B2

    公开(公告)日:2020-05-19

    申请号:US16000435

    申请日:2018-06-05

    IPC分类号: G06F7/483 G06F7/499

    摘要: Techniques for operating on and calculating binary floating-point numbers using an enhanced floating-point number format are presented. The enhanced format can comprise a single sign bit, six bits for the exponent, and nine bits for the fraction. Using six bits for the exponent can provide an enhanced exponent range that facilitates desirably fast convergence of computing-intensive algorithms and low error rates for computing-intensive applications. The enhanced format can employ a specified definition for the lowest binade that enables the lowest binade to be used for zero and normal numbers; and a specified definition for the highest binade that enables it to be structured to have one data point used for a merged Not-a-Number (NaN)/infinity symbol and remaining data points used for finite numbers. The signs of zero and merged NaN/infinity can be “don't care” terms. The enhanced format employs only one rounding mode, which is for rounding toward nearest up.