SYSTEM AND METHOD FOR HANDLING FLOATING POINT HARDWARE EXCEPTION

    公开(公告)号:US20220188109A1

    公开(公告)日:2022-06-16

    申请号:US17686682

    申请日:2022-03-04

    摘要: A method includes receiving an input data at a floating point arithmetic operating unit, wherein the floating point operating unit is configured to perform a floating point arithmetic operation on the input data. The method includes determining whether the received input data is a qnan (quiet not-a-number) or whether the received input data is an snan (signaling not-a-number) prior to performing the floating point arithmetic operation. The method also includes converting a value of the received input data to a modified value prior to performing the floating point arithmetic operation if the received input data is either qnan or snan, wherein the converting eliminates special handling associated with the floating point arithmetic operation on the input data being either qnan or snan.

    SYSTEM AND METHOD FOR INT9 QUANTIZATION

    公开(公告)号:US20230096994A1

    公开(公告)日:2023-03-30

    申请号:US18075678

    申请日:2022-12-06

    IPC分类号: G06N20/00

    摘要: A method of converting a data stored in a memory from a first format to a second format is disclosed. The method includes extending a number of bits in the data stored in a double data rate (DDR) memory by one bit to form an extended data. The method further includes determining whether the data stored in the DDR is signed or unsigned data. Moreover, responsive to determining that the data is signed, a sign value is added to the most significant bit of the extended data and the data is copied to lower order bits of the extended data. Responsive to determining that the data is unsigned, the data is copied to lower order bits of the extended data and the most significant bit is set to an unsigned value, e.g., zero. The extended data is stored in an on-chip memory (OCM) of a processing tile of a machine learning computer array.

    SYSTEM AND METHOD TO MANAGE POWER THROTTLING

    公开(公告)号:US20220244767A1

    公开(公告)日:2022-08-04

    申请号:US17726924

    申请日:2022-04-22

    IPC分类号: G06F1/26 G06F1/10

    摘要: A power throttling engine includes a register configured to receive a power throttling signal. The power throttling engine further includes a decoder configured to generate a vector based on a value of the power throttling signal. The value of the power throttling signal is an amount of power throttling of a device. The power throttling engine further includes a clock gating logic configured to receive the vector and further configured to receive a clocking signal. The clock gating logic is configured to remove clock edges of the clocking signal based on the vector to generate a throttled clocking signal.

    SYSTEM AND METHOD FOR HANDLING FLOATING POINT HARDWARE EXCEPTION

    公开(公告)号:US20220188108A1

    公开(公告)日:2022-06-16

    申请号:US17686676

    申请日:2022-03-04

    摘要: A method includes receiving an input data at a floating point arithmetic operating unit, wherein the floating point operating unit is configured to perform a floating point arithmetic operation on the input data to generate an output result. The method also includes determining whether the output result is going to cause a floating point hardware exception responsive to the floating point arithmetic operation on the input data. The method further includes converting a value of the output result to a modified value responsive to the determining that the output result is going to cause the floating point hardware exception, wherein the modified value eliminates the floating point hardware exception responsive to the floating point arithmetic operation on the input data.

    SYSTEM AND METHOD FOR INT9 QUANTIZATION

    公开(公告)号:US20210342734A1

    公开(公告)日:2021-11-04

    申请号:US16862549

    申请日:2020-04-29

    IPC分类号: G06N20/00

    摘要: A method of converting a data stored in a memory from a first format to a second format is disclosed. The method includes extending a number of bits in the data stored in a double data rate (DDR) memory by one bit to form an extended data. The method further includes determining whether the data stored in the DDR is signed or unsigned data. Moreover, responsive to determining that the data is signed, a sign value is added to the most significant bit of the extended data and the data is copied to lower order bits of the extended data. Responsive to determining that the data is unsigned, the data is copied to lower order bits of the extended data and the most significant bit is set to an unsigned value, e.g., zero. The extended data is stored in an on-chip memory (OCM) of a processing tile of a machine learning computer array.