-
公开(公告)号:US11907719B2
公开(公告)日:2024-02-20
申请号:US16914009
申请日:2020-06-26
申请人: Intel Corporation
IPC分类号: G06F9/30 , G06N20/00 , G06F30/343 , G06F30/34 , G06F30/38 , G06F7/50 , G06F7/523 , H03K19/17748 , H03M7/24 , G06F7/556 , H03K19/177 , G06F7/483
CPC分类号: G06F9/30101 , G06F7/50 , G06F7/523 , G06F7/556 , G06F9/30105 , G06F30/34 , G06F30/343 , G06F30/38 , G06N20/00 , H03K19/177 , H03K19/17748 , H03M7/24 , G06F7/483
摘要: The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
-
公开(公告)号:US11726744B2
公开(公告)日:2023-08-15
申请号:US17214526
申请日:2021-03-26
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen , Kevin Hurd
CPC分类号: G06F7/4876 , G06F7/5443 , G06F2207/382 , G06F2207/3824
摘要: An integrated circuit with specialized processing blocks is provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
公开(公告)号:US20210182023A1
公开(公告)日:2021-06-17
申请号:US16914107
申请日:2020-06-26
申请人: Intel Corporation
IPC分类号: G06F7/523 , G06F7/50 , G06N20/00 , H03M7/24 , H03K19/17748
摘要: The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
-
公开(公告)号:US20210182022A1
公开(公告)日:2021-06-17
申请号:US16914098
申请日:2020-06-26
申请人: Intel Corporation
摘要: The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
-
公开(公告)号:US12056461B2
公开(公告)日:2024-08-06
申请号:US17484845
申请日:2021-09-24
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen
CPC分类号: G06F7/5443 , G06F7/485 , G06F2207/382 , G06F2207/3824
摘要: An integrated circuit with specialized processing blocks are provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
公开(公告)号:US20230342111A1
公开(公告)日:2023-10-26
申请号:US18216797
申请日:2023-06-30
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen , Kevin Hurd
CPC分类号: G06F7/4876 , G06F7/5443 , G06F2207/3824 , G06F2207/382
摘要: An integrated circuit with specialized processing blocks is provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
公开(公告)号:US20220012015A1
公开(公告)日:2022-01-13
申请号:US17484845
申请日:2021-09-24
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen , Kevin Hurd
摘要: An integrated circuit with specialized processing blocks are provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
公开(公告)号:US11520584B2
公开(公告)日:2022-12-06
申请号:US16914098
申请日:2020-06-26
申请人: Intel Corporation
IPC分类号: G06F9/30 , G06N20/00 , G06F30/343 , G06F30/34 , G06F30/38 , G06F7/50 , G06F7/523 , H03K19/17748 , H03M7/24 , G06F7/556 , H03K19/177 , G06F7/483
摘要: The present disclosure describes a digital signal processing (DSP) block that includes a plurality of columns of weight registers and a plurality of inputs configured to receive a first plurality of values and a second plurality of values. The first plurality of values is stored in the plurality of columns of weight registers after being received. Additionally, the DSP block includes a plurality of multipliers configured to simultaneously multiply each value of the first plurality of values by each value of the second plurality of values.
-
公开(公告)号:US20210240440A1
公开(公告)日:2021-08-05
申请号:US17214526
申请日:2021-03-26
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen , Kevin Hurd
摘要: An integrated circuit with specialized processing blocks is provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
公开(公告)号:US10970042B2
公开(公告)日:2021-04-06
申请号:US16144904
申请日:2018-09-27
申请人: Intel Corporation
发明人: Martin Langhammer , Dongdong Chen , Kevin Hurd
摘要: An integrated circuit with specialized processing blocks is provided. A specialized processing block may be optimized for machine learning algorithms and may include a multiplier data path that feeds an adder data path. The multiplier data path may be decomposed into multiple partial product generators, multiple compressors, and multiple carry-propagate adders of a first precision. Results from the carry-propagate adders may be added using a floating-point adder of the first precision. Results from the floating-point adder may be optionally cast to a second precision that is higher or more accurate than the first precision. The adder data path may include an adder of the second precision that combines the results from the floating-point adder with zero, with a general-purpose input, or with other dot product terms. Operated in this way, the specialized processing block provides a technical improvement of greatly increasing the functional density for implementing machine learning algorithms.
-
-
-
-
-
-
-
-
-