-
1.
Publication number: US11966857B2
Publication date: 2024-04-23
Application number: US17223921
Filing date: 2021-04-06
Inventors: Avinash Sodani, Ulf Hanebutte, Chia-Hsin Chen
CPC classification: G06N5/04, G06F9/5027, G06F17/16, G06N20/00
Abstract: A processing unit to support inference acceleration for machine learning (ML) comprises an inline post-processing unit configured to accept and maintain one or more lookup tables for performing a tanh and/or sigmoid operation/function. The inline post-processing unit is further configured to accept data from a set of registers configured to maintain output from a processing block, instead of streaming the data from an on-chip memory (OCM); perform the tanh and/or sigmoid operation on each element of the data from the processing block on a per-element basis via the one or more lookup tables; and stream the post-processing result of the per-element tanh and/or sigmoid operation back to the OCM once the operation is complete.
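The per-element lookup-table approach this abstract describes can be sketched in software. The table size, input range, and nearest-entry indexing below are illustrative assumptions, not details taken from the patent:

```python
import math

# Hypothetical sketch: per-element tanh via a precomputed lookup table,
# as an inline post-processing unit might apply it. Table size and
# input range are illustrative choices.
TABLE_SIZE = 1024
X_MIN, X_MAX = -4.0, 4.0
STEP = (X_MAX - X_MIN) / (TABLE_SIZE - 1)
TANH_LUT = [math.tanh(X_MIN + i * STEP) for i in range(TABLE_SIZE)]

def tanh_lut(x: float) -> float:
    """Approximate tanh(x) by nearest-entry lookup; saturates outside the range."""
    if x <= X_MIN:
        return TANH_LUT[0]
    if x >= X_MAX:
        return TANH_LUT[-1]
    idx = round((x - X_MIN) / STEP)
    return TANH_LUT[idx]

def postprocess(block_output: list[float]) -> list[float]:
    # Apply the operation on a per-element basis, then "stream" the
    # result back (here: simply return the list).
    return [tanh_lut(v) for v in block_output]
```

A larger table or linear interpolation between adjacent entries would trade area for accuracy; the nearest-entry scheme keeps the per-element cost to one index computation and one read.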
-
2.
Publication number: US11687136B2
Publication date: 2023-06-27
Application number: US17726924
Filing date: 2022-04-22
Applicant: Marvell Asia Pte Ltd
Inventors: Avinash Sodani, Srinivas Sripada, Ramacharan Sundararaman, Chia-Hsin Chen, Nikhil Jayakumar
Abstract: A power throttling engine includes a register configured to receive a power throttling signal, whose value specifies the amount of power throttling applied to a device. The engine further includes a decoder configured to generate a vector based on the value of the power throttling signal, and clock-gating logic configured to receive both the vector and a clocking signal. The clock-gating logic removes clock edges from the clocking signal based on the vector to generate a throttled clocking signal.
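The decode-then-gate flow can be sketched as follows; the 8-edge window and the evenly spread decoder mapping are hypothetical choices, not taken from the patent:

```python
# Hypothetical sketch of vector-based clock gating: a throttle level
# selects how many edges per 8-cycle window are suppressed. The window
# size and the decoder mapping are illustrative.

def decode(throttle: int, window: int = 8) -> list[int]:
    """Decode a throttle level (0..window) into an enable vector:
    1 keeps the clock edge, 0 gates (removes) it."""
    assert 0 <= throttle <= window
    keep = window - throttle
    # Spread the kept edges as evenly as the window allows.
    return [1 if (i * keep) // window < ((i + 1) * keep) // window else 0
            for i in range(window)]

def gate_clock(clock_edges: list[int], enable: list[int]) -> list[int]:
    """AND each incoming clock edge with the repeating enable vector."""
    n = len(enable)
    return [edge & enable[i % n] for i, edge in enumerate(clock_edges)]
```

Spreading the suppressed edges across the window, rather than gating a contiguous burst, keeps the throttled clock's duty pattern smoother; real gating logic would also have to emit glitch-free gated clocks, which this software model ignores.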
-
3.
Publication number: US20220188109A1
Publication date: 2022-06-16
Application number: US17686682
Filing date: 2022-03-04
Applicant: Marvell Asia Pte Ltd
Inventors: Chia-Hsin Chen, Avinash Sodani, Ulf Hanebutte, Rishan Tan, Soumya Gollamudi
Abstract: A method includes receiving input data at a floating point arithmetic unit configured to perform a floating point arithmetic operation on that data. Before performing the operation, the method determines whether the received input data is a qNaN (quiet not-a-number) or an sNaN (signaling not-a-number). If it is either, the method converts the value of the input data to a modified value before performing the operation, where the conversion eliminates the special handling otherwise associated with performing the floating point arithmetic operation on a qNaN or sNaN input.
-
4.
Publication number: US20210318740A1
Publication date: 2021-10-14
Application number: US16947446
Filing date: 2020-07-31
IPC classification: G06F1/3203, G06F1/10, G06F1/08
Abstract: A system includes a first and a second group of cores in a multicore system, each core configured to process data. Each core in the first/second group enters an idle state after being idle for a first/second period of time, respectively. Every idle core in the first/second group transitions out of the idle state and into an operational mode upon receiving a signal having a first/second value, respectively, provided it also has a pending operation to process.
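The two-group idle/wake behavior can be sketched as a small state machine; the thresholds, wake-signal values, and field names below are hypothetical:

```python
from dataclasses import dataclass

# Hypothetical sketch of the described behavior: a core enters idle
# after its group's idle threshold and wakes only when it sees its
# group's wake value AND has pending work. All constants are illustrative.

THRESHOLD = {1: 100, 2: 1000}   # group-specific idle periods (cycles)
WAKE_VALUE = {1: 0xA, 2: 0xB}   # group-specific wake signal values

@dataclass
class Core:
    group: int            # 1 or 2
    idle_cycles: int = 0
    idle: bool = False
    pending_ops: int = 0

def tick(core: Core) -> None:
    """Advance one cycle with no work; enter idle past the group threshold."""
    core.idle_cycles += 1
    if core.idle_cycles >= THRESHOLD[core.group]:
        core.idle = True

def on_signal(core: Core, value: int) -> None:
    """Wake only if the signal matches the group AND work is pending."""
    if core.idle and value == WAKE_VALUE[core.group] and core.pending_ops > 0:
        core.idle = False
        core.idle_cycles = 0
```

Requiring both the matching signal value and a pending operation means a broadcast wake signal does not needlessly power up cores with nothing to do.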
-
5.
Publication number: US20230096994A1
Publication date: 2023-03-30
Application number: US18075678
Filing date: 2022-12-06
Applicant: Marvell Asia Pte Ltd
Inventors: Avinash Sodani, Ulf Hanebutte, Chia-Hsin Chen
IPC classification: G06N20/00
Abstract: A method of converting data stored in memory from a first format to a second format is disclosed. The method extends the number of bits in data stored in a double data rate (DDR) memory by one bit to form extended data, and determines whether the data stored in the DDR is signed or unsigned. If the data is signed, a sign value is placed in the most significant bit of the extended data and the data is copied into its lower-order bits. If the data is unsigned, the data is copied into the lower-order bits and the most significant bit is set to an unsigned value, e.g., zero. The extended data is stored in an on-chip memory (OCM) of a processing tile of a machine learning computer array.
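The one-bit widening can be sketched directly on raw bit patterns; the 8-bit input width and function name are illustrative assumptions:

```python
# Hypothetical sketch of the described conversion: widen an n-bit raw
# value to n+1 bits, replicating the sign bit for signed data and
# prepending 0 for unsigned data. The 8-bit width is illustrative.

def extend_by_one(value: int, bits: int, signed: bool) -> int:
    """Return the (bits+1)-wide encoding of an n-bit raw value."""
    assert 0 <= value < (1 << bits)
    if signed:
        sign = (value >> (bits - 1)) & 1   # copy the existing sign bit
        return (sign << bits) | value      # MSB = sign, low bits = data
    return value                           # MSB = 0, low bits = data
```

Widening both signed and unsigned inputs into one common (n+1)-bit signed format lets downstream arithmetic treat every operand uniformly, since every unsigned n-bit value is representable as a non-negative (n+1)-bit signed value.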
-
6.
Publication number: US11494676B2
Publication date: 2022-11-08
Application number: US17247826
Filing date: 2020-12-23
Inventors: Avinash Sodani, Ulf Hanebutte, Chia-Hsin Chen
Abstract: A processing unit to support inference acceleration for machine learning (ML) comprises an inline post-processing unit configured to accept and maintain one or more lookup tables for performing each of one or more non-linear mathematical operations. The inline post-processing unit is further configured to accept data from a set of registers maintaining output from a processing block, instead of streaming the data from an on-chip memory (OCM); perform the one or more non-linear mathematical operations on elements of the data via their corresponding lookup tables; and stream the post-processing result back to the OCM once the operations are complete.
-
7.
Publication number: US20220244767A1
Publication date: 2022-08-04
Application number: US17726924
Filing date: 2022-04-22
Applicant: Marvell Asia Pte Ltd
Inventors: Avinash Sodani, Srinivas Sripada, Ramacharan Sundararaman, Chia-Hsin Chen, Nikhil Jayakumar
Abstract: A power throttling engine includes a register configured to receive a power throttling signal, whose value specifies the amount of power throttling applied to a device. The engine further includes a decoder configured to generate a vector based on the value of the power throttling signal, and clock-gating logic configured to receive both the vector and a clocking signal. The clock-gating logic removes clock edges from the clocking signal based on the vector to generate a throttled clocking signal.
-
8.
Publication number: US20220188108A1
Publication date: 2022-06-16
Application number: US17686676
Filing date: 2022-03-04
Applicant: Marvell Asia Pte Ltd
Inventors: Chia-Hsin Chen, Avinash Sodani, Ulf Hanebutte, Rishan Tan, Soumya Gollamudi
Abstract: A method includes receiving input data at a floating point arithmetic unit configured to perform a floating point arithmetic operation on that data to generate an output result. The method determines whether the output result will cause a floating point hardware exception in response to the operation. If so, the method converts the value of the output result to a modified value, where the modified value eliminates the floating point hardware exception.
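The result-side conversion can be sketched by saturating overflowed results and canonicalizing invalid ones; these particular policies, and the single-precision limit used, are illustrative assumptions rather than the patent's actual method:

```python
import math

# Hypothetical sketch: before committing an FP result, detect whether
# it would trigger a hardware exception (overflow or invalid operation
# here) and substitute a modified, representable value. Saturating to
# the largest finite single-precision value is an illustrative policy.

F32_MAX = 3.4028234663852886e38  # largest finite single-precision value

def resolve_exception(result: float) -> float:
    """Map exception-causing results to safe, representable values."""
    if math.isnan(result):                      # invalid-operation case
        return float('nan')                     # pass through as quiet NaN
    if math.isinf(result) or abs(result) > F32_MAX:
        return math.copysign(F32_MAX, result)   # overflow: saturate
    return result                               # ordinary result: unchanged
```

With the exceptional cases resolved to in-range values up front, the downstream pipeline never has to trap or stall on a hardware exception.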
-
9.
Publication number: US20210342734A1
Publication date: 2021-11-04
Application number: US16862549
Filing date: 2020-04-29
Inventors: Avinash Sodani, Ulf Hanebutte, Chia-Hsin Chen
IPC classification: G06N20/00
Abstract: A method of converting data stored in memory from a first format to a second format is disclosed. The method extends the number of bits in data stored in a double data rate (DDR) memory by one bit to form extended data, and determines whether the data stored in the DDR is signed or unsigned. If the data is signed, a sign value is placed in the most significant bit of the extended data and the data is copied into its lower-order bits. If the data is unsigned, the data is copied into the lower-order bits and the most significant bit is set to an unsigned value, e.g., zero. The extended data is stored in an on-chip memory (OCM) of a processing tile of a machine learning computer array.
-
10.
Publication number: US11995569B2
Publication date: 2024-05-28
Application number: US17223921
Filing date: 2021-04-06
Inventors: Avinash Sodani, Ulf Hanebutte, Chia-Hsin Chen
CPC classification: G06N5/04, G06F9/5027, G06F17/16, G06N20/00
Abstract: A processing unit to support inference acceleration for machine learning (ML) comprises an inline post-processing unit configured to accept and maintain one or more lookup tables for performing a tanh and/or sigmoid operation/function. The inline post-processing unit is further configured to accept data from a set of registers configured to maintain output from a processing block, instead of streaming the data from an on-chip memory (OCM); perform the tanh and/or sigmoid operation on each element of the data from the processing block on a per-element basis via the one or more lookup tables; and stream the post-processing result of the per-element tanh and/or sigmoid operation back to the OCM once the operation is complete.