-
公开(公告)号:US20240185100A1
公开(公告)日:2024-06-06
申请号:US18076221
申请日:2022-12-06
Applicant: NVIDIA Corporation
Inventor: Alvin Ihsani , Shaul Arazi , Elena Agostini , Penn Tasinga , Carl Everett Lacey, JR. , Dana Groff , Dotan David Levi , Wojciech Wasko , Vishwesh Nath , Sachidanand Alle
Abstract: Methods and systems for obtaining data having a first format, converting the data to a second format, storing the converted data in memory accessible by at least one parallel processing unit, and processing the converted data stored in the memory using the at least one parallel processing unit.
-
公开(公告)号:US20230144662A1
公开(公告)日:2023-05-11
申请号:US17522562
申请日:2021-11-09
Applicant: NVIDIA Corporation
Inventor: Penn Tasinga , David B. Yastremsky
CPC classification number: G06F9/5088 , G06F9/5077 , G06N3/10 , G06N5/04
Abstract: Apparatuses, systems, and techniques to partition neural networks. In at least one embodiment, one or more circuits are to cause one or more neural networks to be dynamically partitioned based, at least in part, on one or more performance metrics of the one or more neural networks.
-
公开(公告)号:US20220180178A1
公开(公告)日:2022-06-09
申请号:US17115631
申请日:2020-12-08
Applicant: NVIDIA Corporation
Inventor: Penn Tasinga , David B. Yastremsky , Jeremy Wyman , Alvin Ihsani , Pritish Nahar , Piyush Bhatt
Abstract: Apparatuses, systems, and techniques to allocate computing resources to perform inferences. In at least one embodiment, one or more neural networks cause computing resources to be identified based, at least in part, on performance requirements of one or more neural networks to perform inferences.
-
-