-
公开(公告)号:US20240220831A1
公开(公告)日:2024-07-04
申请号:US18149248
申请日:2023-01-03
Applicant: Nvidia Corporation
Inventor: J Wyman , Pritish Nahar , Dana Groff
Abstract: Approaches presented herein provide for the management of artificial intelligence (AI)-related resources in a distributed resource environment, such as may be used to support accelerated machine learning (ML) applications on behalf of different users. Management functionality can be provided using an AI manager, such as a management service, that can determine the requirements, capabilities, and limitations of various available AI-related components, such as those of a plurality of AI models, engines, and accelerators, as well as the hardware (e.g., graphics processing units (GPUs)) that run or make up these AI-related resources. An AI manager can determine a selection and configuration of resources that is not only appropriate for use with a specific AI model, but that can also be optimized for factors such as throughput, resource utilization, and inference latency. An AI manager can ensure compatibility of resources and configuration, and can enforce access control to models and data.
-
公开(公告)号:US20240185100A1
公开(公告)日:2024-06-06
申请号:US18076221
申请日:2022-12-06
Applicant: NVIDIA Corporation
Inventor: Alvin Ihsani , Shaul Arazi , Elena Agostini , Penn Tasinga , Carl Everett Lacey, JR. , Dana Groff , Dotan David Levi , Wojciech Wasko , Vishwesh Nath , Sachidanand Alle
Abstract: Methods and systems for obtaining data having a first format, converting the data to a second format, storing the converted data in memory accessible by at least one parallel processing unit, and processing the converted data stored in the memory using the at least one parallel processing unit.
-