-
1.
公开(公告)号:US20230205837A1
公开(公告)日:2023-06-29
申请号:US17561227
申请日:2021-12-23
Applicant: Advanced Micro Devices, Inc.
Inventor: Laurent S. White , Ganesh Dasika , Saketh Venkata Rama
Abstract: A physical system is simulated using a model including a plurality of elements in a mesh or grid. The elements are divided into partitions processed by different processing units. For some time steps, flux data is transmitted between partitions for updating the state of edge elements of the partitions. Periodically, transmission of flux data is suppressed and flux data is obtained by linear interpolation based on past flux data. Alternatively, flux data is obtained by processing state variables of an edge element and past flux data using a machine learning model, such as a DNN.
-
2.
公开(公告)号:US20250005236A1
公开(公告)日:2025-01-02
申请号:US18344544
申请日:2023-06-29
Applicant: Advanced Micro Devices, Inc.
Inventor: Laurent S. White , Darian Osahar Nwankwo , Gurpreet Singh Hora
IPC: G06F30/27
Abstract: Method and devices are provided for performing a physics-based simulation. A processing devices comprises memory and a processor. The processor is configured to perform a physics-based simulation by executing a portion of the physics-based simulation, training a neural network model based on results from executing the first portion of the physics-based simulation, performing inference processing based on the results of the training of the neural network model and providing a prediction, based on the inference processing, as an input back to the physics-based simulation.
-
公开(公告)号:US20240119198A1
公开(公告)日:2024-04-11
申请号:US17958058
申请日:2022-09-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Laurent S. White , Johnathan Alsop , Ganesh Dasika
CPC classification number: G06F30/23 , G06F30/27 , G06F2119/02
Abstract: A physical system is simulated using a model including a plurality of elements in a mesh or grid. The elements are divided into partitions processed by different processing units. For some time steps, state data is transmitted between partitions and used to calculate flux data for updating the state of edge elements of the partitions. Periodically, transmission of state data is suppressed, and flux data is obtained by linear interpolation based on past flux data. Alternatively, flux data is obtained by processing state variables of an edge element and past flux data using a machine learning model, such as a DNN. Whether to suppress transmission of state data may be determined based on one or both of (a) uncertainty in an output of the machine learning model (e.g., Bayesian neural network) and (b) complexity of model of the physical system (e.g., spatial or temporal gradients).
-
公开(公告)号:US20240004653A1
公开(公告)日:2024-01-04
申请号:US17853613
申请日:2022-06-29
Applicant: Advanced Micro Devices, Inc.
Inventor: Johnathan Alsop , Laurent S. White , Shaizeen Aga
IPC: G06F9/30
CPC classification number: G06F9/3009 , G06F9/3004 , G06F9/30101
Abstract: An approach is provided for managing near-memory processing commands (“PIM commands”) from multiple processor threads in a manner to prevent interference and maintain correctness at near-memory processing elements. A memory controller uses thread identification information and last command information to issue a PIM command sequence from a first processor thread, directed to a PIM-enabled memory element, while deferring the issuance of PIM command sequences from other processor threads, directed to the same PIM-enabled memory element. After the last PIM command in the PIM command sequence for the first processor thread has been issued, a PIM command sequence for another processor thread is issued, and so on. The approach allows multiple processor threads to concurrently issue fine grained PIM commands to the same PIM-enabled memory element without having to be aware of address-to-memory element mapping, and without having to coordinate with other threads.
-
公开(公告)号:US20230186149A1
公开(公告)日:2023-06-15
申请号:US17550882
申请日:2021-12-14
Applicant: Advanced Micro Devices, Inc.
Inventor: Saketh Venkata Rama , Ganesh Dasika , Laurent S. White
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: An approach is provided for using machine learning to provide compensation for roundoff error in algorithmic computations. The approach includes training a machine learning model based low precision data and corresponding high precision data. The low precision data includes pairs of low precision values of a specific datatype that correspond to pairs of high precision values from the high precision data. The high precision data includes pairs of high precision values of a specific datatype that correspond to the pairs of low precision values from the low precision data. When the machine learning model has been trained, the machine learning model is used as a basis for determining a compensation value is used to compensate for roundoff error in a particular algorithmic computation. Techniques discussed herein provide compensation for roundoff error during otherwise unstable computations, enabling high-performance computing and other scientific applications to use lower precision data types more readily.
-
-
-
-