-
公开(公告)号:US20230153218A1
公开(公告)日:2023-05-18
申请号:US17526218
申请日:2021-11-15
Applicant: Advanced Micro Devices, Inc.
Inventor: Shrikanth Ganapathy , Yasuko Eckert , Anthony Gutierrez , Karthik Ramu Sangaiah , Vedula Venkata Srikant Bharadwaj
CPC classification number: G06F11/3051 , G06F15/80 , G06F11/3024
Abstract: A processor includes a controller and plurality of chiplets, each chiplet including a plurality of processor cores. The controller provides chiplet-level performance information for the chiplets that identifies a performance of each chiplet at each of a plurality of performance levels for specified sets of processor cores on that chiplet. The controller receives an identification of one or more selected chiplets from among the plurality of chiplets for which a specified number of processor cores are to be configured at a given performance level, the one or more selected chiplets having been selected based on the chiplet-level performance information and performance requirements. The controller configures the specified number of processor cores of the one or more selected chiplets at the given performance level. A task is then run on the specified number of processor cores of the one or more selected chiplets at the given performance level.
-
公开(公告)号:US12131186B2
公开(公告)日:2024-10-29
申请号:US17993490
申请日:2022-11-23
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Anthony Gutierrez , Sooraj Puthoor
CPC classification number: G06F9/4881 , G06F9/3877 , G06F9/542 , G06F9/545 , G06F9/546
Abstract: A processor core is configured to execute a parent task that is described by a data structure stored in a memory. A coprocessor is configured to dispatch a child task to the at least one processor core in response to the coprocessor receiving a request from the parent task concurrently with the parent task executing on the at least one processor core. In some cases, the parent task registers the child task in a task pool and the child task is a future task that is configured to monitor a completion object and enqueue another task associated with the future task in response to detecting the completion object. The future task is configured to self-enqueue by adding a continuation future task to a continuation queue for subsequent execution in response to the future task failing to detect the completion object.
-
公开(公告)号:US11875425B2
公开(公告)日:2024-01-16
申请号:US17134904
申请日:2020-12-28
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Sooraj Puthoor , Bradford Beckmann , Nuwan Jayasena , Anthony Gutierrez
CPC classification number: G06T1/20 , G06F9/30036 , G06F9/3836 , G06F9/3877 , G06F9/3887 , G06F9/545 , G06T2210/52
Abstract: Implementing heterogeneous wavefronts on a graphics processing unit (GPU) is disclosed. A scheduler assigns heterogeneous wavefronts for execution on a compute unit of a processing device. The heterogeneous wavefronts include different types of wavefronts such as vector compute wavefronts and service-level wavefronts that vary in resource requirements and instruction sets. As one example, heterogeneous wavefronts may include scalar wavefronts and vector compute wavefronts that execute on scalar units and vector units, respectively. Distinct sets of instructions are executed for the heterogeneous wavefronts on the compute unit. Heterogeneous wavefronts are processed in the same pipeline of the processing device.
-
公开(公告)号:US20220100563A1
公开(公告)日:2022-03-31
申请号:US17037727
申请日:2020-09-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Anthony Gutierrez , Vedula Venkata Srikant Bharadwaj , Yasuko Eckert , Mark H. Oskin
IPC: G06F9/50 , G06F9/38 , G06F9/30 , G06F9/4401
Abstract: A dynamically configurable overprovisioned microprocessor optimally supports a variety of different compute application workloads and with the capability to tradeoff among compute performance, energy consumption, and clock frequency on a per-compute application basis, using general-purpose microprocessor designs. In some embodiments, the overprovisioned microprocessor comprises a physical compute resource and a dynamic configuration logic configured to: detect an activation-warranting operating condition; undarken the physical compute resource responsive to detecting the activation-warranting operating condition; detect a configuration-warranting operating condition; and configure the overprovisioned microprocessor to use the undarkened physical compute resource responsive to detecting the configuration-warranting operating condition.
-
公开(公告)号:US20250004963A1
公开(公告)日:2025-01-02
申请号:US18217079
申请日:2023-06-30
Applicant: Advanced Micro Devices, Inc.
Inventor: William Peter Ehrett , Anthony Gutierrez , Vedula Venkata Srikant Bharadwaj , Karthik Ramu Sangaiah , Prachi Shukla , Sriseshan Srikanth , Ganesh Dasika , John Kalamatianos
IPC: G06F13/36
Abstract: A semiconductor device, referred to herein as a Globally Interconnected Operations (GIO) layer, provides global operations in the form of global data reduction for one or more PE arrays. The GIO layer includes processing elements that perform global data reduction on processing results from one or more PE arrays. The GIO layer includes connectors that allow it to be arranged in a 3D stack with one or more PE arrays, for example, on top of or beneath a PE array. This allows reduction operations to be implemented across PE arrays using an efficient topology with superior flexibility, scalability, latency and/or power characteristics that is customizable for particular use cases at assembly time, without requiring costly and time-consuming redesign of PE arrays, and without being constrained by particular PE array designs.
-
公开(公告)号:US20230153672A1
公开(公告)日:2023-05-18
申请号:US17840417
申请日:2022-06-14
Applicant: Advanced Micro Devices, Inc.
Inventor: Salonik Resch , Anthony Gutierrez , Mark H. Oskin
IPC: G06N10/20
CPC classification number: G06N10/20
Abstract: An electronic device includes a quantum processor including a plurality of qubits. The quantum processor runs a plurality of instances of a quantum program using a separate set of qubits from among the qubits for each instance of the quantum program. The quantum processor then sets quantum states for ancilla qubits from among the qubits based on quantum states of respective groups of associated qubits from the separate sets of qubits. The quantum processor next provides an output of the instances of the quantum program based on the quantum states of the ancilla qubits.
-
17.
公开(公告)号:US11586563B2
公开(公告)日:2023-02-21
申请号:US17130604
申请日:2020-12-22
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Max Ruttenberg , Vendula Venkata Srikant Bharadwaj , Yasuko Eckert , Anthony Gutierrez , Mark H. Oskin
IPC: G06F13/16 , G11C11/4076
Abstract: A processor distributes memory timing parameters and data among different memory modules based upon memory access patterns. The memory access patterns indicate different types, or classes, of data for an executing workload, with each class associated with different memory access characteristics, such as different row buffer hit rate levels, different frequencies of access, different criticalities, and the like. The processor assigns each memory module to a data class and sets the memory timing parameters for each memory module according to the module's assigned data class, thereby tailoring the memory timing parameters for efficient access of the corresponding data.
-
公开(公告)号:US10963299B2
公开(公告)日:2021-03-30
申请号:US16134695
申请日:2018-09-18
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Anthony Gutierrez , Sooraj Puthoor
Abstract: A processor core is configured to execute a parent task that is described by a data structure stored in a memory. A coprocessor is configured to dispatch a child task to the at least one processor core in response to the coprocessor receiving a request from the parent task concurrently with the parent task executing on the at least one processor core. In some cases, the parent task registers the child task in a task pool and the child task is a future task that is configured to monitor a completion object and enqueue another task associated with the future task in response to detecting the completion object. The future task is configured to self-enqueue by adding a continuation future task to a continuation queue for subsequent execution in response to the future task failing to detect the completion object.
-
-
-
-
-
-
-