-
1.
公开(公告)号:US20230244492A1
公开(公告)日:2023-08-03
申请号:US18298723
申请日:2023-04-11
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JAGADISH B. KOTRA , JOHN KALAMATIANOS
CPC classification number: G06F9/3836 , G06F9/3001 , G06F9/522 , G06F9/3877
Abstract: Preserving memory ordering between offloaded instructions and non-offloaded instructions is disclosed. An offload instruction for an operation to be offloaded is processed and a lock is placed on a memory address associated with the offload instruction. In response to completing a cache operation targeting the memory address, the lock on the memory address is removed. For multithreaded applications, upon determining that a plurality of processor cores have each begun executing a sequence of offload instructions, the execution of non-offload instructions that are younger than any of the offload instructions is restricted. In response to determining that each processor core has completed executing its sequence of offload instructions, the restriction is removed. The remote device may be, for example, a processing-in-memory device or an accelerator coupled to a memory.
-
公开(公告)号:US20220188233A1
公开(公告)日:2022-06-16
申请号:US17473242
申请日:2021-09-13
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JOHN KALAMATIANOS , JAGADISH B. KOTRA , GAGANDEEP PANWAR
IPC: G06F12/0817
Abstract: A system-on-chip configured for eager invalidation and flushing of cached data used by PIM (Processing-in-Memory) instructions includes: one or more processor cores; one or more caches and an I/O (input/output) die comprising logic to: receive a cache probe request, wherein the cache probe request including a physical memory address associated with a PIM instruction, and the PIM instruction is to be offloaded to a PIM device for execution; and issue, based on the physical memory address, a cache probe to one or more of the caches prior to receiving the PIM instruction for dispatch to the PIM device.
-
公开(公告)号:US20210117195A1
公开(公告)日:2021-04-22
申请号:US16658474
申请日:2019-10-21
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JOHN KALAMATIANOS , KRISHNAN V. RAMANI , SUSUMU MASHIMO
Abstract: Address-based filtering for load/store speculation includes maintaining a filtering table including table entries associated with ranges of addresses; in response to receiving an ordering check triggering transaction, querying the filtering table using a target address of the ordering check triggering transaction to determine if an instruction dependent upon the ordering check triggering transaction has previously been generated a physical address; and in response to determining that the filtering table lacks an indication that the instruction dependent upon the ordering check triggering transaction has previously been generated a physical address, bypassing a lookup operation in an ordering violation memory structure to determine whether the instruction dependent upon the ordering check triggering transaction is currently in-flight.
-
公开(公告)号:US20220206901A1
公开(公告)日:2022-06-30
申请号:US17136549
申请日:2020-12-29
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: SHRIKANTH GANAPATHY , ROSS V. LA FETRA , JOHN KALAMATIANOS , SUDHANVA GURUMURTHI , SHAIZEEN AGA , VILAS SRIDHARAN , MICHAEL IGNATOWSKI , NUWAN JAYASENA
Abstract: Providing host-based error detection capabilities in a remote execution device is disclosed. A remote execution device performs a host-offloaded operation that modifies a block of data stored in memory. Metadata is generated locally for the modified of block of data such that the local metadata generation emulates host-based metadata generation. Stored metadata for the block of data is updated with the locally generated metadata for the modified portion of the block of data. When the host performs an integrity check on the modified block of data using the updated metadata, the host does not distinguish between metadata generated by the host and metadata generated in the remote execution device.
-
公开(公告)号:US20220206855A1
公开(公告)日:2022-06-30
申请号:US17136767
申请日:2020-12-29
Applicant: ADVANCED MICRO DEVICES, INC.
IPC: G06F9/50
Abstract: Offloading computations from a processor to remote execution logic is disclosed. Offload instructions for remote execution on a remote device are dispatched in the form of processor instructions like conventional instructions. In the processor, an offload instruction is inserted in an offload queue. The offload instruction may be inserted at the dispatch stage or the retire stage of the processor pipeline. Metadata for the offload instruction is added to the offload instruction in the offload queue. After retirement of the offload instruction, the processor transmits an offload request generated from the offload instruction.
-
公开(公告)号:US20220100606A1
公开(公告)日:2022-03-31
申请号:US17033398
申请日:2020-09-25
Applicant: Advanced Micro Devices, Inc.
Inventor: SUDHANVA GURUMURTHI , VILAS SRIDHARAN , SHAIZEEN AGA , NUWAN JAYASENA , MICHAEL IGNATOWSKI , SHRIKANTH GANAPATHY , JOHN KALAMATIANOS
Abstract: A memory module includes one or more programmable ECC engines that may be programed by a host processing element with a particular ECC implementation. As used herein, the term “ECC implementation” refers to ECC functionality for performing error detection and subsequent processing, for example using the results of the error detection to perform error correction and to encode corrupted data that cannot be corrected, etc. The approach allows an SoC designer or company to program and reprogram ECC engines in memory modules in a secure manner without having to disclose the particular ECC implementations used by the ECC engines to memory vendors or third parties.
-
公开(公告)号:US20240126552A1
公开(公告)日:2024-04-18
申请号:US18393657
申请日:2023-12-21
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JOHN KALAMATIANOS , MICHAEL T. CLARK , MARIUS EVERS , WILLIAM L. WALKER , PAUL MOYER , JAY FLEISCHMAN , JAGADISH B. KOTRA
CPC classification number: G06F9/30181 , G06F9/30043 , G06F9/30098 , G06F9/30138 , G06F9/3834 , G06F9/3877 , G06F9/52
Abstract: Processor-guided execution of offloaded instructions using fixed function operations is disclosed. Instructions designated for remote execution by a target device are received by a processor. Each instruction includes, as an operand, a target register in the target device. The target register may be an architected virtual register. For each of the plurality of instructions, the processor transmits an offload request in the order that the instructions are received. The offload request includes the instruction designated for remote execution. The target device may be, for example, a processing-in-memory device or an accelerator coupled to a memory.
-
公开(公告)号:US20240111678A1
公开(公告)日:2024-04-04
申请号:US17958120
申请日:2022-09-30
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JAGADISH B. KOTRA , JOHN KALAMATIANOS , PAUL MOYER , GABRIEL H. LOH
IPC: G06F12/0862 , G06F12/0811
CPC classification number: G06F12/0862 , G06F12/0811
Abstract: Systems and methods for pushed prefetching include: multiple core complexes, each core complex having multiple cores and multiple caches, the multiple caches configured in a memory hierarchy with multiple levels; an interconnect device coupling the core complexes to each other and coupling the core complexes to shared memory, the shared memory at a lower level of the memory hierarchy than the multiple caches; and a push-based prefetcher having logic to: monitor memory traffic between caches of a first level of the memory hierarchy and the shared memory; and based on the monitoring, initiate a prefetch of data to a cache of the first level of the memory hierarchy.
-
9.
公开(公告)号:US20230205693A1
公开(公告)日:2023-06-29
申请号:US17564155
申请日:2021-12-28
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JAGADISH B. KOTRA , JOHN KALAMATIANOS , YASUKO ECKERT , YONGHAE KIM
IPC: G06F12/0811
CPC classification number: G06F12/0811
Abstract: Leveraging processing-in-memory (PIM) resources to expedite non-PIM instructions executed on a host is disclosed. In an implementation, a memory controller identifies a first write instruction to write first data to a first memory location, where the first write instruction is not a processing-in-memory (PIM) instruction. The memory controller then writes the first data to a first PIM register. Opportunistically, the memory controller moves the first data from the first PIM register to the first memory location. In another implementation, a memory controller identifies a first memory location associated with a first read instruction, where the first read instruction is not a processing-in-memory (PIM) instruction. The memory controller identifies that a PIM register is associated with the first memory location. The memory controller then reads, in response to the first read instruction, first data from the PIM register.
-
公开(公告)号:US20230077933A1
公开(公告)日:2023-03-16
申请号:US17474372
申请日:2021-09-14
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: JOHN KALAMATIANOS , NUWAN JAYASENA
Abstract: A processor for supporting PIM (Processing-in-Memory) execution in a multiprocessing environment includes logic configured to: receive a request to initiate an offload of a number of PIM instructions to a PIM device. The request is issued by a first thread of a processor. The logic is also configured to reserve, based on information in the request, resources of the PIM device for execution of the plurality of instructions.
-
-
-
-
-
-
-
-
-