-
公开(公告)号:US20240004584A1
公开(公告)日:2024-01-04
申请号:US17855109
申请日:2022-06-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Niti Madan , Yasuko Eckert , Varun Agrawal , John Kalamatianos
IPC: G06F3/06
CPC classification number: G06F3/0659 , G06F3/0653 , G06F3/0679 , G06F3/0604
Abstract: In accordance with described techniques for DRAM row management for processing in memory, a plurality of instructions are obtained for execution by a processing in memory component embedded in a dynamic random access memory. An instruction is identified that last accesses a row of the dynamic random access memory, and a subsequent instruction is identified that first accesses an additional row of the dynamic random access memory. A first command is issued to close the row and a second command is issued to open the additional row after the row is last accessed by the instruction.
-
公开(公告)号:US11847061B2
公开(公告)日:2023-12-19
申请号:US17385783
申请日:2021-07-26
Applicant: Advanced Micro Devices, Inc.
Inventor: Shaizeen Aga , Nuwan Jayasena , John Kalamatianos
IPC: G06F12/08 , G06F13/16 , G06F12/02 , G06F12/0891 , G06F12/0811
CPC classification number: G06F12/0891 , G06F12/0238 , G06F12/0811 , G06F13/1668
Abstract: A technical solution to the technical problem of how to support memory-centric operations on cached data uses a novel memory-centric memory operation that invokes write back functionality on cache controllers and memory controllers. The write back functionality enforces selective flushing of dirty, i.e., modified, cached data that is needed for memory-centric memory operations from caches to the completion level of the memory-centric memory operations, and updates the coherence state appropriately at each cache level. The technical solution ensures that commands to implement the selective cache flushing are ordered before the memory-centric memory operation at the completion level of the memory-centric memory operation.
-
123.
公开(公告)号:US11714652B2
公开(公告)日:2023-08-01
申请号:US17384646
申请日:2021-07-23
Applicant: Advanced Micro Devices, Inc.
Inventor: John Kalamatianos , Ganesh Dasika
CPC classification number: G06F9/3832 , G06F9/3001 , G06F9/30181
Abstract: A processing device includes a zero detection circuit to determine that an operand of a first instruction is zero and instruction conversion logic coupled with the zero detection circuit to, in response to the zero detection circuit determining that the operand is zero, convert the first instruction to a register move instruction executable by the processing device.
-
公开(公告)号:US20230161710A1
公开(公告)日:2023-05-25
申请号:US18158212
申请日:2023-01-23
Applicant: Advanced Micro Devices, Inc.
Inventor: Alexander D. Breslow , John Kalamatianos
IPC: G06F12/0895 , H03M7/30
CPC classification number: G06F12/0895 , H03M7/3088 , G06F2212/1044 , G06F2212/608
Abstract: Systems, apparatuses, and methods for implementing flexible dictionary sharing techniques for caches are disclosed. A set-associative cache includes a dictionary for each data array set. When a cache line is to be allocated in the cache, a cache controller determines to which set a base index of the cache line address maps. Then, a selector unit determines which dictionary of a group of dictionaries stored by those sets neighboring this set would achieve the most compression for the cache line. This dictionary is then selected to compress the cache line. An offset is added to the base index of the cache line to generate a full index in order to map the cache line to the set corresponding to this chosen dictionary. The compressed cache line is stored in this set with the chosen dictionary, and the offset is stored in the corresponding tag array entry.
-
公开(公告)号:US11645073B2
公开(公告)日:2023-05-09
申请号:US17238844
申请日:2021-04-23
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: John Kalamatianos , Krishnan V. Ramani , Susumu Mashimo
CPC classification number: G06F9/30043 , G06F9/35 , G06F9/3842
Abstract: Address-based filtering for load/store speculation includes maintaining a filtering table including table entries associated with ranges of addresses; in response to receiving an ordering check triggering transaction, querying the filtering table using a target address of the ordering check triggering transaction to determine if an instruction dependent upon the ordering check triggering transaction has previously been generated a physical address; and in response to determining that the filtering table lacks an indication that the instruction dependent upon the ordering check triggering transaction has previously been generated a physical address, bypassing a lookup operation in an ordering violation memory structure to determine whether the instruction dependent upon the ordering check triggering transaction is currently in-flight.
-
公开(公告)号:US11487671B2
公开(公告)日:2022-11-01
申请号:US16446119
申请日:2019-06-19
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Xianwei Zhang , John Kalamatianos , Bradford Beckmann
IPC: G06F12/0891 , G06F12/0888 , G06F12/0895 , G06F9/54 , G06F9/38
Abstract: Wavefront loading in a processor is managed and includes monitoring a selected wavefront of a set of wavefronts. Reuse of memory access requests for the selected wavefront is counted. A cache hit rate in one or more caches of the processor is determined based on the counted reuse. Based on the cache hit rate, subsequent memory requests of other wavefronts of the set of wavefronts are modified by including a type of reuse of cache lines in requests to the caches. In the caches, storage of data in the caches is based on the type of reuse indicated by the subsequent memory access requests. Reused cache lines are protected by preventing cache line contents from being replaced by another cache line for a duration of processing the set of wavefronts. Caches are bypassed when streaming access requests are made.
-
公开(公告)号:US11442727B2
公开(公告)日:2022-09-13
申请号:US16895825
申请日:2020-06-08
Applicant: Advanced Micro Devices, Inc.
Inventor: Varun Agrawal , John Kalamatianos
Abstract: An electronic device includes a processor, a branch predictor in the processor, and a predictor controller in the processor. The branch predictor includes multiple prediction functional blocks, each prediction functional block configured for generating predictions for control transfer instructions (CTIs) in program code based on respective prediction information, the branch predictor configured to select, from among predictions generated by the prediction functional blocks for each CTI, a selected prediction to be used for that CTI. The predictor controller keeps a record of prediction functional blocks from which the branch predictor previously selected predictions for CTIs. The predictor controller uses information from the record for controlling which prediction functional blocks are used by the branch predictor for generating predictions for CTIs.
-
公开(公告)号:US20220239315A1
公开(公告)日:2022-07-28
申请号:US17722931
申请日:2022-04-18
Applicant: Advanced Micro Devices, Inc.
Inventor: Alexander D. Breslow , Nuwan Jayasena , John Kalamatianos
Abstract: A data processing platform, method, and program product perform compression and decompression of a set of data items. Suffix data and a prefix are selected for each respective data item in the set of data items based on data content of the respective data item. The set of data items is sorted based on the prefixes. The prefixes are encoded by querying multiple encoding tables to create a code word containing compressed information representing values of all prefixes for the set of data items. The code word and suffix data for each of the data items are stored in memory. The code word is decompressed to recover the prefixes. The recovered prefixes are paired with their respective suffix data.
-
公开(公告)号:US20220206899A1
公开(公告)日:2022-06-30
申请号:US17133843
申请日:2020-12-24
Applicant: Advanced Micro Devices, Inc.
Inventor: John Kalamatianos , Nuwan Jayasena , Sudhanva Gurumurthi , Shaizeen Aga , Shrikanth Ganapathy
Abstract: Methods and processing devices are provided for error protection to support instruction replay for executing idempotent instructions at a processing in memory PIM device. The processing apparatus includes a PIM device configured to execute an idempotent instruction. The processing apparatus also includes a processor, in communication with the PIM device, configured to issue the idempotent instruction to the PIM device for execution at the PIM device and reissue the idempotent instruction to the PIM device when one of execution of the idempotent instruction at the PIM device results in an error and a predetermined latency period expires from when the idempotent instruction is issued.
-
公开(公告)号:US20220197809A1
公开(公告)日:2022-06-23
申请号:US17133581
申请日:2020-12-23
Applicant: Advanced Micro Devices, Inc.
Inventor: Furkan Eris , Paul S. Keltcher , John Kalamatianos , Mayank Chhablani , Alok Garg
IPC: G06F12/0862 , G06N20/00 , G06F16/901
Abstract: Techniques for identifying a hardware configuration for operation are disclosed. The techniques include applying feature measurements to a trained model; obtaining output values from the trained model, the output values corresponding to different hardware configurations; and operating according to the output values, wherein the output values include one of a certainty score, a ranking, or a regression value.
-
-
-
-
-
-
-
-
-