-
公开(公告)号:US12067642B2
公开(公告)日:2024-08-20
申请号:US17030024
申请日:2020-09-23
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Niti Madan , Michael L. Chu , Ashwin Aji
CPC classification number: G06T1/60 , G06F3/0604 , G06F3/0631 , G06F3/0679 , G06F9/5016 , G06T1/20
Abstract: One or more processing units, such as a graphics processing unit (GPU), execute an application. A resource manager selectively allocates a first memory portion or a second memory portion to the processing units based on memory access characteristics. The first memory portion has a first latency that is lower that a second latency of the second memory portion. In some cases, the memory access characteristics indicate a latency sensitivity. In some cases, hints included in corresponding program code are used to determine the memory access characteristics. The memory access characteristics can also be determined by monitoring memory access requests, measuring a cache miss rate or a row buffer miss rate for the monitored memory access requests, and determining the memory access characteristics based on the cache miss rate or the row buffer miss rate.
-
公开(公告)号:US12066965B2
公开(公告)日:2024-08-20
申请号:US16863149
申请日:2020-04-30
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: SeyedMohammad Seyedzadehdelcheh , Steven Raasch , Sergey Blagodurov
CPC classification number: G06F13/4027 , G06F13/4282 , H03M7/30 , H03M9/00
Abstract: Data are serially communicated over an interconnect between an encoder and a decoder. The encoder includes a first training unit to count a frequency of symbol values in symbol blocks of a set of N number of symbol blocks in an epoch. A circular shift unit of the encoder stores a set of most-recently-used (MRU) amplitude values. An XOR unit is coupled to the first training unit and the first circular shift unit as inputs and to the interconnect as output. A transmitter is coupled to the encoder XOR unit and the interconnect and thereby contemporaneously sends symbols and trains on the symbols. In a system, a device includes a receiver and decoder that receive, from the encoder, symbols over the interconnect. The decoder includes its own training unit for decoding the transmitted symbols.
-
公开(公告)号:US12066944B2
公开(公告)日:2024-08-20
申请号:US16723780
申请日:2019-12-20
Applicant: Advanced Micro Devices, Inc.
Inventor: Vydhyanathan Kalyanasundharam , Amit P. Apte
IPC: G06F12/0815 , G06F12/0817 , G06F12/0888 , G06F12/0891
CPC classification number: G06F12/0815 , G06F12/0817 , G06F12/0888 , G06F12/0891 , G06F2212/608
Abstract: A coherency management device receives requests to read data from or write data to an address in a main memory. On a write, if the data includes zero data, an entry corresponding to the memory address is created in a cache directory if it does not already exist, is set to an invalid state, and indicates that the data includes zero data. The zero data is not written to main memory or a cache. On a read, the cache directory is checked for an entry corresponding to the memory address. If the entry exists in the cache directory, is invalid, and includes an indication that data corresponding to the memory address includes zero data, the coherency management device returns zero data in response to the request without fetching the data from main memory or a cache.
-
公开(公告)号:US12056522B2
公开(公告)日:2024-08-06
申请号:US17530936
申请日:2021-11-19
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Michael L. Golden , Paul Blinzer , Magiting M. Talisayon , Srikanth Masanam , Ripal Butani , Upasanah Swaminathan
CPC classification number: G06F9/4881 , G06F2209/482
Abstract: Within a processing system, thread count asymmetries manifest when one or more cores of a processing device are disabled. To determine such thread count asymmetries, discovery operations are performed to determine thread count asymmetries for one or more hierarchy levels of a processing device based on a number of threads per enumerated instance within the hierarchy level. In response to the determining a thread count asymmetry, one thread identifier for each enumerated instance within the asymmetric hierarchy level is defined to determine a representation of the asymmetry. Using the representation of the symmetry, software tasks associated with one or more application within the processing system are performed.
-
公开(公告)号:US20240256005A1
公开(公告)日:2024-08-01
申请号:US18104736
申请日:2023-02-01
Applicant: Advanced Micro Devices, Inc.
Inventor: Andrew Thomas JUNKINS
CPC classification number: G06F1/185 , H05K7/1417
Abstract: A load frame with a load balance clamp and electronic devices having the same are disclosed herein. The load frame is configured to simultaneously apply clamping forces to opposite sides an integrated circuit (IC) device, thus reducing potential damage to components of the IC device.
-
公开(公告)号:US12052153B2
公开(公告)日:2024-07-30
申请号:US16118848
申请日:2018-08-31
Applicant: Advanced Micro Devices, Inc. , ATI Technologies ULC
Inventor: Alexander J. Branover , Thomas James Gibney , Michael J. Tresidder , Nat Barbiero
IPC: H04L43/0876 , G06F1/3234 , G06F1/3237 , G06F1/324 , G06F1/3296 , H04L41/0813
CPC classification number: H04L43/0876 , G06F1/3237 , G06F1/324 , G06F1/3253 , G06F1/3296 , H04L41/0813 , Y02D10/00
Abstract: Systems, apparatuses, and methods for enabling localized control of link states in a computing system are disclosed. A computing system includes at least a host processor, a communication fabric, one or more devices, one or more links, and a local link controller to monitor the one or more links. In various implementations, the local link controller detects and controls states of a link without requiring communication with, or intervention by, the host processor. In various implementations, this local control by the link controller includes control over the clock signals provided to the link. For example, the local link controller can directly control the frequency of a clock supplied to the link. In addition, in various implementations the link controller controls the power supplied to the link. For example, the link controller can control the voltage supplied to the link.
-
公开(公告)号:US12039337B2
公开(公告)日:2024-07-16
申请号:US17032494
申请日:2020-09-25
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Robert B. Cohen , Tzu-Wei Lin , Anthony J. Bybell , Bill Kai Chiu Kwan , Frank C. Galloway
CPC classification number: G06F9/3804 , G06F9/30058 , G06F9/3822 , G06F9/3867
Abstract: A processor employs a plurality of fetch and decode pipelines by dividing an instruction stream into instruction blocks with identified boundaries. The processor includes a branch predictor that generates branch predictions. Each branch prediction corresponds to a branch instruction and includes a prediction that the corresponding branch is to be taken or not taken. In addition, each branch prediction identifies both an end of the current branch prediction window and the start of another branch prediction window. Using these known boundaries, the processor provides different sequential fetch streams to different ones of the plurality of fetch and decode states, which concurrently process the instructions of the different fetch streams, thereby improving overall instruction throughput at the processor.
-
公开(公告)号:US12038847B2
公开(公告)日:2024-07-16
申请号:US17952933
申请日:2022-09-26
Applicant: Advanced Micro Devices, Inc.
Inventor: William A. Moyes
IPC: G06F12/1009 , G06F12/0811
CPC classification number: G06F12/1009 , G06F12/0811
Abstract: A/D bit storage, processing, and mode management techniques through use of a dense A/D bit representation are described. In one example, a memory management unit employs an A/D bit representation generation module to generate the dense A/D bit representation. In an implementation, the A/D bit representation is stored adjacent to existing page table structures of the multilevel page table hierarchy. In another example, memory management unit supports use of modes as part of A/D bit storage.
-
公开(公告)号:US12033239B2
公开(公告)日:2024-07-09
申请号:US17563950
申请日:2021-12-28
Applicant: Advanced Micro Devices, Inc.
Inventor: Priyadarshi Sharma , Anshuman Mittal , Saurabh Sharma
IPC: G06T1/20 , G06F12/0891 , G06T1/60
CPC classification number: G06T1/60 , G06F12/0891 , G06T1/20 , G06F2212/455
Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.
-
公开(公告)号:US12032967B2
公开(公告)日:2024-07-09
申请号:US17845938
申请日:2022-06-21
Applicant: Advanced Micro Devices, Inc.
Inventor: Matthäus G. Chajdas , Christopher J. Brennan
CPC classification number: G06F9/3887 , G06F9/3012 , G06F9/4881 , G06F9/5016
Abstract: Devices and methods for partial sorting for coherence recovery are provided. The partial sorting is efficiently executed by utilizing existing hardware along the memory path (e.g., memory local to the compute unit). The devices include an accelerated processing device which comprises memory and a processor. The processor is, for example, a compute unit of a GPU which comprises a plurality of SIMD units and is configured to determine, for data entries each comprising a plurality of bits, a number of occurrences of different types of the data entries by storing the number of occurrences in one or more portions of the memory local to the processor, sort the data entries based on the determined number of occurrences stored in the one or more portions of the memory local to the processor and execute the sorted data entries.
-
-
-
-
-
-
-
-
-