Memory latency-aware GPU architecture

    Publication number: US12067642B2

    Publication date: 2024-08-20

    Application number: US17030024

    Filing date: 2020-09-23

    Abstract: One or more processing units, such as a graphics processing unit (GPU), execute an application. A resource manager selectively allocates a first memory portion or a second memory portion to the processing units based on memory access characteristics. The first memory portion has a first latency that is lower than a second latency of the second memory portion. In some cases, the memory access characteristics indicate a latency sensitivity. In some cases, hints included in corresponding program code are used to determine the memory access characteristics. The memory access characteristics can also be determined by monitoring memory access requests, measuring a cache miss rate or a row buffer miss rate for the monitored memory access requests, and determining the memory access characteristics based on the cache miss rate or the row buffer miss rate.
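The allocation policy in the abstract can be sketched as follows. This is a minimal illustrative model, not the patented implementation: the class name, the pool sizes, and the miss-rate threshold are all assumptions, and it treats a high cache miss rate as the signal of latency sensitivity.

```python
# Illustrative sketch of latency-aware allocation. All names and numbers
# (ResourceManager, MISS_RATE_THRESHOLD, pool sizes) are hypothetical.

MISS_RATE_THRESHOLD = 0.2  # assumed cutoff for "latency sensitive"

class ResourceManager:
    def __init__(self):
        self.low_latency_free = 4    # pages in the lower-latency first portion
        self.high_latency_free = 64  # pages in the higher-latency second portion

    def classify(self, accesses, cache_misses, hint=None):
        """Derive memory access characteristics from a program hint,
        or from a measured cache miss rate when no hint is given."""
        if hint is not None:
            return hint  # hint embedded in program code takes priority
        miss_rate = cache_misses / max(accesses, 1)
        return "latency_sensitive" if miss_rate > MISS_RATE_THRESHOLD else "latency_tolerant"

    def allocate(self, accesses, cache_misses, hint=None):
        """Place latency-sensitive allocations in the low-latency portion
        while space remains; everything else goes to the second portion."""
        kind = self.classify(accesses, cache_misses, hint)
        if kind == "latency_sensitive" and self.low_latency_free > 0:
            self.low_latency_free -= 1
            return "low_latency"
        self.high_latency_free -= 1
        return "high_latency"
```

A row buffer miss rate could replace the cache miss rate in `classify` with the same structure.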

    Zero value memory compression
    Invention grant

    Publication number: US12066944B2

    Publication date: 2024-08-20

    Application number: US16723780

    Filing date: 2019-12-20

    Abstract: A coherency management device receives requests to read data from or write data to an address in a main memory. On a write, if the data includes zero data, an entry corresponding to the memory address is created in a cache directory if it does not already exist, is set to an invalid state, and indicates that the data includes zero data. The zero data is not written to main memory or a cache. On a read, the cache directory is checked for an entry corresponding to the memory address. If the entry exists in the cache directory, is invalid, and includes an indication that data corresponding to the memory address includes zero data, the coherency management device returns zero data in response to the request without fetching the data from main memory or a cache.
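The zero-data read/write flow described above can be modeled compactly. This is a hypothetical sketch: the class and field names are illustrative, and a Python dict stands in for the cache directory and main memory.

```python
class CoherencyDirectory:
    """Illustrative model of the zero-data directory behavior: zero writes
    create an invalid directory entry flagged as zero data, and zero reads
    are satisfied from the directory without touching memory or caches."""

    def __init__(self):
        self.entries = {}      # addr -> {"state": ..., "zero": bool}
        self.main_memory = {}  # backing store, used only for non-zero data

    def write(self, addr, data):
        if all(b == 0 for b in data):
            # Record zero data in the directory only; skip memory and caches.
            self.entries[addr] = {"state": "invalid", "zero": True}
        else:
            self.entries.pop(addr, None)
            self.main_memory[addr] = data

    def read(self, addr, size):
        entry = self.entries.get(addr)
        if entry and entry["state"] == "invalid" and entry["zero"]:
            return bytes(size)  # synthesize zeros without a memory fetch
        return self.main_memory[addr]
```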

    Processor with multiple fetch and decode pipelines

    Publication number: US12039337B2

    Publication date: 2024-07-16

    Application number: US17032494

    Filing date: 2020-09-25

    CPC classification number: G06F9/3804 G06F9/30058 G06F9/3822 G06F9/3867

    Abstract: A processor employs a plurality of fetch and decode pipelines by dividing an instruction stream into instruction blocks with identified boundaries. The processor includes a branch predictor that generates branch predictions. Each branch prediction corresponds to a branch instruction and includes a prediction that the corresponding branch is to be taken or not taken. In addition, each branch prediction identifies both an end of the current branch prediction window and the start of another branch prediction window. Using these known boundaries, the processor provides different sequential fetch streams to different ones of the plurality of fetch and decode pipelines, which concurrently process the instructions of the different fetch streams, thereby improving overall instruction throughput at the processor.
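The stream-splitting idea can be sketched in software. This is an illustrative model under stated assumptions: integer indices stand in for instructions, the sorted boundary indices stand in for predicted branch-prediction-window edges, and streams are handed to pipelines round-robin.

```python
def split_fetch_streams(instructions, boundaries):
    """Split a linear instruction stream into blocks at the boundaries
    identified by branch predictions (indices are illustrative)."""
    streams, start = [], 0
    for end in sorted(boundaries):
        streams.append(instructions[start:end])
        start = end
    streams.append(instructions[start:])
    return [s for s in streams if s]

def dispatch(streams, n_pipes=2):
    """Assign successive fetch streams to fetch/decode pipelines
    round-robin so they can be processed concurrently."""
    pipes = [[] for _ in range(n_pipes)]
    for i, stream in enumerate(streams):
        pipes[i % n_pipes].append(stream)
    return pipes
```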

    A/D bit storage, processing, and modes

    Publication number: US12038847B2

    Publication date: 2024-07-16

    Application number: US17952933

    Filing date: 2022-09-26

    Inventor: William A. Moyes

    CPC classification number: G06F12/1009 G06F12/0811

    Abstract: A/D bit storage, processing, and mode management techniques through use of a dense A/D bit representation are described. In one example, a memory management unit employs an A/D bit representation generation module to generate the dense A/D bit representation. In an implementation, the A/D bit representation is stored adjacent to existing page table structures of the multilevel page table hierarchy. In another example, the memory management unit supports use of modes as part of A/D bit storage.
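One plausible reading of a "dense A/D bit representation" is a packed bitmap of accessed/dirty bits held apart from the page-table entries themselves. The sketch below is a hypothetical illustration of that idea (two bits per page in a bytearray), not the patent's actual format.

```python
class DenseADBits:
    """Illustrative dense A/D bitmap: two bits per page (accessed, dirty)
    packed into a bytearray, conceptually stored adjacent to the page
    tables rather than inside each page-table entry."""

    def __init__(self, num_pages):
        self.bits = bytearray((num_pages * 2 + 7) // 8)

    def _set(self, bit_index):
        self.bits[bit_index // 8] |= 1 << (bit_index % 8)

    def _get(self, bit_index):
        return (self.bits[bit_index // 8] >> (bit_index % 8)) & 1

    def mark_accessed(self, page):
        self._set(page * 2)

    def mark_dirty(self, page):
        self._set(page * 2)       # a write implies an access
        self._set(page * 2 + 1)

    def accessed(self, page):
        return bool(self._get(page * 2))

    def dirty(self, page):
        return bool(self._get(page * 2 + 1))
```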

    Dead surface invalidation
    Invention grant

    Publication number: US12033239B2

    Publication date: 2024-07-09

    Application number: US17563950

    Filing date: 2021-12-28

    CPC classification number: G06T1/60 G06F12/0891 G06T1/20 G06F2212/455

    Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.
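The wait-then-invalidate handshake in the abstract can be sketched as follows. This is an illustrative model with assumed names; a dict mapping cache lines to surface IDs stands in for the GPU cache, and draw-call IDs stand in for completion messages from the shader engine.

```python
class CommandProcessor:
    """Illustrative model of dead-surface invalidation: an invalidation
    request is held until the last draw call touching the surface
    completes, then the surface's cache lines are dropped."""

    def __init__(self, cache):
        self.cache = cache       # cache line -> surface id (hypothetical shape)
        self.pending = {}        # surface id -> id of last draw call using it
        self.completed = set()   # draw calls reported complete by the shader engine

    def queue_invalidate(self, surface, last_draw_call):
        self.pending[surface] = last_draw_call
        self._try_invalidate(surface)

    def on_draw_call_complete(self, draw_call):
        self.completed.add(draw_call)
        for surface in list(self.pending):
            self._try_invalidate(surface)

    def _try_invalidate(self, surface):
        if self.pending.get(surface) in self.completed:
            # Free cache space by dropping every line of the dead surface.
            for line in [l for l, s in self.cache.items() if s == surface]:
                del self.cache[line]
            del self.pending[surface]
```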

    Partial sorting for coherency recovery

    Publication number: US12032967B2

    Publication date: 2024-07-09

    Application number: US17845938

    Filing date: 2022-06-21

    CPC classification number: G06F9/3887 G06F9/3012 G06F9/4881 G06F9/5016

    Abstract: Devices and methods for partial sorting for coherence recovery are provided. The partial sorting is efficiently executed by utilizing existing hardware along the memory path (e.g., memory local to the compute unit). The devices include an accelerated processing device which comprises memory and a processor. The processor is, for example, a compute unit of a GPU which comprises a plurality of SIMD units and is configured to determine, for data entries each comprising a plurality of bits, a number of occurrences of different types of the data entries by storing the number of occurrences in one or more portions of the memory local to the processor, sort the data entries based on the determined number of occurrences stored in the one or more portions of the memory local to the processor and execute the sorted data entries.
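The count-then-sort step can be illustrated with a counting-based partial sort. This is a sequential sketch of the idea only: the histogram list stands in for the memory local to the compute unit, and the low bits of each entry stand in for the entry "type"; the real device would build the histogram across SIMD lanes.

```python
def partial_sort_by_count(entries, key_bits=2):
    """Counting-based partial sort: tally occurrences of each entry type
    in local-memory-like storage, prefix-sum the counts into output
    offsets, then scatter entries so equal types become contiguous."""
    num_types = 1 << key_bits
    mask = num_types - 1
    histogram = [0] * num_types  # stands in for compute-unit local memory
    for e in entries:
        histogram[e & mask] += 1

    # Prefix-sum the counts into starting offsets for each type.
    offsets, total = [], 0
    for count in histogram:
        offsets.append(total)
        total += count

    out = [None] * len(entries)
    for e in entries:
        k = e & mask
        out[offsets[k]] = e
        offsets[k] += 1
    return out
```

Grouping equal types contiguously is what recovers coherence: SIMD lanes that then process adjacent entries take the same branch paths.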
