Network of memory modules with logarithmic access

    Publication Number: US10394726B2

    Publication Date: 2019-08-27

    Application Number: US15229708

    Application Date: 2016-08-05

    Inventor: Gabriel Loh

    Abstract: A memory network includes a plurality of memory nodes, each identifiable by an ordinal number m, and a set of links divided into N subsets, where each subset is identifiable by an ordinal number n. For each of the N subsets, each link in the subset connects two memory nodes whose ordinal numbers m differ by b^(n-1), where b is a positive number. Each of the memory nodes is communicatively coupled to a processor via at least two non-overlapping pathways through the set of links.
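
    To make the topology above concrete, the following Python sketch (not taken from the patent) builds the N link subsets for a hypothetical 16-node network with b = 2, so that subset n connects nodes whose ordinal numbers differ by 2^(n-1); a breadth-first search then shows that any node is reachable in a logarithmic number of hops.

```python
# Illustrative sketch only: the 16-node size, b = 2, and four subsets are
# assumptions chosen to show the structure described in the abstract.
from collections import deque

def build_link_subsets(num_nodes=16, b=2, num_subsets=4):
    """Map subset index n (1-based) to links joining nodes whose numbers differ by b**(n-1)."""
    subsets = {}
    for n in range(1, num_subsets + 1):
        stride = b ** (n - 1)
        subsets[n] = {(m, m + stride) for m in range(num_nodes) if m + stride < num_nodes}
    return subsets

def hops(src, dst, subsets):
    """Breadth-first search over all links: minimum link traversals from src to dst."""
    adjacency = {}
    for links in subsets.values():
        for a, c in links:
            adjacency.setdefault(a, set()).add(c)
            adjacency.setdefault(c, set()).add(a)
    depth, frontier = {src: 0}, deque([src])
    while frontier:
        node = frontier.popleft()
        if node == dst:
            return depth[node]
        for neighbor in adjacency.get(node, ()):
            if neighbor not in depth:
                depth[neighbor] = depth[node] + 1
                frontier.append(neighbor)
    return None

subsets = build_link_subsets()
print(hops(0, 15, subsets))  # 4 hops: 0 -> 8 -> 12 -> 14 -> 15
```

    With these assumed parameters, no pair of the 16 nodes is more than four link traversals apart, which is the logarithmic-access property the title refers to; the two non-overlapping pathways per node are not modeled here.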

    ACCURATE ON-CHIP TEMPERATURE SENSING USING THERMAL OSCILLATOR

    Publication Number: US20190257696A1

    Publication Date: 2019-08-22

    Application Number: US15902101

    Application Date: 2018-02-22

    Abstract: A calibrated temperature sensor includes a power-on oscillator responsive to a calibration enable signal for providing a power-on clock signal, a temperature-dependent oscillator responsive to the calibration enable signal for providing a temperature-dependent clock signal, and a measurement logic circuit. The measurement logic circuit counts a first number of pulses of the temperature-dependent clock signal during a first calibration period using the power-on clock signal, a second number of pulses of the temperature-dependent clock signal during a second calibration period using a system clock signal, a third number of pulses of the power-on clock signal over a third calibration period using the system clock signal, and a fourth number of pulses of the temperature-dependent clock signal using the system clock signal during a normal operation mode. The first calibration period precedes both the second and third calibration periods.
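
    The abstract does not give the arithmetic that turns these four counts into a temperature, so the sketch below is only one plausible reading: the third count pins down the frequency of the power-on oscillator in system-clock terms, which lets the early power-on-referenced measurement be expressed in the same units as the later system-clock-referenced ones. The system clock rate and the calibration-window lengths are all assumptions.

```python
# Purely illustrative arithmetic for the four counts named in the abstract.
# F_SYS_HZ and the calibration-window lengths are assumed values.

F_SYS_HZ = 100e6  # assumed system clock frequency once it is available

def estimate_frequencies(n1, n2, n3, n4,
                         cal1_po_cycles=1024,    # period 1 length, in power-on clock cycles (assumed)
                         cal2_sys_cycles=4096,   # period 2 length, in system clock cycles (assumed)
                         cal3_sys_cycles=4096):  # period 3 length, in system clock cycles (assumed)
    # Period 3: n3 power-on pulses were counted over cal3_sys_cycles of the
    # system clock, which fixes the otherwise unknown power-on clock frequency.
    f_power_on = n3 * F_SYS_HZ / cal3_sys_cycles

    # Period 1 was timed with the power-on clock (before the system clock was
    # available), so its duration can now be converted to seconds.
    f_tdo_cal1 = n1 * f_power_on / cal1_po_cycles

    # Period 2 and the normal-operation count are referenced to the system clock.
    f_tdo_cal2 = n2 * F_SYS_HZ / cal2_sys_cycles
    f_tdo_now = n4 * F_SYS_HZ / cal2_sys_cycles  # assume the same window length

    # A lookup or linear fit would map f_tdo_now (against the calibration points)
    # to a temperature; that mapping is outside what the abstract states.
    return f_power_on, f_tdo_cal1, f_tdo_cal2, f_tdo_now
```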

    Silent active page migration faults
    Granted Patent

    Publication Number: US10365824B2

    Publication Date: 2019-07-30

    Application Number: US15495296

    Application Date: 2017-04-24

    Abstract: Systems, apparatuses, and methods for migrating memory pages are disclosed herein. In response to detecting that a migration of a first page between memory locations is being initiated, a first page table entry (PTE) corresponding to the first page is located and a migration pending indication is stored in the first PTE. In one embodiment, the migration pending indication is encoded in the first PTE by disabling read and write permissions. If a translation request targeting the first PTE is received by the memory management unit (MMU) and the translation request corresponds to a read request, a read operation is allowed to the first page. Otherwise, if the translation request corresponds to a write request, a write operation to the first page is blocked and a silent retry request is generated and conveyed to the requesting client.
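
    A minimal software sketch of the permission-based encoding described above follows; the PTE fields and the retry signalling are simplified assumptions rather than the patented hardware behavior.

```python
# Minimal sketch of the "migration pending" encoding sketched in the abstract.
from dataclasses import dataclass

@dataclass
class PageTableEntry:
    phys_addr: int
    read_ok: bool = True
    write_ok: bool = True
    migrating: bool = False   # tracked explicitly here for clarity; the abstract
                              # encodes it by clearing both permission bits

def begin_migration(pte: PageTableEntry) -> None:
    """Mark the page as migration-pending by revoking read/write permissions."""
    pte.read_ok = False
    pte.write_ok = False
    pte.migrating = True

def translate(pte: PageTableEntry, is_write: bool):
    """Return ('ok', addr), ('silent_retry', None), or ('fault', None)."""
    if pte.migrating:
        if is_write:
            # Writes are blocked; the requesting client is asked to retry silently.
            return ("silent_retry", None)
        # Reads of a migrating page are still serviced from the old location.
        return ("ok", pte.phys_addr)
    if (is_write and pte.write_ok) or (not is_write and pte.read_ok):
        return ("ok", pte.phys_addr)
    return ("fault", None)

pte = PageTableEntry(phys_addr=0x4000)
begin_migration(pte)
print(translate(pte, is_write=False))  # ('ok', 16384)
print(translate(pte, is_write=True))   # ('silent_retry', None)
```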

    ADAPTIVE DCO VF CURVE SLOPE CONTROL
    Patent Application

    Publication Number: US20190229736A1

    Publication Date: 2019-07-25

    Application Number: US16370479

    Application Date: 2019-03-29

    Abstract: An oscillator circuit is provided that adapts to voltage supply variations. The circuit includes first and second delay lines connected to the inputs of an edge detector, with one delay line supplied by a reference voltage and the other by a drooping supply voltage. The edge detector generates an output clock based on a relationship between its inputs, and the output clock is applied to the signal inputs of the first and second delay lines. The output clock has a voltage-dependent frequency performance curve whose slope depends at least on the delay of the second delay line and a delay of the edge detector. At least one of the first delay line delay, the second delay line delay, and the edge detector delay is adjusted to change the slope of the performance curve.
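
    As a rough behavioural illustration (not the patented circuit), the toy model below treats the output clock period as the sum of the two delay-line delays and the edge-detector delay, with only the droop-supplied line slowing as the supply falls; the constants and the delay-versus-voltage law are assumptions.

```python
# Toy behavioural model of the adaptive oscillator described above; the delay
# equations and constants are illustrative assumptions, not the patented design.

def output_frequency_mhz(vdd, ref_delay_ns, droop_delay_ns_nominal,
                         edge_detector_delay_ns, v_nominal=1.0):
    # Assume the droop-supplied delay line slows roughly in proportion to how
    # far the supply has dropped below nominal; the reference-supplied line and
    # the edge detector are treated as voltage-insensitive.
    droop_delay_ns = droop_delay_ns_nominal * (v_nominal / vdd)
    period_ns = ref_delay_ns + droop_delay_ns + edge_detector_delay_ns
    return 1e3 / period_ns

def vf_slope(ref_delay_ns, droop_delay_ns, ed_delay_ns, v_lo=0.85, v_hi=1.0):
    f_lo = output_frequency_mhz(v_lo, ref_delay_ns, droop_delay_ns, ed_delay_ns)
    f_hi = output_frequency_mhz(v_hi, ref_delay_ns, droop_delay_ns, ed_delay_ns)
    return (f_hi - f_lo) / (v_hi - v_lo)   # MHz per volt

print(vf_slope(ref_delay_ns=0.5, droop_delay_ns=0.5, ed_delay_ns=0.2))  # steeper curve
print(vf_slope(ref_delay_ns=1.0, droop_delay_ns=0.5, ed_delay_ns=0.4))  # flatter curve
```

    The two printed slopes show the intended knob: growing the voltage-insensitive delays flattens the voltage-to-frequency curve, while shrinking them steepens it.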

    Managing variations among nodes in parallel system frameworks

    Publication Number: US10355966B2

    Publication Date: 2019-07-16

    Application Number: US15081558

    Application Date: 2016-03-25

    Abstract: Systems, apparatuses, and methods for managing variations among nodes in parallel system frameworks. Sensor and performance data associated with the nodes of a multi-node cluster may be monitored to detect variations among the nodes. A variability metric may be calculated for each node of the cluster based on the sensor and performance data associated with the node. The variability metrics may then be used by a mapper to efficiently map tasks of a parallel application to the nodes of the cluster. In one embodiment, the mapper may assign the critical tasks of the parallel application to the nodes with the lowest variability metrics. In another embodiment, the hardware of the nodes may be reconfigured so as to reduce the node-to-node variability.
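
    The sketch below illustrates the mapping step, using the coefficient of variation of each node's monitored samples as a stand-in variability metric (the abstract does not fix a formula); the node names, sample values, and round-robin placement of non-critical tasks are likewise assumptions.

```python
# Sketch of the task-mapping idea above; the metric and placement policy are
# assumed stand-ins, not the patented method.
import statistics

def variability_metric(samples):
    """Coefficient of variation of a node's sensor/performance samples (assumed metric)."""
    mean = statistics.fmean(samples)
    return statistics.pstdev(samples) / mean if mean else float("inf")

def map_tasks(nodes, critical_tasks, other_tasks):
    """nodes: dict of node_id -> list of monitored samples."""
    ranked = sorted(nodes, key=lambda n: variability_metric(nodes[n]))
    mapping = {}
    # Critical tasks go to the most stable (lowest-variability) nodes first.
    for task, node in zip(critical_tasks, ranked):
        mapping[task] = node
    # Remaining tasks fill in the leftover nodes round-robin.
    leftover = ranked[len(critical_tasks):] or ranked
    for i, task in enumerate(other_tasks):
        mapping[task] = leftover[i % len(leftover)]
    return mapping

nodes = {"n0": [95, 97, 96], "n1": [80, 120, 100], "n2": [99, 101, 100]}
print(map_tasks(nodes, ["critical_a"], ["t1", "t2"]))
# {'critical_a': 'n2', 't1': 'n0', 't2': 'n1'}
```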

    Strided loading of non-sequential memory locations by skipping memory locations between consecutive loads

    Publication Number: US10353708B2

    Publication Date: 2019-07-16

    Application Number: US15273916

    Application Date: 2016-09-23

    Abstract: Systems, apparatuses, and methods for utilizing efficient vectorization techniques for operands in non-sequential memory locations are disclosed. A system includes a vector processing unit (VPU) and one or more memory devices. In response to determining that a plurality of vector operands are stored in non-sequential memory locations, the VPU performs a plurality of vector load operations to load the plurality of vector operands into a plurality of vector registers. Next, the VPU performs a shuffle operation to consolidate the plurality of vector operands from the plurality of vector registers into a single vector register. Then, the VPU performs a vector operation on the vector operands stored in the single vector register. The VPU can also perform a vector store operation by permuting and storing a plurality of vector operands in appropriate locations within multiple vector registers and then storing the vector registers to locations in memory using a mask.
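
    As a software analogue of that flow (not vendor intrinsics), the NumPy sketch below stands in for the vector registers: a few full-width loads cover the strided region, a shuffle consolidates the wanted lanes into one register, and the vector operation then runs on that register. The register width, base address, and stride are assumptions.

```python
# Software analogue of the vectorization flow above, using NumPy arrays to
# stand in for vector registers. LANES, base, and stride are assumptions.
import numpy as np

LANES = 4  # assumed vector register width

def strided_gather(memory, base, stride, count):
    """Load `count` non-sequential operands spaced `stride` apart, emulating
    several wide vector loads followed by a shuffle into one register."""
    # Step 1: a few full-width "vector loads" covering the addressed region.
    loads = [memory[base + i * LANES : base + (i + 1) * LANES]
             for i in range((count * stride + LANES - 1) // LANES)]
    # Step 2: "shuffle" the wanted lanes from those registers into one register.
    flat = np.concatenate(loads)
    return flat[::stride][:count]

memory = np.arange(64)
vec = strided_gather(memory, base=8, stride=3, count=4)
print(vec)      # [ 8 11 14 17]
print(vec * 2)  # the vector operation then runs on the consolidated register
```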

    Fused shader programs
    Granted Patent

    Publication Number: US10353591B2

    Publication Date: 2019-07-16

    Application Number: US15442499

    Application Date: 2017-02-24

    Abstract: Improvements in compute shader programs executed on parallel processing hardware are disclosed. An application or other entity defines a sequence of shader programs to execute. Each shader program defines inputs and outputs which would, if unmodified, execute as loads and stores to a general-purpose memory, incurring high latency. A compiler combines the shader programs into groups that can operate in a lower-latency, but lower-capacity, local data store memory. The boundaries of these combined shader programs are determined by several factors, including where memory barrier operations are to execute, whether combinations of shader programs can execute using only the local data store and not the global memory (except for initial reads and writes), and other considerations.
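
    A greedy sketch of that grouping decision follows: consecutive shader programs are fused as long as their combined scratch footprint fits an assumed local-data-store budget and no memory barrier forces a split. The capacity figure and the pass records are illustrative assumptions, not the compiler's actual heuristics.

```python
# Greedy sketch of the fusing idea above; capacity and pass data are assumptions.
LDS_CAPACITY_BYTES = 64 * 1024  # assumed local data store size

def fuse_passes(passes):
    """passes: list of dicts like {"name": str, "lds_bytes": int, "barrier_before": bool}.
    Returns groups of pass names; each group can run out of the local data store."""
    groups, current, used = [], [], 0
    for p in passes:
        must_split = p["barrier_before"] or used + p["lds_bytes"] > LDS_CAPACITY_BYTES
        if current and must_split:
            groups.append(current)
            current, used = [], 0
        current.append(p["name"])
        used += p["lds_bytes"]
    if current:
        groups.append(current)
    return groups

pipeline = [
    {"name": "gen_rays",  "lds_bytes": 16 * 1024, "barrier_before": False},
    {"name": "intersect", "lds_bytes": 32 * 1024, "barrier_before": False},
    {"name": "shade",     "lds_bytes": 32 * 1024, "barrier_before": False},
    {"name": "resolve",   "lds_bytes": 8 * 1024,  "barrier_before": True},
]
print(fuse_passes(pipeline))  # [['gen_rays', 'intersect'], ['shade'], ['resolve']]
```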
