-
公开(公告)号:US10013240B2
公开(公告)日:2018-07-03
申请号:US15188304
申请日:2016-06-21
Applicant: Advanced Micro Devices, Inc.
Inventor: Daniel I. Lowell
CPC classification number: G06F8/30 , G06F8/41 , G06F8/454 , G06F11/1629 , G06F2201/805
Abstract: A first processing element is configured to execute a first thread and one or more second processing elements are configured to execute one or more second threads that are redundant to the first thread. The first thread and the one or more second threads are to selectively bypass one or more comparisons of results of operations performed by the first thread and the one or more second threads depending on whether an event trigger for the comparison has occurred a configurable number of times since a previous comparison of previously encoded values of the results. In some cases the comparison can be performed based on hashed (or encoded) values of the results of a current operation and one or more previous operations.
-
公开(公告)号:US20180165790A1
公开(公告)日:2018-06-14
申请号:US15377998
申请日:2016-12-13
Applicant: Advanced Micro Devices, Inc.
Inventor: Daniel Schneider , Fataneh Ghodrat
IPC: G06T1/60 , G06F12/0877 , G06F12/0815 , G06F15/80
CPC classification number: G06T1/60 , G06F12/0815 , G06F12/0877 , G06F15/8007 , G06F2212/455 , G06F2212/60 , G06F2212/621 , G06T1/20 , G06T15/005
Abstract: Techniques for allowing cache access returns out of order are disclosed. A return ordering queue exists for each of several cache access types and stores outstanding cache accesses in the order in which those accesses were made. When a cache access request for a particular type is at the head of the return ordering queue for that type and the cache access is available for return to the wavefront that made that access, the cache system returns the cache access to the wavefront. Thus, cache accesses can be returned out of order with respect to cache accesses of different types. Allowing out-of-order returns can help to improve latency, for example in the situation where a relatively low-latency access type (e.g., a read) is issued after a relatively high-latency access type (e.g., a texture sampler operation).
-
公开(公告)号:US20180165314A1
公开(公告)日:2018-06-14
申请号:US15824771
申请日:2017-11-28
Applicant: Advanced Micro Devices, Inc.
Inventor: Steven R. Havlir , Patrick J. Shyvers
IPC: G06F17/30
CPC classification number: G06F16/2264 , G06F9/3844 , G06F16/2246 , G06F16/2255
Abstract: Described herein is a system and method for multiplexer tree (muxtree) indexing. Muxtree indexing performs hashing and row reduction in parallel by use of each select bit only once in a particular path of the muxtree. The muxtree indexing generates a different final index as compared to conventional hashed indexing but still results in a fair hash, where all table entries get used with equal distribution with uniformly random selects.
-
公开(公告)号:US09990203B2
公开(公告)日:2018-06-05
申请号:US14981310
申请日:2015-12-28
Applicant: Advanced Micro Devices, Inc.
Inventor: Leonardo de Paula Rosa Piga , Abhinandan Majumdar , Indrani Paul , Wei Huang , Manish Arora , Joseph L. Greathouse
IPC: G06F9/30
CPC classification number: G06F9/30192 , G06F9/30014 , G06F9/30083 , G06F9/30145 , G06F11/00
Abstract: Methods, devices, and systems for capturing an accuracy of an instruction executing on a processor. An instruction may be executed on the processor, and the accuracy of the instruction may be captured using a hardware counter circuit. The accuracy of the instruction may be captured by analyzing bits of at least one value of the instruction to determine a minimum or maximum precision datatype for representing the field, and determining whether to adjust a value of the hardware counter circuit accordingly. The representation may be output to a debugger or logfile for use by a developer, or may be output to a runtime or virtual machine to automatically adjust instruction precision or gating of portions of the processor datapath.
-
公开(公告)号:US09983655B2
公开(公告)日:2018-05-29
申请号:US14963352
申请日:2015-12-09
Applicant: Advanced Micro Devices, Inc.
Inventor: Mitesh R. Meswani , David A. Roberts , Dmitri Yudanov , Arkaprava Basu , Sergey Blagodurov
CPC classification number: G06F1/3243 , G06F9/3885
Abstract: A method and apparatus for performing inter-lane power management includes de-energizing one or more execution lanes upon a determination that the one or more execution lanes are to be predicated. Energy from the predicated execution lanes is redistributed to one or more active execution lanes.
-
公开(公告)号:US09983652B2
公开(公告)日:2018-05-29
申请号:US14959669
申请日:2015-12-04
Applicant: Advanced Micro Devices, Inc.
Inventor: Leonardo Piga , Indrani Paul , Wei Huang
CPC classification number: G06F1/3203 , G06F1/3206 , G06F1/3287 , Y02D10/171
Abstract: Systems, apparatuses, and methods for balancing computation and communication power in power constrained environments. A data processing cluster with a plurality of compute nodes may perform parallel processing of a workload in a power constrained environment. Nodes that finish tasks early may be power-gated based on one or more conditions. In some scenarios, a node may predict a wait duration and go into a reduced power consumption state if the wait duration is predicted to be greater than a threshold. The power saved by power-gating one or more nodes may be reassigned for use by other nodes. A cluster agent may be configured to reassign the unused power to the active nodes to expedite workload processing.
-
公开(公告)号:US20180143781A1
公开(公告)日:2018-05-24
申请号:US15360518
申请日:2016-11-23
Applicant: Advanced Micro Devices, Inc.
Inventor: Joseph L. Greathouse , Christopher D. Erb , Michael G. Collins
CPC classification number: G06F3/0647 , G06F3/0619 , G06F3/0656 , G06F3/0685 , G06F9/4443 , G06F9/451 , G06T1/20 , G06T1/60
Abstract: A processing apparatus is provided that includes a plurality of memory regions each corresponding to a memory address and configured to store data associated with the corresponding memory address. The processing apparatus also includes an accelerated processing device in communication with the memory regions and configured to determine a request to allocate an initial memory buffer comprising a number of contiguous memory regions, create a new memory buffer comprising one or more additional memory regions adjacent to the contiguous memory regions of the initial memory buffer, assign one or more values to the one or more additional memory regions and detect a change to the one or more values at the one or more additional memory regions.
-
公开(公告)号:US09977609B2
公开(公告)日:2018-05-22
申请号:US15063186
申请日:2016-03-07
Applicant: Advanced Micro Devices, Inc.
Inventor: Nuwan S. Jayasena , Dong Ping Zhang , Paula Aguilera Diez
CPC classification number: G06F3/0613 , G06F3/0658 , G06F3/0673 , G06F12/084 , G06F12/0888 , G06F12/10 , G06F2212/1024
Abstract: Systems, apparatuses, and methods for implementing efficient queues and other data structures. A queue may be shared among multiple processors and/or threads without using explicit software atomic instructions to coordinate access to the queue. System software may allocate an atomic queue and corresponding queue metadata in system memory and return, to the requesting thread, a handle referencing the queue metadata. Any number of threads may utilize the handle for accessing the atomic queue. The logic for ensuring the atomicity of accesses to the atomic queue may reside in a management unit in the memory controller coupled to the memory where the atomic queue is allocated.
-
公开(公告)号:US20180129504A1
公开(公告)日:2018-05-10
申请号:US15804655
申请日:2017-11-06
Applicant: Advanced Micro Devices, Inc.
Inventor: Lee W. Howes , Benedict R. Gaster , Michael C. Houston
CPC classification number: G06F9/3851 , G06F8/458 , G06F9/3009
Abstract: A method, system, and computer program product synchronize a group of workitems executing an instruction stream on a processor. The processor is yielded by a first workitem responsive to a synchronization instruction in the instruction stream. A first one of a plurality of program counters is updated to point to a next instruction following the synchronization instruction in the instruction stream to be executed by the first workitem. A second workitem is run on the processor after the yielding.
-
公开(公告)号:US09965134B2
公开(公告)日:2018-05-08
申请号:US13620122
申请日:2012-09-14
Applicant: David M. Lynch
Inventor: David M. Lynch
IPC: G06F3/048 , G06F3/0481 , G06F9/44
CPC classification number: G06F3/0481 , G06F9/451
Abstract: A method and apparatus provides a user an interface for a file system. In one example, the method and apparatus displays the file as a visualized object, e.g., a graphical representation of the file as a real life object, receives selection of visualized objects and activates data elements represented by the visualized objects. The visualization of the file may be determined based on visualizer identification information associated with the file. For the activated data elements, the method and apparatus displays tool interfaces, in combination with the visualized objects. The tool interfaces may be selectively displayed for the activated data element base on tool identification information associated with a data type of the data element. Furthermore, the method and apparatus can process the activated data elements using the selected tool actions from different programs.
-
-
-
-
-
-
-
-
-