-
公开(公告)号:US10360177B2
公开(公告)日:2019-07-23
申请号:US15189054
申请日:2016-06-22
Applicant: Advanced Micro Devices, Inc. , ATI Technologies ULC
Inventor: Syed Zohaib M. Gilani , Jiasheng Chen , QingCheng Wang , YunXiao Zou , Michael Mantor , Bin He , Timour T. Paltashev
IPC: G06F15/80 , G06F1/3234 , G06T15/00
Abstract: Described is a method and processing apparatus to improve power efficiency by gating redundant threads processing. In particular, the method for gating redundant threads in a graphics processor includes determining if data for a thread and data for at least another thread are within a predetermined similarity threshold, gating execution of the at least another thread if the data for the thread and the data for the at least another thread are within the predetermined similarity threshold, and using an output data from the thread as an output data for the at least another thread.
-
公开(公告)号:US20180342099A1
公开(公告)日:2018-11-29
申请号:US16057212
申请日:2018-08-07
Applicant: Advanced Micro Devices, Inc.
Inventor: Timour T. Paltashev , Boris Prokopenko , Vladimir V. Kibardin
CPC classification number: G06T17/20 , G06T1/20 , G06T15/005 , G06T2210/52
Abstract: A method, a system, and a computer-readable storage medium directed to performing high-speed parallel tessellation of 3D surface patches are disclosed. The method includes generating a plurality of primitives in parallel. Each primitive in the plurality is generated by a sequence of functional blocks, in which each sequence acts independently of all the other sequences.
-
公开(公告)号:US20180114290A1
公开(公告)日:2018-04-26
申请号:US15331278
申请日:2016-10-21
Applicant: Advanced Micro Devices, Inc.
Inventor: Timour T. Paltashev , Michael Mantor , Rex Eldon McCrary
Abstract: A graphics processing unit (GPU) includes a plurality of programmable processing cores configured to process graphics primitives and corresponding data and a plurality of fixed-function hardware units. The plurality of processing cores and the plurality of fixed-function hardware units are configured to implement a configurable number of virtual pipelines to concurrently process different command flows. Each virtual pipeline includes a configurable number of fragments and an operational state of each virtual pipeline is specified by a different context. The configurable number of virtual pipelines can be modified from a first number to a second number that is different than the first number. An emulation of a fixed-function hardware unit can be instantiated on one or more of the graphics processing cores in response to detection of a bottleneck in a fixed-function hardware unit. One or more of the virtual pipelines can then be reconfigured to utilize the emulation instead of the fixed-function hardware unit.
-
公开(公告)号:US10032308B2
公开(公告)日:2018-07-24
申请号:US15189656
申请日:2016-06-22
Applicant: Advanced Micro Devices, Inc.
Inventor: Timour T. Paltashev , Chris Brennan
Abstract: A shader in a graphics pipeline accesses an object that represents a portion of a model of a scene in object space and one or more far-z values that indicate a furthest distance of a previously rendered portion of one or more tiles from a viewpoint used to render the scene on a screen. The one or more tiles overlap a bounding box of the object in a plane of the screen. The shader culls the object from the graphics pipeline in response to the one or more far-z values being smaller than a near-z value that represents a closest distance of a portion of the object to the viewpoint.
-
公开(公告)号:US20180075574A1
公开(公告)日:2018-03-15
申请号:US15262317
申请日:2016-09-12
Applicant: Advanced Micro Devices, Inc.
Inventor: Chris Brennan , Timour T. Paltashev
IPC: G06T1/60 , G06T1/20 , H04N19/184 , H04N19/186 , H04N19/46 , H04N19/423
CPC classification number: G06T1/60 , G06T1/20 , H04N19/593
Abstract: A method and apparatus for real time compressing randomly accessed data includes extracting a block of randomly accessed data from a memory hierarchy. One or more individual portions of the randomly accessed data are independently compressed in real time to create a lossless compressed image surface. The compressed image surface includes data of independently compressed image blocks for reading and decompressing in a random order. The method further includes storing structured information relating to the dynamically compressed randomly accessed data.
-
公开(公告)号:US20180024938A1
公开(公告)日:2018-01-25
申请号:US15216071
申请日:2016-07-21
Applicant: Advanced Micro Devices, Inc.
Inventor: Timour T. Paltashev , Christopher Brennan
IPC: G06F12/1009 , G06F12/02 , G06F12/128
CPC classification number: G06F12/1009 , G06F12/023 , G06F12/0897 , G06F12/1027 , G06F12/128 , G06F2212/1044 , G06F2212/657
Abstract: A processing system for reduction of a virtual memory page fault rate that includes a first memory to store a dataset, a second memory to store a subset of the dataset, and a processing unit. The processing unit is configured to receive a memory access request including a virtual address and determine whether the virtual address is mapped to a first physical page in the first memory and or a second physical page in the second memory. The processing unit maps a third physical page in a free page pool of the second memory to the virtual address in response to the virtual address not being mapped to the second physical page. The processing unit also grants access to the third physical page that is mapped to the virtual address.
-
公开(公告)号:US12254527B2
公开(公告)日:2025-03-18
申请号:US16879991
申请日:2020-05-21
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Timour T. Paltashev , Michael Mantor , Rex Eldon McCrary
Abstract: A graphics processing unit (GPU) includes a plurality of programmable processing cores configured to process graphics primitives and corresponding data and a plurality of fixed-function hardware units. The plurality of processing cores and the plurality of fixed-function hardware units are configured to implement a configurable number of virtual pipelines to concurrently process different command flows. Each virtual pipeline includes a configurable number of fragments and an operational state of each virtual pipeline is specified by a different context. The configurable number of virtual pipelines can be modified from a first number to a second number that is different than the first number. An emulation of a fixed-function hardware unit can be instantiated on one or more of the graphics processing cores in response to detection of a bottleneck in a fixed-function hardware unit. One or more of the virtual pipelines can then be reconfigured to utilize the emulation instead of the fixed-function hardware unit.
-
公开(公告)号:US20180061124A1
公开(公告)日:2018-03-01
申请号:US15251914
申请日:2016-08-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Boris Prokopenko , Timour T. Paltashev , Vladimir V. Kibardin
CPC classification number: G06T17/20 , G06T15/005 , G06T17/10 , G06T2210/12
Abstract: A parallel adaptable graphics rasterization system in which a primitive assembler includes a router to selectively route a primitive to a first rasterizer or one of a plurality of second rasterizers. The second rasterizers concurrently operate on different primitives and the primitive is selectively routed based on an area of the primitive. In some variations, a bounding box of the primitive is reduced to a predetermined number of pixels prior to providing the primitive to the one of the plurality of second rasterizers. Reducing the bounding box can include subtracting an origin of the bounding box from coordinates of points that represent the primitive.
-
公开(公告)号:US20170371393A1
公开(公告)日:2017-12-28
申请号:US15189054
申请日:2016-06-22
Applicant: Advanced Micro Devices, Inc. , ATI Technologies ULC
Inventor: Syed Zohaib M. Gilani , Jiasheng Chen , QingCheng Wang , YunXiao Zou , Michael Mantor , Bin He , Timour T. Paltashev
CPC classification number: G06F15/8007 , G06F1/3234 , G06F1/3243 , G06T15/005 , Y02D10/152
Abstract: Described is a method and processing apparatus to improve power efficiency by gating redundant threads processing. In particular, the method for gating redundant threads in a graphics processor includes determining if data for a thread and data for at least another thread are within a predetermined similarity threshold, gating execution of the at least another thread if the data for the thread and the data for the at least another thread are within the predetermined similarity threshold, and using an output data from the thread as an output data for the at least another thread.
-
公开(公告)号:US20170193697A1
公开(公告)日:2017-07-06
申请号:US14984896
申请日:2015-12-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Timour T. Paltashev , Boris Prokopenko , Vladimir V. Kibardin
CPC classification number: G06T17/20 , G06T1/20 , G06T15/005 , G06T2210/52
Abstract: A method, a system, and a computer-readable storage medium directed to performing high-speed parallel tessellation of 3D surface patches are disclosed. The method includes generating a plurality of primitives in parallel. Each primitive in the plurality is generated by a sequence of functional blocks, in which each sequence acts independently of all the other sequences.
-
-
-
-
-
-
-
-
-