Patent search: ap:("ADVANCED MICRO DEVICES, INC." OR "ATI TECHNOLOGIES ULC") AND inv:"Saurabh Sharma" (page 1)

1.

Granted patent
Compressing texture data on a per-channel basis (status: in force)

Publication No.: US11308648B2

Publication date: 2022-04-19

Application No.: US17030048

Filing date: 2020-09-23

Applicants: ADVANCED MICRO DEVICES, INC.; ATI TECHNOLOGIES ULC

Inventors: Saurabh Sharma, Laurent Lefebvre, Sagar Shankar Bhandare, Ruijin Wu

IPC classification: G06T9/00, G06T1/60

Abstract: Sampling circuitry independently accesses channels of texture data that represent a set of pixels. One or more processing units separately compress the channels of the texture data and store compressed data representative of the channels of the texture data for the set of pixels. The channels can include a red channel, a blue channel, and a green channel that represent color values of the set of pixels and an alpha channel that represents degrees of transparency of the set of pixels. Storing the compressed data can include writing the compressed data to portions of a cache. The processing units can identify a subset of the set of pixels that share a value of a first channel of the plurality of channels and represent the value of the first channel over the subset of the set of pixels using information representing the value, the first channel, and boundaries of the subset.
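
The abstract above does not name a specific encoding, so the following is only a minimal sketch of the general idea, assuming a simple run-length scheme: each RGBA channel is read out on its own and compressed separately, and every run records the shared value together with the boundaries (start and end index) of the pixels that share it. All function names are hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int, int, int]  # (r, g, b, a)
Run = Tuple[int, int, int]         # (value, start index, end index exclusive)

CHANNELS = ("r", "g", "b", "a")


def split_channels(pixels: List[Pixel]) -> Dict[str, List[int]]:
    """Read each channel out independently of the others."""
    return {name: [p[i] for p in pixels] for i, name in enumerate(CHANNELS)}


def compress_channel(values: List[int]) -> List[Run]:
    """Run-length encode one channel as (value, start, end) runs."""
    runs: List[Run] = []
    start = 0
    for i in range(1, len(values) + 1):
        # Close the current run at the end of the data or when the value changes.
        if i == len(values) or values[i] != values[start]:
            runs.append((values[start], start, i))
            start = i
    return runs


def decompress_channel(runs: List[Run]) -> List[int]:
    """Expand (value, start, end) runs back into per-pixel values."""
    out: List[int] = []
    for value, start, end in runs:
        out.extend([value] * (end - start))
    return out


def compress_pixels(pixels: List[Pixel]) -> Dict[str, List[Run]]:
    """Compress every channel separately; the dict key identifies the channel."""
    return {name: compress_channel(vals)
            for name, vals in split_channels(pixels).items()}
```

In this sketch a fully opaque block compresses its alpha channel to a single run even when the color channels vary, which is the kind of saving that per-channel treatment makes possible.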

2.

Granted patent
Compressing texture data on a per-channel basis (status: in force)

Publication No.: US11694367B2

Publication date: 2023-07-04

Application No.: US17716186

Filing date: 2022-04-08

Applicants: ADVANCED MICRO DEVICES, INC.; ATI TECHNOLOGIES ULC

Inventors: Saurabh Sharma, Laurent Lefebvre, Sagar Shankar Bhandare, Ruijin Wu

IPC classification: G06T9/00, G06T1/60

CPC classification: G06T9/00, G06T1/60, G06T2200/04

Abstract: Sampling circuitry independently accesses channels of texture data that represent a set of pixels. One or more processing units separately compress the channels of the texture data and store compressed data representative of the channels of the texture data for the set of pixels. The channels can include a red channel, a blue channel, and a green channel that represent color values of the set of pixels and an alpha channel that represents degrees of transparency of the set of pixels. Storing the compressed data can include writing the compressed data to portions of a cache. The processing units can identify a subset of the set of pixels that share a value of a first channel of the plurality of channels and represent the value of the first channel over the subset of the set of pixels using information representing the value, the first channel, and boundaries of the subset.

3.

Published application
VARIABLE DISPATCH WALK FOR SUCCESSIVE CACHE ACCESSES (status: pending, published)

Publication No.: US20230195626A1

Publication date: 2023-06-22

Application No.: US17558008

Filing date: 2021-12-21

Applicants: ADVANCED MICRO DEVICES, INC.; ATI TECHNOLOGIES ULC

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Guennadi Riguer, Mark Fowler, Randy Ramsey

IPC classification: G06F12/0806, G06F12/10

CPC classification: G06F12/0806, G06F12/10, G06F2212/1016

Abstract: A processing system is configured to translate a first cache access pattern of a dispatch of work items to a cache access pattern that facilitates consumption of data stored at a cache of a parallel processing unit by a subsequent access before the data is evicted to a more remote level of the memory hierarchy. For consecutive cache accesses having read-after-read data locality, in some embodiments the processing system translates the first cache access pattern to a space-filling curve. In some embodiments, for consecutive accesses having read-after-write data locality, the processing system translates a first typewriter cache access pattern that proceeds in ascending order for a first access to a reverse typewriter cache access pattern that proceeds in descending order for a subsequent cache access. By translating the cache access pattern based on data locality, the processing system increases the hit rate of the cache.
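
To make the access patterns named in the abstract concrete, here is a small illustrative model. It assumes a grid of tiles, an LRU cache that holds whole tiles, and a Z-order (Morton) curve as one example of a space-filling curve. None of these choices or function names come from the application itself.

```python
from collections import OrderedDict
from typing import List, Tuple

Tile = Tuple[int, int]  # (x, y)


def typewriter(width: int, height: int) -> List[Tile]:
    """Ascending row-major order: left to right, top to bottom."""
    return [(x, y) for y in range(height) for x in range(width)]


def reverse_typewriter(width: int, height: int) -> List[Tile]:
    """The same walk in descending order."""
    return list(reversed(typewriter(width, height)))


def _interleave(x: int, y: int, bits: int = 16) -> int:
    """Interleave the bits of x and y to form a Morton key."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)
        key |= ((y >> i) & 1) << (2 * i + 1)
    return key


def z_order(width: int, height: int) -> List[Tile]:
    """Z-order walk, one example of a space-filling curve."""
    return sorted(typewriter(width, height), key=lambda t: _interleave(*t))


def second_pass_hits(first: List[Tile], second: List[Tile], capacity: int) -> int:
    """Count hits in the second pass of an LRU cache warmed by the first pass."""
    cache: "OrderedDict[Tile, bool]" = OrderedDict()
    for tile in first:                      # first access, e.g. a write pass
        cache[tile] = True
        cache.move_to_end(tile)
        if len(cache) > capacity:
            cache.popitem(last=False)       # evict the least recently used tile
    hits = 0
    for tile in second:                     # subsequent access, e.g. a read pass
        if tile in cache:
            hits += 1
            cache.move_to_end(tile)
        else:
            cache[tile] = True
            if len(cache) > capacity:
                cache.popitem(last=False)
    return hits
```

Under these assumptions, with a 4x4 grid and room for four tiles, repeating the ascending walk finds none of the first pass's tiles still cached, whereas the descending walk starts with the four most recently written tiles and hits on all of them.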

4.

Published application
VARIABLE DISPATCH WALK (status: pending, published)

Publication No.: US20230195509A1

Publication date: 2023-06-22

Application No.: US17557927

Filing date: 2021-12-21

Applicants: ADVANCED MICRO DEVICES, INC.; ATI TECHNOLOGIES ULC

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Guennadi Riguer, Mark Fowler, Randy Ramsey

IPC classification: G06F9/48

CPC classification: G06F9/4831

Abstract: A processing unit performs a dispatch walk of a set of thread groups based on a programmable access pattern. The access pattern is stored at a table that is programmed with the access pattern based upon a specified command. By using the command to program the table with different access patterns, the dispatch order of the set of thread groups is adapted to better suit the processing of different data sets, thereby reducing power consumption at the processing unit, and improving overall processing efficiency.
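
A toy model of the table-driven walk described in the abstract might look like the following. The class and method names are hypothetical, and representing the table as a plain permutation of thread-group indices is an assumption made only for illustration.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


class DispatchWalker:
    """Dispatches thread groups in the order stored in a programmable table."""

    def __init__(self) -> None:
        self.table: List[int] = []

    def program(self, order: Sequence[int]) -> None:
        """Model of the command that programs the table with an access pattern."""
        if sorted(order) != list(range(len(order))):
            raise ValueError("access pattern must be a permutation of 0..n-1")
        self.table = list(order)

    def walk(self, thread_groups: Sequence[T]) -> List[T]:
        """Return the thread groups in the programmed dispatch order."""
        if len(thread_groups) != len(self.table):
            raise ValueError("table size does not match the number of thread groups")
        return [thread_groups[i] for i in self.table]
```

Reprogramming the same walker with a different permutation changes the dispatch order without touching the thread groups themselves, which is the flexibility the abstract attributes to the command-programmed table.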

5.

Granted patent
Dead surface invalidation (status: in force)

Publication No.: US12033239B2

Publication date: 2024-07-09

Application No.: US17563950

Filing date: 2021-12-28

Applicant: Advanced Micro Devices, Inc.

Inventors: Priyadarshi Sharma, Anshuman Mittal, Saurabh Sharma

IPC classification: G06T1/20, G06F12/0891, G06T1/60

CPC classification: G06T1/60, G06F12/0891, G06T1/20, G06F2212/455

Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.
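
The ordering described in the abstract (invalidate only after the last draw call that touches the surface has completed) can be sketched as a small software model. The classes, method names, and the dictionary-based cache below are all hypothetical simplifications and do not reflect the patented hardware.

```python
from typing import Dict, Iterable, Set


class Cache:
    """Toy cache that remembers which surface each cached line belongs to."""

    def __init__(self) -> None:
        self.lines: Dict[int, str] = {}  # line address -> surface name

    def fill(self, address: int, surface: str) -> None:
        self.lines[address] = surface

    def invalidate_surface(self, surface: str) -> int:
        """Drop every line of the surface and return how many were freed."""
        dead = [addr for addr, s in self.lines.items() if s == surface]
        for addr in dead:
            del self.lines[addr]
        return len(dead)


class CommandProcessor:
    """Defers surface invalidation until the last accessing draw call completes."""

    def __init__(self, cache: Cache) -> None:
        self.cache = cache
        self.last_draw: Dict[str, int] = {}  # surface -> id of last draw using it
        self.completed: Set[int] = set()
        self.pending: Set[str] = set()       # surfaces waiting on a completion

    def submit_draw(self, draw_id: int, surfaces: Iterable[str]) -> None:
        for surface in surfaces:
            self.last_draw[surface] = draw_id

    def invalidate_surface(self, surface: str) -> None:
        """Surface invalidation command sent by the application."""
        last = self.last_draw.get(surface)
        if last is None or last in self.completed:
            self.cache.invalidate_surface(surface)
        else:
            self.pending.add(surface)        # wait for the completion message

    def on_draw_complete(self, draw_id: int) -> None:
        """Draw call completion message from the shader engine."""
        self.completed.add(draw_id)
        ready = [s for s in self.pending if self.last_draw[s] == draw_id]
        for surface in ready:
            self.pending.discard(surface)
            self.cache.invalidate_surface(surface)
```

In this model an invalidation request that arrives while draw calls are still in flight is simply parked, and the cache lines are released only when the completion message for the final accessing draw call arrives.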

6.

Published application
DEAD SURFACE INVALIDATION (status: pending, published)

Publication No.: US20230206384A1

Publication date: 2023-06-29

Application No.: US17563950

Filing date: 2021-12-28

Applicant: Advanced Micro Devices, Inc.

Inventors: Priyadarshi Sharma, Anshuman Mittal, Saurabh Sharma

IPC classification: G06T1/60, G06F12/0891, G06T1/20

CPC classification: G06T1/60, G06F12/0891, G06T1/20, G06F2212/455

Abstract: Systems, apparatuses, and methods for performing dead surface invalidation are disclosed. An application sends draw call commands to a graphics processing unit (GPU) via a driver, with the draw call commands rendering to surfaces. After it is determined that a given surface will no longer be accessed by subsequent draw calls, the application sends a surface invalidation command for the given surface to a command processor of the GPU. After the command processor receives the surface invalidation command, the command processor waits for a shader engine to send a draw call completion message for a last draw call to access the given surface. Once the command processor receives the draw call completion message, the command processor sends a surface invalidation command to a cache to invalidate cache lines for the given surface to free up space in the cache for other data.

7.

Published application
STOCHASTIC OPTIMIZATION OF SURFACE CACHEABILITY IN PARALLEL PROCESSING UNITS (status: pending, published)

Publication No.: US20230195639A1

Publication date: 2023-06-22

Application No.: US17557475

Filing date: 2021-12-21

Applicant: ADVANCED MICRO DEVICES, INC.

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Christopher J. Brennan

IPC classification: G06F12/0893

CPC classification: G06F12/0893, G06F2212/6042

Abstract: A processing system selectively allocates storage at a local cache of a parallel processing unit for cache lines of a repeating pattern of data that exceeds the storage capacity of the cache. The processing system identifies repeating patterns of data having cache lines that have a reuse distance that exceeds the storage capacity of the cache. A cache controller allocates storage for only a subset of cache lines of the repeating pattern of data at the cache and excludes the remainder of cache lines of the repeating pattern of data from the cache. By restricting the cache to store only a subset of cache lines of the repeating pattern of data, the cache controller increases the hit rate at the cache for the subset of cache lines.
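
The effect described in the abstract can be reproduced with a short simulation. This is a sketch under stated assumptions: an LRU replacement policy, a strictly cyclic access pattern, and a randomly sampled subset of cacheable lines. The function names are hypothetical, and the application may select the subset differently.

```python
import random
from collections import OrderedDict
from typing import Iterable, List, Optional, Set


def lru_hits(accesses: Iterable[int], capacity: int,
             cacheable: Optional[Set[int]] = None) -> int:
    """Count hits in an LRU cache; lines outside `cacheable` are never allocated.

    With cacheable=None every line is allocated on a miss.
    """
    cache: "OrderedDict[int, bool]" = OrderedDict()
    hits = 0
    for line in accesses:
        if line in cache:
            hits += 1
            cache.move_to_end(line)
        elif cacheable is None or line in cacheable:
            cache[line] = True
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used line
    return hits


def pick_subset(pattern: List[int], capacity: int, seed: int = 0) -> Set[int]:
    """Randomly choose at most `capacity` lines of the pattern to keep cacheable."""
    rng = random.Random(seed)
    return set(rng.sample(pattern, min(capacity, len(pattern))))
```

For an eight-line pattern repeated four times against a four-line cache, allocating every line thrashes the LRU cache and yields no hits, while restricting allocation to a four-line subset yields a hit on each of those lines in every pass after the first.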

8.

Granted patent
Stochastic optimization of surface cacheability in parallel processing units (status: in force)

Publication No.: US12117939B2

Publication date: 2024-10-15

Application No.: US17557475

Filing date: 2021-12-21

Applicant: ADVANCED MICRO DEVICES, INC.

Inventors: Saurabh Sharma, Jeremy Lukacs, Hashem Hashemi, Gianpaolo Tommasi, Christopher J. Brennan

IPC classification: G06F12/00, G06F12/0893

CPC classification: G06F12/0893, G06F2212/6042

Abstract: A processing system selectively allocates storage at a local cache of a parallel processing unit for cache lines of a repeating pattern of data that exceeds the storage capacity of the cache. The processing system identifies repeating patterns of data having cache lines that have a reuse distance that exceeds the storage capacity of the cache. A cache controller allocates storage for only a subset of cache lines of the repeating pattern of data at the cache and excludes the remainder of cache lines of the repeating pattern of data from the cache. By restricting the cache to store only a subset of cache lines of the repeating pattern of data, the cache controller increases the hit rate at the cache for the subset of cache lines.
