-
Publication No.: US11263051B2
Publication Date: 2022-03-01
Application No.: US16866811
Filing Date: 2020-05-05
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Suryakant Patidar, Praveen Krishnamurthy, Wishwesh Anil Gandhi
Abstract: Accesses between a processor and its external memory are reduced when the processor internally maintains a compressed version of values stored in the external memory. The processor can then refer to the compressed version rather than access the external memory. One compression technique involves maintaining a dictionary on the processor that maps portions of a memory to values. When all of the values of a portion of memory are uniform (e.g., the same), the value is stored in the dictionary for that portion of memory. Thereafter, when the processor needs to access that portion of memory, the value is retrieved from the dictionary rather than from external memory. Techniques are disclosed herein to extend, for example, the capabilities of such dictionary-based compression so that the number of accesses between the processor and its external memory is further reduced.
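The dictionary idea in this abstract can be illustrated with a small software model. The following is only a sketch under stated assumptions, not the patented hardware design: the names `ExternalMemory`, `UniformDictionary`, and `BLOCK_SIZE` are hypothetical, and the model simply counts how many reads reach the simulated external memory.

```python
from typing import Dict, List

BLOCK_SIZE = 4  # hypothetical number of values per "portion of memory"


class ExternalMemory:
    """Stand-in for off-chip memory that counts how often it is read."""

    def __init__(self, values: List[int]) -> None:
        self.values = list(values)
        self.reads = 0

    def read(self, addr: int) -> int:
        self.reads += 1
        return self.values[addr]


class UniformDictionary:
    """On-processor map from block index to that block's single value."""

    def __init__(self, memory: ExternalMemory) -> None:
        self.memory = memory
        self.uniform: Dict[int, int] = {}
        # Build the dictionary once; this setup scan is not counted as reads.
        for start in range(0, len(memory.values), BLOCK_SIZE):
            block = memory.values[start:start + BLOCK_SIZE]
            if len(set(block)) == 1:
                self.uniform[start // BLOCK_SIZE] = block[0]

    def load(self, addr: int) -> int:
        block = addr // BLOCK_SIZE
        if block in self.uniform:
            # Uniform block: answer from the dictionary, no external access.
            return self.uniform[block]
        return self.memory.read(addr)

    def store(self, addr: int, value: int) -> None:
        block = addr // BLOCK_SIZE
        self.memory.values[addr] = value
        # Conservative assumption: drop the entry if the write breaks
        # uniformity; this sketch never re-detects uniformity after writes.
        if self.uniform.get(block) != value:
            self.uniform.pop(block, None)
```

With memory contents `[7, 7, 7, 7, 1, 2, 3, 4]`, a load from the first block is served from the dictionary and leaves the read counter at zero, while a load from the second block falls through to external memory.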
-
Publication No.: US11069023B2
Publication Date: 2021-07-20
Application No.: US16422776
Filing Date: 2019-05-24
Applicant: NVIDIA Corporation
Inventor: Ram Rangan
Abstract: A technique selectively avoids memory fetches for partially uniform textures in real-time graphics shader programs and instead uses program paths specialized for one or more frequently occurring values. One aspect avoids memory lookups and dependent computations for partially uniform textures through use of pre-constructed coarse-grained representations called value locality maps or dirty tilemaps (DTMs). The decision whether to use a specialized fast path is made dynamically by consulting such coarse-grained dirty tilemap representations. Thread-sharing value reuse can be implemented with or instead of the DTM mechanism.
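As a rough illustration of the dirty tilemap idea, the sketch below models a texture as a 2D list and marks a tile "dirty" when any texel in it differs from one frequently occurring value. Everything here is an assumption made for illustration: the tile size `TILE`, the function names, and the placeholder shading computation `value * 2.0 + 1.0` are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

TILE = 2  # hypothetical tile edge length, in texels


def build_dirty_tilemap(texture: List[List[int]],
                        common_value: int) -> List[List[bool]]:
    """Mark each tile True (dirty) if any texel differs from common_value."""
    rows, cols = len(texture), len(texture[0])
    tile_rows = (rows + TILE - 1) // TILE
    tile_cols = (cols + TILE - 1) // TILE
    dtm = [[False] * tile_cols for _ in range(tile_rows)]
    for y in range(rows):
        for x in range(cols):
            if texture[y][x] != common_value:
                dtm[y // TILE][x // TILE] = True
    return dtm


def general_path(value: int) -> float:
    """Placeholder for the computation that depends on the fetched texel."""
    return value * 2.0 + 1.0


def shade(texture: List[List[int]], dtm: List[List[bool]],
          common_value: int, x: int, y: int) -> Tuple[float, bool]:
    """Return (result, fetched); fetched is False when the fast path ran."""
    if not dtm[y // TILE][x // TILE]:
        # Clean tile: every texel equals common_value, so skip the fetch.
        # A real specialized path would have this result precomputed.
        return general_path(common_value), False
    # Dirty tile: fall back to the normal texture fetch.
    return general_path(texture[y][x]), True
```

For a 4x4 texture that is all zeros except one texel, only the tile containing that texel is dirty, so lookups in the other three tiles take the fetch-free path.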
-
Publication No.: US11372548B2
Publication Date: 2022-06-28
Application No.: US16888116
Filing Date: 2020-05-29
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Patrick Richard Brown, Wishwesh Anil Gandhi, Steven James Heinrich, Mathias Heyer, Emmett Michael Kilgariff, Praveen Krishnamurthy, Dong Han Ryu
Abstract: Some systems compress data utilized by user-mode software without the user-mode software being aware of any compression taking place. To maintain that illusion, such systems prevent user-mode software from being aware of and/or accessing the underlying compressed states of the data. While such an approach protects proprietary compression techniques used in such systems from being deciphered, such restrictions limit the ability of user-mode software to use the underlying compressed forms of the data in new ways. Disclosed herein are various techniques for allowing user-mode software to access the underlying compressed states of data either directly or indirectly. Such techniques can be used, for example, to allow various user-mode software on a single system or on multiple systems to exchange data in the underlying compression format of the system(s) even when the user-mode software is unable to decipher the compression format.
-
Publication No.: US20210349761A1
Publication Date: 2021-11-11
Application No.: US16866811
Filing Date: 2020-05-05
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Suryakant Patidar, Praveen Krishnamurthy, Wishwesh Anil Gandhi
Abstract: Accesses between a processor and its external memory are reduced when the processor internally maintains a compressed version of values stored in the external memory. The processor can then refer to the compressed version rather than access the external memory. One compression technique involves maintaining a dictionary on the processor that maps portions of a memory to values. When all of the values of a portion of memory are uniform (e.g., the same), the value is stored in the dictionary for that portion of memory. Thereafter, when the processor needs to access that portion of memory, the value is retrieved from the dictionary rather than from external memory. Techniques are disclosed herein to extend, for example, the capabilities of such dictionary-based compression so that the number of accesses between the processor and its external memory is further reduced.
-
Publication No.: US20210373774A1
Publication Date: 2021-12-02
Application No.: US16888116
Filing Date: 2020-05-29
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Patrick Richard Brown, Wishwesh Anil Gandhi, Steven James Heinrich, Mathias Heyer, Emmett Michael Kilgariff, Praveen Krishnamurthy, Dong Han Ryu
Abstract: Some systems compress data utilized by user-mode software without the user-mode software being aware of any compression taking place. To maintain that illusion, such systems prevent user-mode software from being aware of and/or accessing the underlying compressed states of the data. While such an approach protects proprietary compression techniques used in such systems from being deciphered, such restrictions limit the ability of user-mode software to use the underlying compressed forms of the data in new ways. Disclosed herein are various techniques for allowing user-mode software to access the underlying compressed states of data either directly or indirectly. Such techniques can be used, for example, to allow various user-mode software on a single system or on multiple systems to exchange data in the underlying compression format of the system(s) even when the user-mode software is unable to decipher the compression format.
-
Publication No.: US20210349639A1
Publication Date: 2021-11-11
Application No.: US16866813
Filing Date: 2020-05-05
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Suryakant Patidar, Praveen Krishnamurthy, Wishwesh Anil Gandhi
Abstract: Accesses between a processor and its external memory are reduced when the processor internally maintains a compressed version of values stored in the external memory. The processor can then refer to the compressed version rather than access the external memory. One compression technique involves maintaining a dictionary on the processor that maps portions of a memory to values. When all of the values of a portion of memory are uniform (e.g., the same), the value is stored in the dictionary for that portion of memory. Thereafter, when the processor needs to access that portion of memory, the value is retrieved from the dictionary rather than from external memory. Techniques are disclosed herein to extend, for example, the capabilities of such dictionary-based compression so that the number of accesses between the processor and its external memory is further reduced.
-
Publication No.: US11513686B2
Publication Date: 2022-11-29
Application No.: US16866813
Filing Date: 2020-05-05
Applicant: NVIDIA Corporation
Inventor: Ram Rangan, Suryakant Patidar, Praveen Krishnamurthy, Wishwesh Anil Gandhi
Abstract: Accesses between a processor and its external memory are reduced when the processor internally maintains a compressed version of values stored in the external memory. The processor can then refer to the compressed version rather than access the external memory. One compression technique involves maintaining a dictionary on the processor that maps portions of a memory to values. When all of the values of a portion of memory are uniform (e.g., the same), the value is stored in the dictionary for that portion of memory. Thereafter, when the processor needs to access that portion of memory, the value is retrieved from the dictionary rather than from external memory. Techniques are disclosed herein to extend, for example, the capabilities of such dictionary-based compression so that the number of accesses between the processor and its external memory is further reduced.
-
Publication No.: US11138018B2
Publication Date: 2021-10-05
Application No.: US16221106
Filing Date: 2018-12-14
Applicant: Nvidia Corporation
Inventor: Marc Blackstein, Ram Rangan
Abstract: Profile-guided optimization is a technique for optimizing execution of computer programs using profile information to improve program runtime performance. Obtaining the profile information can be challenging, especially in live production environments such as high-performance gaming systems. A profiling strategy is provided herein that obtains profile information without requiring extra effort from users. The profiling strategy collects several approximate, lightweight profiles called piecemeal profiles over one or more lifetimes of a computer program, or application. The piecemeal profiles are then used to generate whole program application profiles that can then be used to improve the execution of the application. A piecemeal profile is profile information of a section or portion of an application.
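The step of combining piecemeal profiles into a whole-program profile might look roughly like the sketch below. The abstract does not say how the combination is done, so summing per-function execution counts is purely an assumption here, as are the function name `merge_piecemeal_profiles` and the dictionary-of-counts profile format.

```python
from collections import Counter
from typing import Dict, Iterable


def merge_piecemeal_profiles(
        profiles: Iterable[Dict[str, int]]) -> Counter:
    """Sum per-function counts across piecemeal profiles.

    Each input maps a function name to an approximate execution count
    observed for one section of the application during one run.
    Summation is an assumed merge rule, chosen only for illustration.
    """
    whole: Counter = Counter()
    for profile in profiles:
        whole.update(profile)  # Counter.update adds counts key by key
    return whole
```

Given the profiles `{"draw": 3, "load": 1}` and `{"load": 4, "audio": 2}`, the merged profile ranks `load` as the hottest function, which is the kind of information an optimizer could then act on.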