-
公开(公告)号:US20210243101A1
公开(公告)日:2021-08-05
申请号:US16893327
申请日:2020-06-04
Applicant: NVIDIA CORPORATION
Inventor: Joohwan Kim , Benjamin Boudaoud , Josef B. Spjut , Morgan S. McGuire , Seth P. Schneider , Rouslan L. Dimitrov , Lars Nordskog , Cody J. Robson , Sau Yan Keith Li , Gerrit Ary Slavenburg , Tom J. Verbeure
Abstract: A display device for measuring the end-to-end latency of a computing system. The computing system includes an input device, a computing device, and the display device. The display device is directly connected with the input device and receives input data packets generated by the input device in response to received user input events. The display device passes the input packets to the computing device for graphics processing. The display device measures the end-to-end latency comprising the sum of three latencies. A first latency comprises an input delay of the input device. A second latency comprises an amount of time between generation of the input packet and a corresponding change in pixel values caused by the input event at the display device. A third latency comprises a display latency. The display device also displays latency information associated with the measured end-to-end latency.
-
公开(公告)号:US11784906B2
公开(公告)日:2023-10-10
申请号:US16893327
申请日:2020-06-04
Applicant: NVIDIA CORPORATION
Inventor: Joohwan Kim , Benjamin Boudaoud , Josef B. Spjut , Morgan S. McGuire , Seth P. Schneider , Rouslan L. Dimitrov , Lars Nordskog , Cody J. Robson , Sau Yan Keith Li , Gerrit Ary Slavenburg , Tom J. Verbeure
IPC: H04L43/106 , H04L43/0852 , G06F3/14 , G09G5/14 , G06T1/20
CPC classification number: H04L43/106 , G06F3/14 , G06T1/20 , G09G5/14 , H04L43/0852 , G09G2340/0407
Abstract: A display device for measuring the end-to-end latency of a computing system. The computing system includes an input device, a computing device, and the display device. The display device is directly connected with the input device and receives input data packets generated by the input device in response to received user input events. The display device passes the input packets to the computing device for graphics processing. The display device measures the end-to-end latency comprising the sum of three latencies. A first latency comprises an input delay of the input device. A second latency comprises an amount of time between generation of the input packet and a corresponding change in pixel values caused by the input event at the display device. A third latency comprises a display latency. The display device also displays latency information associated with the measured end-to-end latency.
-
公开(公告)号:US09355483B2
公开(公告)日:2016-05-31
申请号:US13946977
申请日:2013-07-19
Applicant: NVIDIA Corporation
Inventor: Eric B. Lum , Rouslan L. Dimitrov , Ignacio Llamas Ubieto , Patrick James Neill , Yury Uralsky , Albert Meixner
IPC: G06T11/40
CPC classification number: G06T11/40
Abstract: A system, method, and computer program product are provided for shading primitive fragments. A target buffer may be recast when shaded samples that are covered by a primitive fragment are generated at a first shading rate using a first sampling mode, the shaded samples are stored in the target buffer that is associated with the first sampling mode and the first shading rate, a second sampling mode is determined, and the target buffer is associated with the second sampling mode. A sampling mode and/or shading rate may be changed for a primitive. A primitive fragment that is associated with a first sampling mode and a first shading rate is received and a second sampling mode is determined for the primitive fragment. Shaded samples corresponding to the primitive fragment are generated, at a second shading rate, using the second sampling mode and the shaded samples are stored in a target buffer.
Abstract translation: 提供了一种系统,方法和计算机程序产品,用于着色原始片段。 当使用第一采样模式以第一阴影率产生由原始片段覆盖的阴影样本时,可以重写目标缓冲器,阴影样本存储在与第一采样模式和第一着色相关联的目标缓冲器中 速率,确定第二采样模式,并且目标缓冲器与第二采样模式相关联。 对于原语,可以改变采样模式和/或阴影率。 接收与第一采样模式和第一遮蔽速率相关联的原始片段,并且为原始片段确定第二采样模式。 使用第二采样模式以第二遮蔽速率生成与原始片段对应的阴影样本,并将阴影样本存储在目标缓冲器中。
-
公开(公告)号:US20230007920A1
公开(公告)日:2023-01-12
申请号:US17932539
申请日:2022-09-15
Applicant: NVIDIA Corporation
Inventor: Rouslan L. Dimitrov , Dale L. Kirkland , Emmett M. Kilgariff , Sachin Satish Idgunji , Siddharth Sharma
Abstract: Graphics processing unit (GPU) performance and power efficiency is improved using machine learning to tune operating parameters based on performance monitor values and application information. Performance monitor values are processed using machine learning techniques to generate model parameters, which are used by a control unit within the GPU to provide real-time updates to the operating parameters. In one embodiment, a neural network processes the performance monitor values to generate operating parameters in real-time.
-
公开(公告)号:US10402937B2
公开(公告)日:2019-09-03
申请号:US15857330
申请日:2017-12-28
Applicant: NVIDIA Corporation
Inventor: Rouslan L. Dimitrov , Kirill A. Dmitriev , Andrei Khodakovsky , Tzyywei Hwang , Wishwesh Anil Gandhi , Lacky Vasant Shah
IPC: G06T1/60 , G06T15/00 , G06F12/1009 , G06T1/20
Abstract: A method for rendering graphics frames allocates rendering work to multiple graphics processing units (GPUs) that are configured to allow access to pages of data stored in locally attached memory of a peer GPU. The method includes the steps of generating, by a first GPU coupled to a first memory circuit, one or more first memory access requests to render a first primitive for a first frame, where at least one of the first memory access requests targets a first page of data that physically resides within a second memory circuit coupled to a second GPU. The first GPU requests the first page of data through a first data link coupling the first GPU to the second GPU and a register circuit within the first GPU accumulates an access request count for the first page of data. The first GPU notifies a driver that the access request count has reached a specified threshold.
-
公开(公告)号:US12045924B2
公开(公告)日:2024-07-23
申请号:US17932539
申请日:2022-09-15
Applicant: NVIDIA Corporation
Inventor: Rouslan L. Dimitrov , Dale L. Kirkland , Emmett M. Kilgariff , Sachin Satish Idgunji , Siddharth Sharma
CPC classification number: G06T15/005 , G06N3/08 , G06T15/80 , G06T17/10
Abstract: Graphics processing unit (GPU) performance and power efficiency is improved using machine learning to tune operating parameters based on performance monitor values and application information. Performance monitor values are processed using machine learning techniques to generate model parameters, which are used by a control unit within the GPU to provide real-time updates to the operating parameters. In one embodiment, a neural network processes the performance monitor values to generate operating parameters in real-time.
-
公开(公告)号:US10430915B2
公开(公告)日:2019-10-01
申请号:US15879377
申请日:2018-01-24
Applicant: NVIDIA Corporation
Inventor: Andrei Khodakovsky , Kirill A. Dmitriev , Rouslan L. Dimitrov , Tzyywei Hwang , Wishwesh Anil Gandhi , Lacky Vasant Shah
Abstract: One or more copy commands are scheduled for locating one or more pages of data in a local memory of a graphics processing unit (GPU) for more efficient access to the pages of data during rendering. A first processing unit that is coupled to a first GPU receives a notification that an access request count has reached a specified threshold. The first processing unit schedules a copy command to copy the first page of data to a first memory circuit of the first GPU from a second memory circuit of the second GPU. The copy command is included within a GPU command stream.
-
公开(公告)号:US20190206023A1
公开(公告)日:2019-07-04
申请号:US15857330
申请日:2017-12-28
Applicant: NVIDIA Corporation
Inventor: Rouslan L. Dimitrov , Kirill A. Dmitriev , Andrei Khodakovsky , Tzyywei Hwang , Wishwesh Anil Gandhi , Lacky Vasant Shah
IPC: G06T1/60 , G06T1/20 , G06F12/1009 , G06T15/00
CPC classification number: G06T1/60 , G06F9/44 , G06F12/1009 , G06T1/20 , G06T15/005
Abstract: A method for rendering graphics frames allocates rendering work to multiple graphics processing units (GPUs) that are configured to allow access to pages of data stored in locally attached memory of a peer GPU. The method includes the steps of generating, by a first GPU coupled to a first memory circuit, one or more first memory access requests to render a first primitive for a first frame, where at least one of the first memory access requests targets a first page of data that physically resides within a second memory circuit coupled to a second GPU. The first GPU requests the first page of data through a first data link coupling the first GPU to the second GPU and a register circuit within the first GPU accumulates an access request count for the first page of data. The first GPU notifies a driver that the access request count has reached a specified threshold.
-
公开(公告)号:US20150022537A1
公开(公告)日:2015-01-22
申请号:US13946977
申请日:2013-07-19
Applicant: NVIDIA Corporation
Inventor: Eric B. Lum , Rouslan L. Dimitrov , Ignacio Llamas , Patrick James Neill , Yury Uralsky , Albert Meixner
CPC classification number: G06T11/40
Abstract: A system, method, and computer program product are provided for shading primitive fragments. A target buffer may be recast when shaded samples that are covered by a primitive fragment are generated at a first shading rate using a first sampling mode, the shaded samples are stored in the target buffer that is associated with the first sampling mode and the first shading rate, a second sampling mode is determined, and the target buffer is associated with the second sampling mode. A sampling mode and/or shading rate may be changed for a primitive. A primitive fragment that is associated with a first sampling mode and a first shading rate is received and a second sampling mode is determined for the primitive fragment. Shaded samples corresponding to the primitive fragment are generated, at a second shading rate, using the second sampling mode and the shaded samples are stored in a target buffer.
Abstract translation: 提供了一种系统,方法和计算机程序产品,用于着色原始片段。 当使用第一采样模式以第一阴影率产生由原始片段覆盖的阴影样本时,可以重写目标缓冲器,阴影样本存储在与第一采样模式和第一着色相关联的目标缓冲器中 速率,确定第二采样模式,并且目标缓冲器与第二采样模式相关联。 对于原语,可以改变采样模式和/或阴影率。 接收与第一采样模式和第一遮蔽速率相关联的原始片段,并且为原始片段确定第二采样模式。 使用第二采样模式以第二遮蔽速率生成与原始片段相对应的阴影样本,并将阴影样本存储在目标缓冲器中。
-
公开(公告)号:US11481950B2
公开(公告)日:2022-10-25
申请号:US17162550
申请日:2021-01-29
Applicant: NVIDIA Corporation
Inventor: Rouslan L. Dimitrov , Dale L. Kirkland , Emmett M. Kilgariff , Sachin Satish Idgunji , Siddharth Sharma
Abstract: Graphics processing unit (GPU) performance and power efficiency is improved using machine learning to tune operating parameters based on performance monitor values and application information. Performance monitor values are processed using machine learning techniques to generate model parameters, which are used by a control unit within the GPU to provide real-time updates to the operating parameters. In one embodiment, a neural network processes the performance monitor values to generate operating parameters in real-time.
-
-
-
-
-
-
-
-
-