Patent search ap:("NVIDIA Corporation") AND inv:"Robert J. STOLL" Page 1

1.

发明申请
REORDERING BUFFER FOR MEMORY ACCESS LOCALITY 有权
Title translation: 用于存储访问本地的后台缓冲区

公开(公告)号：US20140164743A1

公开(公告)日：2014-06-12

申请号：US13710004

申请日：2012-12-10

Applicant: NVIDIA CORPORATION

Inventor： Olivier GIROUX , Jack Hilaire CHOQUETTE , Xiaogang QIU , Robert J. STOLL

IPC: G06F9/30

CPC classification number: G06F9/30145 , G06F9/30152 , G06F9/3808 , G06F9/3851 , G06F9/3855

Abstract: Systems and methods for scheduling instructions for execution on a multi-core processor reorder the execution of different threads to ensure that instructions specified as having localized memory access behavior are executed over one or more sequential clock cycles to benefit from memory access locality. At compile time, code sequences including memory access instructions that may be localized are delineated into separate batches. A scheduling unit ensures that multiple parallel threads are processed over one or more sequential scheduling cycles to execute the batched instructions. The scheduling unit waits to schedule execution of instructions that are not included in the particular batch until execution of the batched instructions is done so that memory access locality is maintained for the particular batch. In between the separate batches, instructions that are not included in a batch are scheduled so that threads executing non-batched instructions are also processed and not starved.

Abstract translation: 用于调度用于在多核处理器上执行的指令的系统和方法重新排序不同线程的执行，以确保指定为具有局部存储器访问行为的指令在一个或多个顺序时钟周期上执行以受益于存储器访问位置。在编译时，包括可能本地化的存储器访问指令的代码序列被划分为单独的批处理。调度单元确保通过一个或多个顺序调度周期处理多个并行线程以执行批量指令。调度单元等待执行不包括在特定批中的指令，直到完成批处理指令的执行，以便为特定批次维护存储器访问位置。在单独批次之间，调度不包含在批处理中的指令，以便执行非批处理指令的线程也被处理并且不会被饿死。

2.

发明申请
APPROACH FOR A CONFIGURABLE PHASE-BASED PRIORITY SCHEDULER 有权
Title translation: 可配置的基于相位优先级调度器的方法

公开(公告)号：US20140189698A1

公开(公告)日：2014-07-03

申请号：US13728828

申请日：2012-12-27

Applicant: NVIDIA Corporation

Inventor： Jack Hilaire CHOQUETTE , Olivier GIROUX , Robert J. STOLL , Gary M. TAROLLI , John Erik LINDHOLM

IPC: G06F9/48

CPC classification number: G06F9/5038 , G06F9/3851 , G06F9/3887 , G06F9/4881

Abstract: A streaming multiprocessor (SM) in a parallel processing subsystem schedules priority among a plurality of threads. The SM retrieves a priority descriptor associated with a thread group, and determines whether the thread group and a second thread group are both operating in the same phase. If so, then the method determines whether the priority descriptor of the thread group indicates a higher priority than the priority descriptor of the second thread group. If so, the SM skews the thread group relative to the second thread group such that the thread groups operate in different phases, otherwise the SM increases the priority of the thread group. f the thread groups are not operating in the same phase, then the SM increases the priority of the thread group. One advantage of the disclosed techniques is that thread groups execute with increased efficiency, resulting in improved processor performance.

Abstract translation: 并行处理子系统中的流多处理器（SM）调度多个线程中的优先级。 SM检索与线程组相关联的优先级描述符，并确定线程组和第二线程组是否都处于同一阶段。如果是，则该方法确定线程组的优先级描述符是否指示比第二线程组的优先级描述符更高的优先级。如果是这样，则SM相对于第二个线程组倾斜线程组，使得线程组以不同的阶段运行，否则SM增加了线程组的优先级。如果线程组不在同一阶段工作，则SM会增加线程组的优先级。所公开的技术的一个优点是线程组以更高的效率执行，从而提高了处理器性能。

3.

发明申请
APPROACH FOR A CONFIGURABLE PHASE-BASED PRIORITY SCHEDULER 审中-公开

公开(公告)号：US20170192822A9

公开(公告)日：2017-07-06

申请号：US14613339

申请日：2015-02-03

Applicant: NVIDIA Corporation

Inventor： Jack Hilaire CHOQUETTE , Olivier GIROUX , Robert J. STOLL , Gary M. TAROLLI , John Erik LINDHOLM

IPC: G06F9/50

CPC classification number: G06F9/5038 , G06F9/3851 , G06F9/3887 , G06F9/4881

Abstract: A streaming multiprocessor (SM) in a parallel processing subsystem schedules priority among a plurality of threads. The SM retrieves a priority descriptor associated with a thread group, and determines whether the thread group and a second thread group are both operating in the same phase. If so, then the method determines whether the priority descriptor of the thread group indicates a higher priority than the priority descriptor of the second thread group. If so, the SM skews the thread group relative to the second thread group such that the thread groups operate in different phases, otherwise the SM increases the priority of the thread group. f the thread groups are not operating in the same phase, then the SM increases the priority of the thread group. One advantage of the disclosed techniques is that thread groups execute with increased efficiency, resulting in improved processor performance.

4.

发明申请
APPROACH FOR A CONFIGURABLE PHASE-BASED PRIORITY SCHEDULER 审中-公开
Title translation: 可配置的基于相位优先级调度器的方法

公开(公告)号：US20160224386A1

公开(公告)日：2016-08-04

申请号：US14613339

申请日：2015-02-03

Applicant: NVIDIA Corporation

Inventor： Jack Hilaire CHOQUETTE , Olivier GIROUX , Robert J. STOLL , Gary M. TAROLLI , John Erik LINDHOLM

IPC: G06F9/50

CPC classification number: G06F9/5038 , G06F9/3851 , G06F9/3887 , G06F9/4881

Abstract: A streaming multiprocessor (SM) in a parallel processing subsystem schedules priority among a plurality of threads. The SM retrieves a priority descriptor associated with a thread group, and determines whether the thread group and a second thread group are both operating in the same phase. If so, then the method determines whether the priority descriptor of the thread group indicates a higher priority than the priority descriptor of the second thread group. If so, the SM skews the thread group relative to the second thread group such that the thread groups operate in different phases, otherwise the SM increases the priority of the thread group. f the thread groups are not operating in the same phase, then the SM increases the priority of the thread group. One advantage of the disclosed techniques is that thread groups execute with increased efficiency, resulting in improved processor performance.

Abstract translation: 并行处理子系统中的流多处理器（SM）调度多个线程中的优先级。 SM检索与线程组相关联的优先级描述符，并确定线程组和第二线程组是否都处于同一阶段。如果是，则该方法确定线程组的优先级描述符是否指示比第二线程组的优先级描述符更高的优先级。如果是这样，则SM相对于第二个线程组倾斜线程组，使得线程组以不同的阶段运行，否则SM增加了线程组的优先级。线程组不在同一个阶段运行，则SM会增加线程组的优先级。所公开的技术的一个优点是线程组以更高的效率执行，从而提高了处理器性能。

5.

发明申请
HIERARCHICAL STAGING AREAS FOR SCHEDULING THREADS FOR EXECUTION 审中-公开
Title translation: 用于调度执行螺纹的分级分区

公开(公告)号：US20150113538A1

公开(公告)日：2015-04-23

申请号：US14061170

申请日：2013-10-23

Applicant: NVIDIA CORPORATION

Inventor： Olivier GIROUX , Jack Hilaire CHOQUETTE , Robert J. STOLL , Xiaogang QIU , Michael Alan FETTERMAN

IPC: G06F9/50

CPC classification number: G06F9/5011 , G06F2209/507

Abstract: One embodiment of the present invention is a computer-implemented method for scheduling a thread group for execution on a processing engine that includes identifying a first thread group included in a first set of thread groups that can be issued for execution on the processing engine, where the first thread group includes one or more threads. The method also includes transferring the first thread group from the first set of thread groups to a second set of thread groups, allocating hardware resources to the first thread group, and selecting the first thread group from the second set of thread groups for execution on the processing engine. One advantage of the disclosed technique is that a scheduler only allocates limited hardware resources to thread groups that are, in fact, ready to be issued for execution, thereby conserving those resources in a manner that is generally more efficient than conventional techniques.

Abstract translation: 本发明的一个实施例是一种用于在处理引擎上调度用于执行的线程组的计算机实现的方法，该处理引擎包括识别包括在可被发行用于在处理引擎上执行的第一组线程组中的第一线程组，其中第一个线程组包括一个或多个线程。该方法还包括将第一线程组从第一组线程组传送到第二组线程组，向第一线程组分配硬件资源，以及从第二组线程组中选择第一线程组以在处理引擎。所公开技术的一个优点是调度器仅将有限的硬件资源分配给事实上准备被发行用于执行的线程组，从而以通常比常规技术更有效的方式来保存那些资源。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification