Patent search ap:("NVIDIA Corporation") AND inv:"Vyas Venkataraman" Page 1

1.

发明申请
DISTRIBUTED ADDRESS TRANSLATION IN A MULTI-NODE INTERCONNECT FABRIC 审中-公开

公开(公告)号：US20200159669A1

公开(公告)日：2020-05-21

申请号：US16198649

申请日：2018-11-21

Applicant: NVIDIA Corporation

Inventor： Samuel Hammond Duncan , Sanjeev Jain , Mark Douglas Hummel , Vyas Venkataraman , Olivier Giroux , Larry Robert Dennison , Alexander Toichi Ishii , Hemayet Hossain , Nir Haim Arad

IPC: G06F12/1027

Abstract: Multiprocessor clusters in a virtualized environment conventionally fail to provide memory access security, which is frequently a requirement for efficient utilization in multi-client settings. Without adequate access security, a malicious process may access what might be confidential data that belongs to a different client sharing the multiprocessor cluster. Furthermore, an inadvertent programming error in the code for one client process may accidentally corrupt data that belongs to the different client. Neither scenario is acceptable. Embodiments of the present disclosure provide access security by enabling each processing node within a multiprocessor cluster to virtualize and manage local memory access and only process access requests possessing proper access credentials. In this way, different applications executing on a multiprocessor cluster may be isolated from each other while advantageously sharing the hardware resources of the multiprocessor cluster.

2.

发明授权
Secure execution for multiple processor devices using trusted executing environments 有权

公开(公告)号：US12141268B2

公开(公告)日：2024-11-12

申请号：US17485148

申请日：2021-09-24

Applicant: NVIDIA Corporation

Inventor： Philip John Rogers , Mark Overby , Michael Asbury Woodmansee , Vyas Venkataraman , Naveen Cherukuri , Gobikrishna Dhanuskodi , Dwayne Frank Swoboda , Lucien Burton Dunning , Mark Hairgrove , Sudeshna Guha

IPC: G06F21/53 , G06F9/455 , G06F21/10 , G06F21/54 , G06F21/60 , G06F21/79

Abstract: Apparatuses, systems, and techniques to generate a trusted execution environment including multiple accelerators. In at least one embodiment, a parallel processing unit (PPU), such as a graphics processing unit (GPU), operates in a secure execution mode including a protect memory region. Furthermore, in an embodiment, a cryptographic key is utilized to protect data during transmission between the accelerators.

3.

发明授权
Techniques for an efficient fabric attached memory 有权

公开(公告)号：US11822491B2

公开(公告)日：2023-11-21

申请号：US17506438

申请日：2021-10-20

Applicant: NVIDIA Corporation

Inventor： John Feehrer , Denis Foley , Mark Hummel , Vyas Venkataraman , Ram Gummadi , Samuel H. Duncan , Glenn Dearth , Brian Kelleher

IPC: G06F13/16 , G06F13/40 , G06F9/455 , G06N20/00 , G06F17/16 , G06F12/1027

CPC classification number: G06F13/1652 , G06F9/45558 , G06F12/1027 , G06F13/1668 , G06F13/4022 , G06F17/16 , G06N20/00 , G06F2009/45583

Abstract: Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as a graphics processing unit(s) (GPU)(s), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.

4.

发明授权
Selectively killing trapped multi-process service clients sharing the same hardware context 有权

公开(公告)号：US09886736B2

公开(公告)日：2018-02-06

申请号：US14481802

申请日：2014-09-09

Applicant: NVIDIA Corporation

Inventor： Kyrylo Perelygin , Vivek Kini , Vyas Venkataraman

IPC: G06T1/00 , G06T1/20 , G06F12/109 , G06T1/60 , G06F9/50

CPC classification number: G06T1/20 , G06F9/5016 , G06F12/109 , G06T1/60

Abstract: A method for handling parallel processing clients associated with a server in a GPU, the method comprising: receiving a failure indication for at least client running a thread in the GPU; determining threads in the GPU associated with the failing client; exiting threads in the GPU associated with the failing client; and continuing to execute remaining threads in the GPU for other clients running threads in the GPU.

5.

发明授权
Implementing trusted executing environments across multiple processor devices 有权

公开(公告)号：US12219057B2

公开(公告)日：2025-02-04

申请号：US17485110

申请日：2021-09-24

Applicant: NVIDIA Corporation

Inventor： Philip John Rogers , Mark Overby , Michael Asbury Woodmansee , Vyas Venkataraman , Naveen Cherukuri , Gobikrishna Dhanuskodi , Dwayne Frank Swoboda , Lucien Burton Dunning , Mark Hairgrove , Sudeshna Guha

IPC: H04L29/00 , G06F9/455 , H04L9/08 , H04L9/30

Abstract: Apparatuses, systems, and techniques to generate a trusted execution environment including multiple accelerators. In at least one embodiment, a parallel processing unit (PPU), such as a graphics processing unit (GPU), operates in a secure execution mode including a protect memory region. Furthermore, in an embodiment, a cryptographic key is utilized to protect data during transmission between the accelerators.

6.

发明申请
IMPLEMENTING TRUSTED EXECUTING ENVIRONMENTS ACROSS MULTIPLE PROCESSOR DEVICES 有权

公开(公告)号：US20230094125A1

公开(公告)日：2023-03-30

申请号：US17485110

申请日：2021-09-24

Applicant: NVIDIA Corporation

Inventor： Philip John Rogers , Mark Overby , Michael Asbury Woodmansee , Vyas Venkataraman , Naveen Cherukuri , Gobikrishna Dhanuskodi , Dwayne Frank Swoboda , Lucien Burton Dunning , Mark Hairgrove , Sudeshna Guha

IPC: H04L9/08 , G06F9/455 , H04L9/30

Abstract: Apparatuses, systems, and techniques to generate a trusted execution environment including multiple accelerators. In at least one embodiment, a parallel processing unit (PPU), such as a graphics processing unit (GPU), operates in a secure execution mode including a protect memory region. Furthermore, in an embodiment, a cryptographic key is utilized to protect data during transmission between the accelerators.

7.

发明授权
Method and system for heterogeneous filtering framework for shared memory data access hazard reports 有权
Title translation: 用于共享内存数据访问危害报告的异构过滤框架的方法和系统

公开(公告)号：US09038080B2

公开(公告)日：2015-05-19

申请号：US13728968

申请日：2012-12-27

Applicant: NVIDIA Corporation

Inventor： Vyas Venkataraman , Manjunath Kudlur , Vinod Grover

IPC: G06F9/46 , G06F3/06 , G06F9/48 , G06F11/07

CPC classification number: G06F3/0653 , G06F9/4881 , G06F9/4887 , G06F11/0715 , G06F11/0751

Abstract: A system and method for detecting, filtering, prioritizing and reporting shared memory hazards are disclosed. The method includes, for a unit of hardware operating on a block of threads, mapping a plurality of shared memory locations assigned to the unit to a tracking table. The tracking table comprises initialization information for each shared memory location. The method also includes, for an instruction of a program within a barrier region, identifying a potential conflict by identifying a second access to a location in shared memory within a block of threads executed by the hardware unit. First information associated with a first access and second information associated with the second access to the location is determined. Filter criteria is applied to the first and second information to determine whether the instruction causes a reportable hazard. The instruction is reported when it causes the reportable hazard.

Abstract translation: 公开了一种用于检测，过滤，优先排序和报告共享内存危害的系统和方法。该方法包括对于在线程块上操作的硬件单元，将分配给该单元的多个共享存储器位置映射到跟踪表。跟踪表包括每个共享存储器位置的初始化信息。该方法还包括对于屏障区域内的程序的指令，通过识别由硬件单元执行的线程块内的共享存储器中的位置的第二访问来识别潜在的冲突。确定与第一访问相关联的第一信息和与对该位置的第二访问相关联的第二信息。过滤标准被应用于第一和第二信息以确定该指令是否导致可报告的危险。当导致可报告的危险时，报告该指令。

8.

发明申请
METHOD AND SYSTEM FOR RUN TIME DETECTION OF SHARED MEMORY DATA ACCESS HAZARDS 审中-公开
Title translation: 用于运行时间检测共享存储器数据访问危险的方法和系统

公开(公告)号：US20130304996A1

公开(公告)日：2013-11-14

申请号：US13728990

申请日：2012-12-27

Applicant: NVIDIA CORPORATION

Inventor： Vyas Venkataraman , Jaydeep Marathe , Manjunath Kudlur , Vinod Grover , Geoffrey Gerfin , Alban Douillet , Mayank Kaushik

IPC: G06F3/06

CPC classification number: G06F3/067 , G06F11/0715 , G06F11/073 , G06F11/0751

Abstract: A system and method for detecting shared memory hazards are disclosed. The method includes, for a unit of hardware operating on a block of threads, mapping a plurality of shared memory locations assigned to the unit to a tracking table. The tracking table comprises an initialization bit as well as access type information, collectively called the state tracking bits for each shared memory location. The method also includes, for an instruction of a program within a barrier region, identifying a second access to a location in shared memory within a block of threads executed by the hardware unit. The second access is identified based on a status of the state tracking bits. The method also includes determining a hazard based on a first type of access and a second type of access to the shared memory location. Information related to the first access is provided in the table.

Abstract translation: 公开了一种用于检测共享内存危害的系统和方法。该方法包括对于在线程块上操作的硬件单元，将分配给该单元的多个共享存储器位置映射到跟踪表。跟踪表包括初始化位以及访问类型信息，统称为用于每个共享存储器位置的状态跟踪位。该方法还包括对于屏障区域内的程序的指令，识别由硬件单元执行的线程块内的共享存储器中的位置的第二访问。基于状态跟踪位的状态来识别第二访问。该方法还包括基于第一类型的访问确定危险，以及对共享存储器位置进行第二类型的访问。与第一次访问相关的信息在表中提供。

9.

发明公开
CONFIDENTIAL COMPUTING USING MULTI-INSTANCING OF PARALLEL PROCESSORS 审中-公开

公开(公告)号：US20230297406A1

公开(公告)日：2023-09-21

申请号：US18123222

申请日：2023-03-17

Applicant: NVIDIA Corporation

Inventor： Philip Rogers , Mark Overby , Vyas Venkataraman , Naveen Cherukuri , James Leroy Deming , Gobikrishna Dhanuskodi , Dwayne Swoboda , Lucien Dunning , Aruna Manjunatha , Aaron Jiricek , Mark Hairgrove , Mike Woodmansee

IPC: G06F9/455

CPC classification number: G06F9/45558 , G06F2009/4557 , G06F2009/45587

Abstract: In examples, trusted execution environments (TEE) are provided for an instance of a parallel processing unit (PPU) as PPU TEEs. Different instances of a PPU correspond to different PPU TEEs, and provide accelerated confidential computing to a corresponding TEE. The processors of each PPU instance have separate and isolated paths through the memory system of the PPU which are assigned uniquely to an individual PPU instance. Data in device memory of the PPU may be isolated and access controlled amongst the PPU instances using one or more hardware firewalls. A GPU hypervisor assigns hardware resources to runtimes and performs access control and context switching for the runtimes. A PPU instance uses a cryptographic key to protect data for secure communication. Compute engines of the PPU instance are prevented from writing outside of a protected memory region. Access to a write protected region in PPU memory is blocked from other computing devices and/or device instances.

10.

发明申请
TECHNIQUES FOR AN EFFICIENT FABRIC ATTACHED MEMORY 有权

公开(公告)号：US20210133123A1

公开(公告)日：2021-05-06

申请号：US16673537

申请日：2019-11-04

Applicant: NVIDIA Corporation

Inventor： John FEEHRER , Denis Foley , Mark Hummel , Vyas Venkataraman , Ram Gummadi , Samuel H. Duncan , Glenn Dearth , Brian Kelleher

IPC: G06F13/16 , G06F13/40 , G06F12/1027 , G06F9/455 , G06N20/00 , G06F17/16

Abstract: Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as a graphics processing unit(s) (GPU)(s), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification