Patent search ap:("NVIDIA Corporation") AND inv:"Alan Kaatz" Page 3

21.

发明公开
STORAGE OF TRANSFORMED TENSOR IN A CACHE 审中-公开

公开(公告)号：US20240169472A1

公开(公告)日：2024-05-23

申请号：US18086484

申请日：2022-12-21

Applicant: NVIDIA Corporation

Inventor： Harold Carter Edwards , Stephen Anthony Bernard Jones , Alexander Lev Minkin , Olivier Giroux , Gokul Ramaswamy Hirisave Chandra Shekhara , Aditya Avinash Atluri , Apoorv Parle , Ronny Meir Krashinsky , Alan Kaatz , Andrew Robert Kerr , Jack H. Choquette

IPC: G06T1/60 , G06F12/0811 , G06F12/0862 , G06T1/20

CPC classification number: G06T1/60 , G06F12/0811 , G06F12/0862 , G06T1/20 , G06F2212/62

Abstract: Apparatuses, systems, and techniques to perform a tensor prefetch instruction to cause one or more tensors to be transformed and stored into one or more caches. In at least one embodiment, one or more circuits of a GPU are to perform a tensor prefetch instruction to cause one or more tensors to be transformed and stored into one or more GPU caches.

22.

发明公开
APPLICATION PROGRAMMING INTERFACE TO TRANSLATE A TENSOR ACCORDING TO A TENSOR MAP 审中-公开

公开(公告)号：US20240168831A1

公开(公告)日：2024-05-23

申请号：US18086473

申请日：2022-12-21

Applicant: NVIDIA Corporation

Inventor： Harold Carter Edwards , Stephen Anthony Bernard Jones , Alexander Lev Minkin , Olivier Giroux , Gokul Ramaswamy Hirisave Chandra Shekhara , Aditya Avinash Atluri , Apoorv Parle , Chao Li , Ronny Meir Krashinsky , Alan Kaatz , Andrew Robert Kerr , Jack H. Choquette

IPC: G06F9/54

CPC classification number: G06F9/544

Abstract: Apparatuses, systems, and techniques to cause a first tensor to be translated into a second tensor according to a tensor map. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause a first tensor to be translated into a second tensor according to a tensor map.

23.

发明公开
APPLICATION PROGRAMMING INTERFACE TO GENERATE A TENSOR MAPPING 审中-公开

公开(公告)号：US20240168829A1

公开(公告)日：2024-05-23

申请号：US18086451

申请日：2022-12-21

Applicant: NVIDIA Corporation

Inventor： Harold Carter Edwards , Stephen Anthony Bernard Jones , Alexander Lev Minkin , Olivier Giroux , Gokul Ramaswamy Hirisave Chandra Shekhara , Vishalkumar Ketankumar Mehta , Aditya Avinash Atluri , Apoorv Parle , Ronny Meir Krashinsky , Alan Kaatz , Andrew Robert Kerr , Jack H. Choquette

IPC: G06F9/54

CPC classification number: G06F9/544

Abstract: Apparatuses, systems, and techniques to generate a tensor mapping. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause a mapping from a first tensor to a second tensor to be generated.

24.

发明公开
APPLICATION PROGRAMMING INTERFACE TO GENERATE A TENSOR ACCORDING TO A TENSOR MAP 审中-公开

公开(公告)号：US20240161224A1

公开(公告)日：2024-05-16

申请号：US18086469

申请日：2022-12-21

Applicant: NVIDIA Corporation

Inventor： Harold Carter Edwards , Stephen Anthony Bernard Jones , Alexander Lev Minkin , Olivier Giroux , Gokul Ramaswamy Hirisave Chandra Shekhara , Vishalkumar Ketankumar Mehta , Aditya Avinash Atluri , Apoorv Parle , Chao Li , Ronny Meir Krashinsky , Alan Kaatz , Andrew Robert Kerr , Jack H. Choquette

IPC: G06T1/20 , G06F9/54 , G06T1/60

CPC classification number: G06T1/20 , G06F9/541 , G06T1/60

Abstract: Apparatuses, systems, and techniques to cause a first tensor to be translated into a second tensor according to a tensor map without storing information about a memory transaction corresponding to the translation. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause a first tensor to be translated into a second tensor according to a tensor map without storing information about one or more memory transactions corresponding to the translation.

25.

发明公开
APPLICATION PROGRAMMING INTERFACE TO INDICATE IMAGE-TO-COLUMN TRANSFORMATION 审中-公开

公开(公告)号：US20240161222A1

公开(公告)日：2024-05-16

申请号：US18086457

申请日：2022-12-21

Applicant: NVIDIA Corporation

Inventor： Harold Carter Edwards , Stephen Anthony Bernard Jones , Alexander Lev Minkin , Olivier Giroux , Gokul Ramaswamy Hirisave Chandra Shekhara , Vishalkumar Ketankumar Mehta , Aditya Avinash Atluri , Apoorv Parle , Ronny Meir Krashinsky , Alan Kaatz , Andrew Robert Kerr , Jack H. Choquette

IPC: G06T1/20 , G06F9/54 , G06F17/16 , G06T1/60

CPC classification number: G06T1/20 , G06F9/541 , G06F17/16 , G06T1/60

Abstract: Apparatuses, systems, and techniques to indicate how to generate image-to-column transformations. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to indicate how to generate one or more image-to-column transformations.

26.

发明公开
THREAD SPECIALIZATION FOR COLLABORATIVE DATA TRANSFER AND COMPUTATION 审中-公开

公开(公告)号：US20230140934A1

公开(公告)日：2023-05-11

申请号：US17689660

申请日：2022-03-08

Applicant: NVIDIA Corporation

Inventor： Chao Li , Jing Li , Alan Kaatz , Ronny Meir Krashinsky , Albert Xu

IPC: G06F9/54 , G06F9/48 , G06F9/52

CPC classification number: G06F9/544 , G06F9/52 , G06F9/4843

Abstract: Apparatuses, systems, and techniques to perform a matrix multiplication using parallel processing. In at least one embodiment, a matrix multiplication is divided into a set of tiles, with each tile processed with a prolog task, a calculation task, and an epilog task. The prolog tasks are performed by a dedicated set of threads, with the remaining tasks performed in an interleaved manner using two or more thread groups.

27.

发明申请
NEURAL NETWORK DATA REPLACEMENT 有权

公开(公告)号：US20230110438A1

公开(公告)日：2023-04-13

申请号：US17497507

申请日：2021-10-08

Applicant: Nvidia Corporation

Inventor： Pradeep Ramani , Alex Minkin , Alan Kaatz , Yang Xu , Ronny Krashinsky

IPC: G06N3/04 , G06N3/10

Abstract: Apparatuses, systems, and techniques are presented to perform one or more operations. In at least one embodiment, one or more data values, to be used by one or more neural networks, are caused to be replaced by one or more invalid data values.

28.

发明申请
TECHNIQUES FOR EFFICIENTLY TRANSFERRING DATA TO A PROCESSOR 有权

公开(公告)号：US20210124582A1

公开(公告)日：2021-04-29

申请号：US16712083

申请日：2019-12-12

Applicant: NVIDIA Corporation

Inventor： Andrew Kerr , Jack Choquette , Xiaogang Qiu , Omkar Paranjape , Poornachandra Rao , Shirish Gadre , Steven J. Heinrich , Manan Patel , Olivier Giroux , Alan Kaatz

IPC: G06F9/30 , G06F12/0888 , G06F12/0808

Abstract: A technique for block data transfer is disclosed that reduces data transfer and memory access overheads and significantly reduces multiprocessor activity and energy consumption. Threads executing on a multiprocessor needing data stored in global memory can request and store the needed data in on-chip shared memory, which can be accessed by the threads multiple times. The data can be loaded from global memory and stored in shared memory using an instruction which directs the data into the shared memory without storing the data in registers and/or cache memory of the multiprocessor during the data transfer.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification