Abstract:
Embodiments are generally directed to compression in machine learning and deep learning processing. An embodiment of an apparatus for compression of untyped data includes a graphics processing unit (GPU) including a data compression pipeline, the data compression pipeline including a data port coupled with one or more shader cores, wherein the data port is to allow transfer of untyped data without format conversion, and a 3D compression/decompression unit to provide for compression of untyped data to be stored to a memory subsystem and decompression of untyped data from the memory subsystem.
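As a rough illustration of the data path this abstract describes, the sketch below treats the traffic as raw bytes end to end: the port hands untyped data straight to a compression unit with no intermediate format conversion. A trivial run-length codec stands in for the hardware block codec, and all names (`CompressedBlock`, `compress_untyped`, and so on) are hypothetical, not taken from the patent.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for data moved through the port: untyped bytes,
// never reinterpreted as any surface or texel format.
struct CompressedBlock {
    std::vector<uint8_t> payload;
    size_t original_size;  // needed to restore the block on read-back
};

// Stand-in for the compression unit: a trivial run-length encoder.
// Real hardware would apply a lossless block codec before the write
// to the memory subsystem.
CompressedBlock compress_untyped(const uint8_t* data, size_t size) {
    CompressedBlock out{{}, size};
    for (size_t i = 0; i < size;) {
        uint8_t value = data[i];
        size_t run = 1;
        while (i + run < size && data[i + run] == value && run < 255) ++run;
        out.payload.push_back(static_cast<uint8_t>(run));
        out.payload.push_back(value);
        i += run;
    }
    return out;
}

// Inverse path: decompress on the read from the memory subsystem.
std::vector<uint8_t> decompress_untyped(const CompressedBlock& block) {
    std::vector<uint8_t> out;
    out.reserve(block.original_size);
    for (size_t i = 0; i + 1 < block.payload.size(); i += 2)
        out.insert(out.end(), block.payload[i], block.payload[i + 1]);
    return out;
}
```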
Abstract:
Technologies for dynamic acceleration of general-purpose code include a computing device having a general-purpose processor core and one or more hardware accelerators. The computing device identifies an acceleration candidate in an application that is targeted to the processor core. The acceleration candidate may be a long-running computation of the application. The computing device translates the acceleration candidate into a translated executable targeted to the hardware accelerator. The computing device determines whether to offload execution of the acceleration candidate and, if so, executes the translated executable with the hardware accelerator. The computing device may translate the acceleration candidate into multiple translated executables, each targeted to a different hardware accelerator. The computing device may select among the translated executables in response to determining to offload execution. The hardware accelerators may include, for example, processor graphics, an image signal processor, or a field-programmable gate array. Other embodiments are described and claimed.
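A minimal sketch of the offload decision described above, assuming the candidate has already been translated once per target. `TranslatedExecutable`, `dispatch`, and the fixed speedup estimate are hypothetical stand-ins, not the patented mechanism; a real system would consult a cost model or runtime telemetry rather than a constant.

```cpp
#include <functional>
#include <string>
#include <vector>

// One acceleration candidate, translated for one target.
struct TranslatedExecutable {
    std::string target;            // e.g. "gpu", "isp", "fpga"
    std::function<void()> run;     // stand-in for the translated binary
    double estimated_speedup;      // relative to the processor core
};

// Select among the translated executables, or fall back to the
// general-purpose core when no accelerator is expected to win.
void dispatch(const std::vector<TranslatedExecutable>& candidates,
              const std::function<void()>& cpu_fallback) {
    const TranslatedExecutable* best = nullptr;
    for (const auto& c : candidates)
        if (!best || c.estimated_speedup > best->estimated_speedup)
            best = &c;
    if (best && best->estimated_speedup > 1.0) best->run();
    else cpu_fallback();
}

int main() {
    std::vector<TranslatedExecutable> xs = {
        {"gpu",  [] { /* run GPU-targeted executable */ },  6.0},
        {"fpga", [] { /* run FPGA-targeted executable */ }, 3.5},
    };
    dispatch(xs, [] { /* execute original code on the core */ });
}
```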
Abstract:
An inter-architecture compatibility apparatus of an aspect includes a control flow transfer reception module to receive a first call procedure operation, intended for a first architecture library module, from a first architecture code module. The first call procedure operation involves a first plurality of input parameters. An application binary interface (ABI) change module is coupled with the control flow transfer reception module. The ABI change module makes ABI changes to convert the first call procedure operation involving the first plurality of input parameters to a corresponding second call procedure operation involving a second plurality of input parameters. The second call procedure operation is compatible with a second architecture library module. A control flow transfer output module is coupled with the ABI change module. The control flow transfer output module provides the second call procedure operation to the second architecture library module.
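By way of illustration only, the shim below does in software what the abstract's modules do in concert: it receives a call as the source architecture made it, adjusts the parameters to what the target library expects, and forwards the call. The 32-bit-to-64-bit widening is an invented example of an ABI difference, and the function names are hypothetical.

```cpp
#include <cstdint>
#include <iostream>

// Second-architecture library routine: note the wider parameter types
// than the first-architecture call below.
int64_t target_lib_add(int64_t a, int64_t b) { return a + b; }

// Hypothetical ABI shim: receives the first call procedure operation
// (two 32-bit input parameters), applies the ABI changes the target
// expects (widening to 64 bits), and transfers control onward.
int32_t abi_shim_add(int32_t a, int32_t b) {
    int64_t result = target_lib_add(static_cast<int64_t>(a),
                                    static_cast<int64_t>(b));
    return static_cast<int32_t>(result);  // narrow the return value
}

int main() {
    std::cout << abi_shim_add(2, 3) << '\n';  // prints 5
}
```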
Abstract:
The present disclosure is directed to systems and methods for decomposing systolic array circuitry to provide a plurality of N×N systolic sub-array circuits, apportioning a first tensor or array into a plurality of N×M first input arrays, and apportioning a second tensor or array into a plurality of M×N second input arrays. Systolic array control circuitry transfers corresponding ones of the first input arrays and second input arrays to a respective one of the plurality of N×N systolic sub-array circuits. As the elements included in the first input array and the elements included in the second input array are transferred to the systolic sub-array, the systolic sub-array performs one or more mathematical operations using the first and second input arrays. The systems and methods improve utilization of the systolic array circuitry, thereby reducing the number of clock cycles needed to perform a given number of calculations.
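The core computation each sub-array performs can be sketched in plain code. Below, one call to `matmul_tile` stands in for a single N×N systolic sub-array consuming an N×M tile of the first tensor and an M×N tile of the second; in hardware the elements stream through the array rather than sitting in memory, and the tile sizes and names here are illustrative assumptions.

```cpp
#include <array>
#include <cstddef>
#include <iostream>

constexpr size_t N = 2, M = 4;  // illustrative tile dimensions

using TileA = std::array<std::array<int, M>, N>;  // N x M first input
using TileB = std::array<std::array<int, N>, M>;  // M x N second input
using TileC = std::array<std::array<int, N>, N>;  // N x N result

// One sub-array's work: multiply an N x M tile by an M x N tile,
// producing the N x N partial result that sub-array holds.
TileC matmul_tile(const TileA& a, const TileB& b) {
    TileC c{};
    for (size_t i = 0; i < N; ++i)
        for (size_t j = 0; j < N; ++j)
            for (size_t k = 0; k < M; ++k)
                c[i][j] += a[i][k] * b[k][j];
    return c;
}

int main() {
    TileA a = {{{1, 2, 3, 4}, {5, 6, 7, 8}}};
    TileB b = {{{1, 0}, {0, 1}, {1, 0}, {0, 1}}};
    TileC c = matmul_tile(a, b);
    for (const auto& row : c) {
        for (int v : row) std::cout << v << ' ';
        std::cout << '\n';
    }
}
```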
Abstract:
Systems, apparatuses, and methods for a hardware and software system to automatically decompose a program into multiple parallel threads are described. In some embodiments, the systems and apparatuses execute a method of original code decomposition and/or generated thread execution.
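The abstract gives no algorithmic detail, but the kind of transformation such a system automates can be shown by hand: a serial reduction decomposed into per-thread partial results that are combined afterwards. This is a generic illustration, not the patented decomposition method; `parallel_sum` and the chunking scheme are assumptions.

```cpp
#include <iostream>
#include <numeric>
#include <thread>
#include <vector>

// Decompose a serial reduction into per-thread partial sums, then
// combine the partials; each worker owns a disjoint chunk of the data.
long parallel_sum(const std::vector<int>& data, unsigned num_threads) {
    std::vector<long> partial(num_threads, 0);
    std::vector<std::thread> workers;
    size_t chunk = data.size() / num_threads;
    for (unsigned t = 0; t < num_threads; ++t) {
        size_t begin = t * chunk;
        size_t end = (t + 1 == num_threads) ? data.size() : begin + chunk;
        workers.emplace_back([&, t, begin, end] {
            partial[t] = std::accumulate(data.begin() + begin,
                                         data.begin() + end, 0L);
        });
    }
    for (auto& w : workers) w.join();
    return std::accumulate(partial.begin(), partial.end(), 0L);
}

int main() {
    std::vector<int> data(1000, 1);
    std::cout << parallel_sum(data, 4) << '\n';  // prints 1000
}
```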