-
公开(公告)号:US20210056028A1
公开(公告)日:2021-02-25
申请号:US17068754
申请日:2020-10-12
Applicant: Intel Corporation
Inventor: JOYDEEP RAY , James Valerio , Ben Ashbaugh , Lakshminarayanan Striramassarma
IPC: G06F12/0811 , G06F3/06 , G06F9/38 , G06F9/54
Abstract: Embodiments described herein provide a general purpose graphics processor comprising a plurality of tiles, each tile of the plurality of tiles comprising at least one execution unit, a local cache, and a cache control unit, and a high bandwidth memory communicatively coupled to the plurality of tiles, wherein the high bandwidth memory is shared between the plurality of tiles. The cache control unit is to implement a partial write management protocol to receive a partial write operation directed to a cache line in the local cache, the partial write operation comprising write data, write the data associated with the partial write operation to the local cache when the cache line is in a modified state, and forward the write data associated with the partial write operation to the high bandwidth memory when the partial write operation triggers a cache miss or when the cache line is in an exclusive state or a shared state. Other embodiments may be described and claimed.
-
公开(公告)号:US20200210472A1
公开(公告)日:2020-07-02
申请号:US16696854
申请日:2019-11-26
Applicant: Intel Corporation
Inventor: BARNAN DAS , MAYURESH M. VARERKAR , NARAYAN BISWAL , STANLEY J. BARAN , GOKCEN CILINGIR , NILESH V. SHAH , ARCHIE SHARMA , SHERINE ABDELHAK , PRANEETHA KOTHA , NEELAY PANDIT , JOHN C. WEAST , MIKE B. MACPHERSON , DUKHWAN KIM , LINDA L. HURD , ABHISHEK R. APPU , ALTUG KOKER , JOYDEEP RAY
IPC: G06F16/583 , G06F16/783 , G06K9/00 , G06K9/62
Abstract: A mechanism is described for facilitating recognition, reidentification, and security in machine learning at autonomous machines. A method of embodiments, as described herein, includes facilitating a camera to detect one or more objects within a physical vicinity, the one or more objects including a person, and the physical vicinity including a house, where detecting includes capturing one or more images of one or more portions of a body of the person. The method may further include extracting body features based on the one or more portions of the body, comparing the extracted body features with feature vectors stored at a database, and building a classification model based on the extracted body features over a period of time to facilitate recognition or reidentification of the person independent of facial recognition of the person.
-
公开(公告)号:US20190391937A1
公开(公告)日:2019-12-26
申请号:US16453995
申请日:2019-06-26
Applicant: Intel Corporation
Inventor: NIRANJAN L. COORAY , ABHISHEK R. APPU , ALTUG KOKER , JOYDEEP RAY , BALAJI VEMBU , PATTABHIRAMAN K , DAVID PUFFER , DAVID J. COWPERTHWAITE , RAJESH M. SANKARAN , SATYESHWAR SINGH , SAMEER KP , ANKUR N. SHAH , KUN TIAN
IPC: G06F13/16 , G06F13/40 , G06F12/0802 , G06F12/1036 , G06F12/1027 , G06F12/1009
Abstract: An apparatus and method are described for implementing memory management in a graphics processing system. For example, one embodiment of an apparatus comprises: a first plurality of graphics processing resources to execute graphics commands and process graphics data; a first memory management unit (MMU) to communicatively couple the first plurality of graphics processing resources to a system-level MMU to access a system memory; a second plurality of graphics processing resources to execute graphics commands and process graphics data; a second MMU to communicatively couple the second plurality of graphics processing resources to the first MMU; wherein the first MMU is configured as a master MMU having a direct connection to the system-level MMU and the second MMU comprises a slave MMU configured to send memory transactions to the first MMU, the first MMU either servicing a memory transaction or sending the memory transaction to the system-level MMU on behalf of the second MMU.
-
公开(公告)号:US20190332869A1
公开(公告)日:2019-10-31
申请号:US16379176
申请日:2019-04-09
Applicant: Intel Corporation
Inventor: MAYURESH M. VARERKAR , BARNAN DAS , NARAYAN BISWAL , STANLEY J. BARAN , GOKCEN CILINGIR , NILESH V. SHAH , ARCHIE SHARMA , SHERINE ABDELHAK , SACHIN GODSE , FARSHAD AKHBARI , NARAYAN SRINIVASA , ALTUG KOKER , NADATHUR RAJAGOPALAN SATISH , DUKHWAN KIM , FENG CHEN , ABHISHEK R. APPU , JOYDEEP RAY , PING T. TANG , MICHAEL S. STRICKLAND , XIAOMING CHEN , ANBANG YAO , TATIANA SHPEISMAN , VASANTH RANGANATHAN , SANJEEV JAHAGIRDAR
Abstract: A mechanism is described for facilitating person tracking and data security in machine learning at autonomous machines. A method of embodiments, as described herein, includes detecting, by a camera associated with one or more trackers, a person within a physical vicinity, where detecting includes capturing one or more images the person. The method may further include tracking, by the one or more trackers, the person based on the one or more images of the person, where tracking includes collect tracking data relating to the person. The method may further include selecting a tracker of the one or more trackers as a preferred tracker based on the tracking data.
-
公开(公告)号:US20180300964A1
公开(公告)日:2018-10-18
申请号:US15488914
申请日:2017-04-17
Applicant: Intel Corporation
Inventor: BARATH LAKSHAMANAN , LINDA L. HURD , BEN J. ASHBAUGH , ELMOUSTAPHA OULD-AHMED-VALL , LIWEI MA , JINGYI JIN , JUSTIN E. GOTTSCHLICH , CHANDRASEKARAN SAKTHIVEL , MICHAEL S. STRICKLAND , BRIAN T. LEWIS , LINDSEY KUPER , ALTUG KOKER , ABHISHEK R. APPU , PRASOONKUMAR SURTI , JOYDEEP RAY , BALAJI VEMBU , JAVIER S. TUREK , NAILA FAROOQUI
CPC classification number: G07C5/008 , B60W30/00 , G01C21/34 , G01S19/13 , G05D1/0088 , G05D2201/0213 , G06F9/5027 , G06F2209/509 , G06N20/00 , G08G1/0112 , G08G1/012 , G08G1/052 , H04L43/0852 , H04L67/12 , H04W28/08
Abstract: One embodiment provides for a computing device within an autonomous vehicle, the compute device comprising a wireless network device to enable a wireless data connection with an autonomous vehicle network, a set of multiple processors including a general-purpose processor and a general-purpose graphics processor, the set of multiple processors to execute a compute manager to manage execution of compute workloads associated with the autonomous vehicle, the compute workload associated with autonomous operations of the autonomous vehicle, and offload logic configured to execute on the set of multiple processors, the offload logic to determine to offload one or more of the compute workloads to one or more autonomous vehicles within range of the wireless network device.
-
公开(公告)号:US20180293690A1
公开(公告)日:2018-10-11
申请号:US15482685
申请日:2017-04-07
Applicant: Intel Corporation
Inventor: JOYDEEP RAY , ABHISHEK R. APPU , ALTUG KOKER , BALAJI VEMBU
IPC: G06T1/20 , G06T1/60 , G06F12/0875
CPC classification number: G06T1/20 , G06F12/0811 , G06F12/0815 , G06F12/0831 , G06F12/0875 , G06F12/0888 , G06F2212/1024 , G06F2212/302 , G06F2212/455 , G06F2212/621 , G06T1/60
Abstract: An apparatus and method are described for managing data which is biased towards a processor or a GPU. For example, one embodiment of an apparatus comprises: a processor comprising one or more cores to execute instructions and process data, one or more cache levels, and cache coherence controllers to maintain coherent data in the one or more cache levels; a graphics processing unit (GPU) to execute graphics instructions and process graphics data, wherein the GPU and processor cores are to share a virtual address space for accessing a system memory; a GPU memory coupled to the GPU, the GPU memory addressable through the virtual address space shared by the processor cores and GPU; and bias management circuitry to store an indication, for each of a plurality of blocks of data, whether the data has a processor bias or a GPU bias, wherein if the data has a GPU bias, then the data is to be accessed by the GPU from the GPU memory without necessarily accessing the processor's cache coherence controllers and wherein requests for the data from the processor cores are processed as uncached requests, preventing the data from being cached in the one or more cache levels of the processor.
-
公开(公告)号:US20180293362A1
公开(公告)日:2018-10-11
申请号:US15482514
申请日:2017-04-07
Applicant: Intel Corporation
Inventor: JOYDEEP RAY , Abhishek R. Appu , Pattabhiraman K , Balaji Vembu , Altug Koker
IPC: G06F21/10 , G06F9/455 , H04N21/4405 , G06T15/00 , H04N19/00 , G06F12/0815
CPC classification number: G06F21/10 , G06F9/455 , G06F12/0815 , G06F12/0895 , G06F12/1425 , G06F2009/45587 , G06F2212/455 , G06T15/005 , H04L2209/603 , H04L2209/605 , H04N19/00 , H04N21/4405
Abstract: An apparatus and method for protecting content in a graphics processor. For example, one embodiment of an apparatus comprises: encode/decode circuitry to decode protected audio and/or video content to generate decoded audio and/or video content; a graphics cache of a graphics processing unit (GPU) to store the decoded audio and/or video content; first protection circuitry to set a protection attribute for each cache line containing the decoded audio and/or video data in the graphics cache; a cache coherency controller to generate a coherent read request to the graphics cache; second protection circuitry to read the protection attribute to determine whether the cache line identified in the read request is protected, wherein if it is protected, the second protection circuitry to refrain from including at least some of the data from the cache line in a response.
-
公开(公告)号:US20250028675A1
公开(公告)日:2025-01-23
申请号:US18791963
申请日:2024-08-01
Applicant: Intel Corporation
Inventor: JOYDEEP RAY , SELVAKUMAR PANNEER , SAURABH TANGRI , BEN ASHBAUGH , SCOTT JANUS , ABHISHEK APPU , VARGHESE GEORGE , RAVISHANKAR IYER , NILESH JAIN , PATTABHIRAMAN K , ALTUG KOKER , MIKE MACPHERSON , JOSH MASTRONARDE , ELMOUSTAPHA OULD-AHMED-VALL , JAYAKRISHNA P. S , ERIC SAMSON
IPC: G06F15/78 , G06F7/544 , G06F7/575 , G06F7/58 , G06F9/30 , G06F9/38 , G06F9/50 , G06F12/02 , G06F12/06 , G06F12/0802 , G06F12/0804 , G06F12/0811 , G06F12/0862 , G06F12/0866 , G06F12/0871 , G06F12/0875 , G06F12/0882 , G06F12/0888 , G06F12/0891 , G06F12/0893 , G06F12/0895 , G06F12/0897 , G06F12/1009 , G06F12/128 , G06F15/80 , G06F17/16 , G06F17/18 , G06N3/08 , G06T1/20 , G06T1/60 , G06T15/06 , H03M7/46
Abstract: Embodiments described herein include software, firmware, and hardware that provides techniques to enable deterministic scheduling across multiple general-purpose graphics processing units. One embodiment provides a multi-GPU architecture with uniform latency. One embodiment provides techniques to distribute memory output based on memory chip thermals. One embodiment provides techniques to enable thermally aware workload scheduling. One embodiment provides techniques to enable end to end contracts for workload scheduling on multiple GPUs.
-
公开(公告)号:US20240053985A1
公开(公告)日:2024-02-15
申请号:US18485089
申请日:2023-10-11
Applicant: Intel Corporation
Inventor: SUBRAMANIAM MAIYURAN , VARGHESE GEORGE , JOYDEEP RAY , ASHUTOSH GARG , JORGE PARRA , SHUBH SHAH , SHUBRA MARWAHA
CPC classification number: G06F9/3001 , G06F9/5011 , G06F17/16 , G06F9/3013 , G06F9/30036
Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a shared local memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive an instruction to initiate a matrix multiplication operation, write a first set of matrix data into a first set of registers, and share the first set of matrix data between the first processing resource and the second processing resource for use in the matrix multiplication operation. Other embodiments may be described and claimed.
-
公开(公告)号:US20230401064A1
公开(公告)日:2023-12-14
申请号:US18347964
申请日:2023-07-06
Applicant: Intel Corporation
Inventor: JAMES VALERIO , VASANTH RANGANATHAN , JOYDEEP RAY , PRADEEP RAMANI
CPC classification number: G06F9/3802 , G06T1/20 , G06F13/28
Abstract: A graphics processing device is provided that includes a set of compute units to execute a workload, a cache coupled with the set of compute units, and circuitry coupled with the cache and the set of compute units. The circuitry is configured to, in response to a cache miss for the read from a first cache, broadcast an event within the graphics processor device to identify data associated with the cache miss, receive the event at a second compute unit in the set of compute units, and prefetch the data identified by the event into a second cache that is local to the second compute unit before an attempt to read the instruction or data by the second thread.
-
-
-
-
-
-
-
-
-