-
1.
公开(公告)号:US20240211408A1
公开(公告)日:2024-06-27
申请号:US18087887
申请日:2022-12-23
Applicant: INTEL CORPORATION
Inventor: JOYDEEP RAKSHIT , ANANT VITHAL NORI , SREENIVAS SUBRAMONEY , HANNA ALAM , JOSEPH NUZMAN
IPC: G06F12/0891 , G06F12/1009
CPC classification number: G06F12/0891 , G06F12/1009 , G06F2212/1016
Abstract: Apparatus and method for probabilistic cacheline replacement for accelerating address translation. For example, one embodiment of a processor comprises: a plurality of cores, each core to process instructions; a cache to be shared by a subset of the plurality of cores, the cache comprising an N-way set associative cache for storing page table entry (PTE) cachelines and non-PTE cachelines; and a cache manager to implement a PTE-aware eviction policy for evicting cachelines from the cache, the PTE-aware eviction policy to cause a reduction of evictions of PTE cachelines during non-PTE cacheline fills.
-
2.
公开(公告)号:US20230205692A1
公开(公告)日:2023-06-29
申请号:US17561571
申请日:2021-12-23
Applicant: Intel Corporation
Inventor: ANANT NORI , RAHUL BERA , SHANKAR BALACHANDRAN , JOYDEEP RAKSHIT , Om Ji OMER , SREENIVAS SUBRAMONEY , AVISHAII ABUHATZERA , BELLIAPPA KUTTANNA
IPC: G06F12/0811 , G06F9/48 , G06F9/30 , G06N3/02
CPC classification number: G06F12/0811 , G06F9/4881 , G06F9/3012 , G06N3/02
Abstract: Apparatus and method for leveraging simultaneous multithreading for bulk compute operations. For example, one embodiment of a processor comprises: a plurality of cores including a first core to simultaneously process instructions of a plurality of threads; a cache hierarchy coupled to the first core and the memory, the cache hierarchy comprising a Level 1 (L1) cache, a Level 2 (L2) cache, and a Level 3 (L3) cache; and a plurality of compute units coupled to the first core including a first compute unit associated with the L1 cache, a second compute unit associated with the L2 cache, and a third compute unit associated with the L3 cache, wherein the first core is to offload instructions for execution by the compute units, the first core to offload instructions from a first thread to the first compute unit, instructions from a second thread to the second compute unit, and instructions from a third thread to the third compute unit.
-