-
公开(公告)号:US10761849B2
公开(公告)日:2020-09-01
申请号:US15273163
申请日:2016-09-22
Applicant: Intel Corporation
Inventor: Ching-Tsun Chou , Oleg Margulis , Tyler N. Sondag
IPC: G06F9/30
Abstract: A processor of an aspect includes a decode unit to decode a prior instruction that is to have at least a first context, and a subsequent instruction. The subsequent instruction is to be after the prior instruction in original program order. The decode unit is to use the first context of the prior instruction to determine a second context for the subsequent instruction. The processor also includes an execution unit coupled with the decode unit. The execution unit is to perform the subsequent instruction based at least in part on the second context. Other processors, methods, systems, and machine-readable medium are also disclosed.
-
公开(公告)号:US09727476B2
公开(公告)日:2017-08-08
申请号:US14635403
申请日:2015-03-02
Applicant: Intel Corporation
Inventor: Boris Ginzburg , Oleg Margulis
IPC: G09G5/36 , G06F12/0875 , G06T1/60
CPC classification number: G06F12/0875 , G06F2212/452 , G06T1/60
Abstract: A processor may support a two-dimensional (2-D) gather instruction and a 2-D cache. The processor may perform the 2-D gather instruction to access one or more sub-blocks of data from a 2-D image stored in a memory coupled to the processor. The 2-D cache may store the sub-blocks of data in a multiple cache lines. Further, the 2-D cache may support access of more than one cache lines while preserving a 2-D structure of the 2-D image.
-
3.
公开(公告)号:US09996356B2
公开(公告)日:2018-06-12
申请号:US14998299
申请日:2015-12-26
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Oleg Margulis , Jason M. Agron , Ethan Schuchman , Sebastian Winkel , Youfeng Wu , Gisle Dankel
CPC classification number: G06F9/30185 , G06F9/3826 , G06F9/3834 , G06F9/3838 , G06F9/3865
Abstract: Apparatus and method for detecting and recovering from incorrect memory dependence speculation in an out-of-order processor are described herein. For example, one embodiment of a method comprises: executing a first load instruction; detecting when the first load instruction experiences a bad store-to-load forwarding event during execution; tracking the occurrences of bad store-to-load forwarding event experienced by the first load instruction during execution; controlling enablement of an S-bit in the first load instruction based on the tracked occurrences; generating a plurality of load operations responsive to an enabled S-bit in first load instruction, wherein execution of the plurality of load operations produces a result equivalent to that from the execution of the first load instruction.
-
公开(公告)号:US20170351516A1
公开(公告)日:2017-12-07
申请号:US15175899
申请日:2016-06-07
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Oleg Margulis , Ching-Tsun Chou , Youfeng Wu
IPC: G06F9/30 , G06F9/38 , G06F9/32 , G06F12/0831 , G06F12/0875
CPC classification number: G06F9/30043 , G06F9/3017 , G06F9/327 , G06F9/3842 , G06F9/3851 , G06F9/3855
Abstract: A processor includes a front end including circuitry to decode instructions from an instruction stream, a data cache unit including circuitry to cache data for the processor, and a binary translator. The binary translator includes circuitry to identify a redundant store in the instruction stream, mark the start and end of a region of the instruction stream with the redundant store, remove the redundant store, and store an amended instruction stream with the redundant store removed.
-
公开(公告)号:US10853078B2
公开(公告)日:2020-12-01
申请号:US16231313
申请日:2018-12-21
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Mark Dechene , Zhongying Zhang , John Faistl , Janghaeng Lee , Hou-Jen Ko , Sebastian Winkel , Oleg Margulis
Abstract: A processor includes a store buffer to store store instructions to be processed to store data in main memory, a load buffer to store load instructions to be processed to load data from main memory, and a loop invariant code motion (LICM) protection structure coupled to the store buffer and the load buffer. The LPT tracks information to compare an address of a store or snoop microoperation with entries in the LICM and re-loads a load microoperation of a matching entry.
-
6.
公开(公告)号:US10120686B2
公开(公告)日:2018-11-06
申请号:US15175899
申请日:2016-06-07
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Oleg Margulis , Ching-Tsun Chou , Youfeng Wu
Abstract: A processor includes a front end including circuitry to decode instructions from an instruction stream, a data cache unit including circuitry to cache data for the processor, and a binary translator. The binary translator includes circuitry to identify a redundant store in the instruction stream, mark the start and end of a region of the instruction stream with the redundant store, remove the redundant store, and store an amended instruction stream with the redundant store removed.
-
7.
公开(公告)号:US20180081684A1
公开(公告)日:2018-03-22
申请号:US15273163
申请日:2016-09-22
Applicant: Intel Corporation
Inventor: Ching-Tsun Chou , Oleg Margulis , Tyler N. Sondag
IPC: G06F9/30
Abstract: A processor of an aspect includes a decode unit to decode a prior instruction that is to have at least a first context, and a subsequent instruction. The subsequent instruction is to be after the prior instruction in original program order. The decode unit is to use the first context of the prior instruction to determine a second context for the subsequent instruction. The processor also includes an execution unit coupled with the decode unit. The execution unit is to perform the subsequent instruction based at least in part on the second context. Other processors, methods, systems, and machine-readable medium are also disclosed.
-
公开(公告)号:US20170192788A1
公开(公告)日:2017-07-06
申请号:US14988298
申请日:2016-01-05
Applicant: Intel Corporation
Inventor: Oleg Margulis , Jason M. Agron , Tyler N. Sondag
IPC: G06F9/30
CPC classification number: G06F9/30185 , G06F9/30138 , G06F9/30174 , G06F9/4552
Abstract: A processing system implementing techniques for binary translation support using processor instruction prefixes is provided. In one embodiment, the processing system includes a register bank having a plurality of registers to store data for use in executing instructions and a processor core coupled to the register bank. An instruction to be executed by the processor core is received. The instruction is associated with a binary translator operation to translate input instruction sequences to output instruction sequences. An opcode prefix referencing an extended register of the plurality of registers to be used during the binary translator operation. The extended register preserves a source register value of the plurality of registers.
-
公开(公告)号:US10540178B2
公开(公告)日:2020-01-21
申请号:US15265587
申请日:2016-09-14
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Youfeng Wu , Sebastian Winkel , Oleg Margulis
IPC: G06F12/0875 , G06F9/30 , G06F9/38 , G06F9/32
Abstract: A processor for redundant stores includes a front end including circuitry to decode instructions from an instruction stream, a data cache unit including circuitry to cache data for the processor, a binary translator, and a memory execution unit. The binary translator includes circuitry to identify a first region of the instruction stream including a redundant store, mark a first starting instruction of the first region with a protection designator, mark a first ending instruction of the first region with a clear designator, and store an amended instruction stream with the markings. The memory execution unit includes circuitry to track the first redundant store based on the protection designator and the clear designator to eliminate the first redundant store.
-
公开(公告)号:US20180074827A1
公开(公告)日:2018-03-15
申请号:US15265587
申请日:2016-09-14
Applicant: Intel Corporation
Inventor: Vineeth Mekkat , Youfeng Wu , Sebastian Winkel , Oleg Margulis
IPC: G06F9/38 , G06F9/30 , G06F12/0875
Abstract: A processor for redundant stores includes a front end including circuitry to decode instructions from an instruction stream, a data cache unit including circuitry to cache data for the processor, a binary translator, and a memory execution unit. The binary translator includes circuitry to identify a first region of the instruction stream including a redundant store, mark a first starting instruction of the first region with a protection designator, mark a first ending instruction of the first region with a clear designator, and store an amended instruction stream with the markings. The memory execution unit includes circuitry to track the first redundant store based on the protection designator and the clear designator to eliminate the first redundant store.
-
-
-
-
-
-
-
-
-