-
公开(公告)号:US11150979B2
公开(公告)日:2021-10-19
申请号:US16539529
申请日:2019-08-13
Applicant: Intel Corporation
Inventor: Zeev Sperber , Stanislav Shwartsman , Jared W. Stark, IV , Lihu Rappoport , Igor Yanover , George Leifman
Abstract: A method for handling load faults in an out-of-order processor is described. The method includes detecting, by a memory ordering buffer of the out-of-order processor, a load fault corresponding to a load instruction that was executed out-of-order by the out-of-order processor; determining, by the memory ordering buffer, whether instant reclamation is available for resolving the load fault of the load instruction; and performing, in response to determining that instant reclamation is available for resolving the load fault of the load instruction, instant reclamation to re-fetch the load instruction for execution prior to attempting to retire the load instruction.
-
公开(公告)号:US20210209023A1
公开(公告)日:2021-07-08
申请号:US17146440
申请日:2021-01-11
Applicant: Intel Corporation
Inventor: Ron Gabor , Igor Yanover
IPC: G06F12/0811 , G06F9/30 , G06F9/34
Abstract: Systems, methods, and apparatuses relating to linear address masking architecture are described. In one embodiment, a hardware processor includes an address generation unit to generate a linear address for a memory access request to a memory, at least one control register comprising a user mode masking bit and a supervisor mode masking bit, a register comprising a current privilege level indication, and a memory management unit to mask out a proper subset of bits inside an address space of the linear address for the memory access request based on the current privilege level indication and either of the user mode masking bit or the supervisor mode masking bit to produce a resultant linear address, and output the resultant linear address.
-
公开(公告)号:US20210034544A1
公开(公告)日:2021-02-04
申请号:US16813346
申请日:2020-03-09
Applicant: INTEL CORPORATION
Inventor: Stanislav Shwartsman , Igor Yanover , Assaf Zaltsman , Ron Rais
IPC: G06F12/1027 , G06F9/30
Abstract: Systems, methods, and apparatuses relating to hardware for split data translation lookaside buffers. In one embodiment, a processor includes a decode circuit to decode instructions into decoded instructions, an execution circuit to execute the decoded instructions, and a memory circuit comprising a load data translation lookaside buffer circuit and a store data translation lookaside buffer circuit separate and distinct from the load data translation lookaside buffer circuit, wherein the memory circuit sends a memory access request of the instructions to the load data translation lookaside buffer circuit when the memory access request is a load data request and to the store data translation lookaside buffer circuit when the memory access request is a store data request to determine a physical address for a virtual address of the memory access request.
-
公开(公告)号:US20200310794A1
公开(公告)日:2020-10-01
申请号:US16368973
申请日:2019-03-29
Applicant: Intel Corporation
Inventor: ZEEV SPERBER , Amit Gradstein , Simon Rubanovich , Igor Yanover , Gavri Berger , Eyal Hadas , Saeed Kharouf , Ron Schneider , Sagi Meller , Jose Yallouz
Abstract: The present disclosure is directed to systems and methods for performing one or more operations on a two dimensional tile register using an accelerator that includes a tiled matrix multiplication unit (TMU). The processor circuitry includes reservation station (RS) circuitry to communicatively couple the processor circuitry to the TMU. The RS circuitry coordinates the operations performed by the TMU. TMU dispatch queue (TDQ) circuitry in the TMU maintains the operations received from the RS circuitry in the order that the operations are received from the RS circuitry. Since the duration of each operation is not known prior to execution by the TMU, the RS circuitry maintains shadow dispatch queue (RS-TDQ) circuitry that mirrors the operations in the TDQ circuitry. Communication between the RS circuitry 134 and the TMU provides the RS circuitry with notification of successfully executed operations and allows the RS circuitry to cancel operations where the operations are associated with branch mispredictions and/or non-retired speculatively executed instructions.
-
公开(公告)号:US20180024925A1
公开(公告)日:2018-01-25
申请号:US15214895
申请日:2016-07-20
Applicant: INTEL CORPORATION
Inventor: Raanan Sade , Joseph Nuzman , Stanislav Shwartsman , Igor Yanover , Liron Zur
IPC: G06F12/0815 , G06F12/0893
CPC classification number: G06F12/0815 , G06F12/0833 , G06F12/0893 , G06F2212/1016 , G06F2212/60 , G06F2212/62
Abstract: An example system on a chip (SoC) includes a processor, a cache, and a main memory. The SoC can include a first memory to store data in a memory line, wherein the memory line is set to an invalid state. The processor can include a processor coupled to the first memory. The processor can determine that a data size of a first data set received from an application is within a data size range. The processor can determine that an aggregate data size of the first data set and a second data set received from the application is at least a same data size as data size of the memory line. The processor can perform an invalid-to-modify (I2M) operation to change the memory line from the invalid state to a modified state. The processor can write the first data set and the second data set to the memory line.
-
-
-
-