-
公开(公告)号:US20240427728A1
公开(公告)日:2024-12-26
申请号:US18670721
申请日:2024-05-21
Applicant: Intel Corporation
Inventor: Vedvyas Shanbhogue , Stephen J. Robinson , Christopher D. Bryant , Jason W. Brandt
Abstract: A processor includes a widest set of data registers that corresponds to a given logical processor. Each of the data registers of the widest set have a first width in bits. A decode unit that corresponds to the given logical processor is to decode instructions that specify the data registers of the widest set, and is to decode an atomic store to memory instruction. The atomic store to memory instruction is to indicate data that is to have a second width in bits that is wider than the first width in bits. The atomic store to memory instruction is to indicate memory address information associated with a memory location. An execution unit is coupled with the decode unit. The execution unit, in response to the atomic store to memory instruction, is to atomically store the indicated data to the memory location.
-
公开(公告)号:US09996487B2
公开(公告)日:2018-06-12
申请号:US14751899
申请日:2015-06-26
Applicant: Intel Corporation
Inventor: Jose S. Niell , Daniel F. Cutter , Stephen J. Robinson , Mukesh K. Patel
IPC: G06F13/36 , G06F11/00 , G06F13/28 , G06F13/40 , G06F12/0868
CPC classification number: G06F13/28 , G06F12/0802 , G06F12/0811 , G06F12/0815 , G06F12/0868 , G06F13/4068 , G06F2212/1032 , G06F2212/608
Abstract: An apparatus having a fabric interconnect that supports multiple topologies and method for using the same are disclosed. In one embodiment, the apparatus comprises mode memory to store information indicative of one of the plurality of modes; and a first fabric operable in a plurality of modes, where the fabric comprises logic coupled to the mode memory to control processing of read and write requests to memory received by the first fabric according to the mode identified by the information indicative.
-
公开(公告)号:US20220405234A1
公开(公告)日:2022-12-22
申请号:US17827882
申请日:2022-05-30
Applicant: Intel Corporation
Inventor: Vedvyas Shanbhogue , Stephen J. Robinson , Christopher D. Bryant , Jason W. Brandt
Abstract: A processor includes a widest set of data registers that corresponds to a given logical processor. Each of the data registers of the widest set have a first width in bits. A decode unit that corresponds to the given logical processor is to decode instructions that specify the data registers of the widest set, and is to decode an atomic store to memory instruction. The atomic store to memory instruction is to indicate data that is to have a second width in bits that is wider than the first width in bits. The atomic store to memory instruction is to indicate memory address information associated with a memory location. An execution unit is coupled with the decode unit. The execution unit, in response to the atomic store to memory instruction, is to atomically store the indicated data to the memory location.
-
公开(公告)号:US20190205139A1
公开(公告)日:2019-07-04
申请号:US15858899
申请日:2017-12-29
Applicant: Intel Corporation
Inventor: Christopher J. Hughes , Joseph Nuzman , Jonas Svennebring , Doddaballapur N. Jayasimha , Samantika S. Sury , David A. Koufaty , Niall D. McDonnell , Yen-Cheng Liu , Stephen R. Van Doren , Stephen J. Robinson
IPC: G06F9/30
Abstract: Disclosed embodiments relate to spatial and temporal merging of remote atomic operations. In one example, a system includes an RAO instruction queue stored in a memory and having entries grouped by destination cache line, each entry to enqueue an RAO instruction including an opcode, a destination identifier, and source data, optimization circuitry to receive an incoming RAO instruction, scan the RAO instruction queue to detect a matching enqueued RAO instruction identifying a same destination cache line as the incoming RAO instruction, the optimization circuitry further to, responsive to no matching enqueued RAO instruction being detected, enqueue the incoming RAO instruction; and, responsive to a matching enqueued RAO instruction being detected, determine whether the incoming and matching RAO instructions have a same opcode to non-overlapping cache line elements, and, if so, spatially combine the incoming and matching RAO instructions by enqueuing both RAO instructions in a same group of cache line queue entries at different offsets.
-
公开(公告)号:US20200319886A1
公开(公告)日:2020-10-08
申请号:US16799619
申请日:2020-02-24
Applicant: Intel Corporation
Inventor: Christopher J. Hughes , Joseph Nuzman , Jonas Svennebring , Doddaballapur N. Jayasimha , Samantika S. Sury , David A. Koufaty , Niall D. McDonnell , Yen-Cheng Liu , Stephen R. Van Doren , Stephen J. Robinson
IPC: G06F9/30 , G06F12/0875
Abstract: Disclosed embodiments relate to spatial and temporal merging of remote atomic operations. In one example, a system includes an RAO instruction queue stored in a memory and having entries grouped by destination cache line, each entry to enqueue an RAO instruction including an opcode, a destination identifier, and source data, optimization circuitry to receive an incoming RAO instruction, scan the RAO instruction queue to detect a matching enqueued RAO instruction identifying a same destination cache line as the incoming RAO instruction, the optimization circuitry further to, responsive to no matching enqueued RAO instruction being detected, enqueue the incoming RAO instruction; and, responsive to a matching enqueued RAO instruction being detected, determine whether the incoming and matching RAO instructions have a same opcode to non-overlapping cache line elements, and, if so, spatially combine the incoming and matching RAO instructions by enqueuing both RAO instructions in a same group of cache line queue entries at different offsets.
-
公开(公告)号:US10067762B2
公开(公告)日:2018-09-04
申请号:US15201218
申请日:2016-07-01
Applicant: Intel Corporation
Inventor: Vikash Agarwal , Christopher D. Bryant , Jonathan D. Combs , Stephen J. Robinson
IPC: G06F9/30
Abstract: Apparatuses, methods, and systems relating to memory disambiguation are described. In one embodiment, a processor includes a decoder to decode an instruction into a decoded instruction, an execution unit to execute the decoded instruction, a retirement unit to retire an executed instruction in program order, and a memory disambiguation circuit to allocate an entry in a memory disambiguation table for a first load instruction that is to be flushed for a memory ordering violation, the entry comprising a counter value and an instruction pointer for the first load instruction.
-
公开(公告)号:US09619412B2
公开(公告)日:2017-04-11
申请号:US14826351
申请日:2015-08-14
Applicant: Intel Corporation
Inventor: Vedvyas Shanbhogue , Stephen J. Robinson
CPC classification number: G06F13/24 , G06F9/30101 , G06F9/45533 , G06F12/10 , G06F12/1027 , G06F12/1441 , G06F12/145 , G06F2212/1016 , G06F2212/152 , G06F2212/206
Abstract: In one embodiment, a processor includes an access logic to determine whether an access request from a virtual machine is to a device access page associated with a device of the processor and if so, to re-map the access request to a virtual device page in a system memory associated with the VM, based at least in part on information stored in a control register of the processor. Other embodiments are described and claimed.
-
8.
公开(公告)号:US20160378701A1
公开(公告)日:2016-12-29
申请号:US14751899
申请日:2015-06-26
Applicant: Intel Corporation
Inventor: Jose S. Niell , Daniel F. Cutter , Stephen J. Robinson , Mukesh K. Patel
CPC classification number: G06F13/28 , G06F12/0802 , G06F12/0811 , G06F12/0815 , G06F12/0868 , G06F13/4068 , G06F2212/1032 , G06F2212/608
Abstract: An apparatus having a fabric interconnect that supports multiple topologies and method for using the same are disclosed. In one embodiment, the apparatus comprises mode memory to store information indicative of one of the plurality of modes; and a first fabric operable in a plurality of modes, where the fabric comprises logic coupled to the mode memory to control processing of read and write requests to memory received by the first fabric according to the mode identified by the information indicative.
Abstract translation: 公开了一种具有支撑多种拓扑结构的结构互连的装置及其使用方法。 在一个实施例中,该装置包括用于存储指示多个模式之一的信息的模式存储器; 以及可以多种模式操作的第一结构,其中所述结构包括耦合到所述模式存储器的逻辑,以根据由所述信息指示识别的模式来控制对由所述第一结构接收的存储器的读取和写入请求的处理。
-
公开(公告)号:US11500636B2
公开(公告)日:2022-11-15
申请号:US16799619
申请日:2020-02-24
Applicant: Intel Corporation
Inventor: Christopher J. Hughes , Joseph Nuzman , Jonas Svennebring , Doddaballapur N. Jayasimha , Samantika S. Sury , David A. Koufaty , Niall D. McDonnell , Yen-Cheng Liu , Stephen R. Van Doren , Stephen J. Robinson
IPC: G06F9/30 , G06F12/0875
Abstract: Disclosed embodiments relate to spatial and temporal merging of remote atomic operations. In one example, a system includes an RAO instruction queue stored in a memory and having entries grouped by destination cache line, each entry to enqueue an RAO instruction including an opcode, a destination identifier, and source data, optimization circuitry to receive an incoming RAO instruction, scan the RAO instruction queue to detect a matching enqueued RAO instruction identifying a same destination cache line as the incoming RAO instruction, the optimization circuitry further to, responsive to no matching enqueued RAO instruction being detected, enqueue the incoming RAO instruction; and, responsive to a matching enqueued RAO instruction being detected, determine whether the incoming and matching RAO instructions have a same opcode to non-overlapping cache line elements, and, if so, spatially combine the incoming and matching RAO instructions by enqueuing both RAO instructions in a same group of cache line queue entries at different offsets.
-
公开(公告)号:US10901940B2
公开(公告)日:2021-01-26
申请号:US15089525
申请日:2016-04-02
Applicant: INTEL CORPORATION
Inventor: Vedvyas Shanbhogue , Stephen J. Robinson , Christopher D. Bryant , Jason W. Brandt
Abstract: A processor includes a widest set of data registers that corresponds to a given logical processor. Each of the data registers of the widest set have a first width in bits. A decode unit that corresponds to the given logical processor is to decode instructions that specify the data registers of the widest set, and is to decode an atomic store to memory instruction. The atomic store to memory instruction is to indicate data that is to have a second width in bits that is wider than the first width in bits. The atomic store to memory instruction is to indicate memory address information associated with a memory location. An execution unit is coupled with the decode unit. The execution unit, in response to the atomic store to memory instruction, is to atomically store the indicated data to the memory location.
-
-
-
-
-
-
-
-
-