Spatial and temporal merging of remote atomic operations

    Publication number: US10572260B2

    Publication date: 2020-02-25

    Application number: US15858899

    Filing date: 2017-12-29

    Abstract: Disclosed embodiments relate to spatial and temporal merging of remote atomic operations. In one example, a system includes an RAO instruction queue stored in a memory and having entries grouped by destination cache line, each entry to enqueue an RAO instruction including an opcode, a destination identifier, and source data, optimization circuitry to receive an incoming RAO instruction, scan the RAO instruction queue to detect a matching enqueued RAO instruction identifying a same destination cache line as the incoming RAO instruction, the optimization circuitry further to, responsive to no matching enqueued RAO instruction being detected, enqueue the incoming RAO instruction; and, responsive to a matching enqueued RAO instruction being detected, determine whether the incoming and matching RAO instructions have a same opcode to non-overlapping cache line elements, and, if so, spatially combine the incoming and matching RAO instructions by enqueuing both RAO instructions in a same group of cache line queue entries at different offsets.
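
The enqueue-or-combine decision described in this abstract can be sketched as follows. This is a minimal software illustration, not the patented circuitry: the 64-byte line size, the `RaoInstruction` and `RaoQueue` names, the `size` field, and the string return values are all assumptions made for the example. The abstract does not say what happens when the opcodes differ or the elements overlap, so the sketch only reports that case and leaves it to the caller.

```python
from dataclasses import dataclass

LINE_BYTES = 64  # assumed cache line size; the abstract does not give one


@dataclass
class RaoInstruction:
    opcode: str     # e.g. "add" (hypothetical opcode names)
    dest_addr: int  # destination identifier, modeled here as a byte address
    size: int       # element size in bytes (assumed field)
    data: int       # source data

    @property
    def line(self) -> int:
        return self.dest_addr // LINE_BYTES

    @property
    def offset(self) -> int:
        return self.dest_addr % LINE_BYTES


def _overlap(a: RaoInstruction, b: RaoInstruction) -> bool:
    """True if the two instructions touch overlapping bytes of one line."""
    return a.offset < b.offset + b.size and b.offset < a.offset + a.size


class RaoQueue:
    """Queue whose entries are grouped by destination cache line."""

    def __init__(self) -> None:
        # cache line number -> {offset within line: enqueued instruction}
        self.groups: dict[int, dict[int, RaoInstruction]] = {}

    def submit(self, incoming: RaoInstruction) -> str:
        group = self.groups.get(incoming.line)
        if group is None:
            # No enqueued instruction targets this cache line: plain enqueue.
            self.groups[incoming.line] = {incoming.offset: incoming}
            return "enqueued"
        same_opcode = all(e.opcode == incoming.opcode for e in group.values())
        overlaps = any(_overlap(e, incoming) for e in group.values())
        if same_opcode and not overlaps:
            # Spatial combine: same group of entries, different offset.
            group[incoming.offset] = incoming
            return "spatially_combined"
        # Not covered by the abstract; left to the caller in this sketch.
        return "not_combined"
```

For example, submitting two `add` instructions to addresses `0x1000` and `0x1008` places both in the group for the same cache line at offsets 0 and 8.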

    Enabling Virtualization Of A Processor Resource
    Invention application (status: under examination, published)

    Publication number: US20150347168A1

    Publication date: 2015-12-03

    Application number: US14826351

    Filing date: 2015-08-14

    Abstract: In one embodiment, a processor includes an access logic to determine whether an access request from a virtual machine is to a device access page associated with a device of the processor and if so, to re-map the access request to a virtual device page in a system memory associated with the VM, based at least in part on information stored in a control register of the processor. Other embodiments are described and claimed.
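
The re-mapping check in this abstract can be illustrated with a short sketch. Everything concrete here is an assumption made for the example: the 4 KiB page size, the `remap_access` name, and the modeling of the control register as a dictionary with a `virtual_device_page` field. The abstract only states that the re-mapping is based at least in part on information stored in a control register.

```python
PAGE_SIZE = 4096  # assumed page size


def remap_access(addr: int, device_access_page: int, control_register: dict) -> int:
    """Return the address a VM access should actually reach.

    addr: address of the VM's access request.
    device_access_page: page number of the device access page.
    control_register: hypothetical model of the control register, holding
        the page number of this VM's virtual device page in system memory.
    """
    page, offset = divmod(addr, PAGE_SIZE)
    if page != device_access_page:
        # Not an access to the device access page: pass through unchanged.
        return addr
    # Redirect to the VM's virtual device page, keeping the in-page offset.
    return control_register["virtual_device_page"] * PAGE_SIZE + offset
```

An access that falls inside the device access page keeps its offset and lands in the VM's own virtual device page. Any other access is returned unchanged.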

    Processors, methods, systems, and instructions to atomically store to memory data wider than a natively supported data width

    Publication number: US11347680B2

    Publication date: 2022-05-31

    Application number: US17131729

    Filing date: 2020-12-22

    Abstract: A processor includes a widest set of data registers that corresponds to a given logical processor. Each of the data registers of the widest set have a first width in bits. A decode unit that corresponds to the given logical processor is to decode instructions that specify the data registers of the widest set, and is to decode an atomic store to memory instruction. The atomic store to memory instruction is to indicate data that is to have a second width in bits that is wider than the first width in bits. The atomic store to memory instruction is to indicate memory address information associated with a memory location. An execution unit is coupled with the decode unit. The execution unit, in response to the atomic store to memory instruction, is to atomically store the indicated data to the memory location.

    PROCESSORS, METHODS, SYSTEMS, AND INSTRUCTIONS TO ATOMICALLY STORE TO MEMORY DATA WIDER THAN A NATIVELY SUPPORTED DATA WIDTH

    Publication number: US20210117372A1

    Publication date: 2021-04-22

    Application number: US17131729

    Filing date: 2020-12-22

    Abstract: A processor includes a widest set of data registers that corresponds to a given logical processor. Each of the data registers of the widest set have a first width in bits. A decode unit that corresponds to the given logical processor is to decode instructions that specify the data registers of the widest set, and is to decode an atomic store to memory instruction. The atomic store to memory instruction is to indicate data that is to have a second width in bits that is wider than the first width in bits. The atomic store to memory instruction is to indicate memory address information associated with a memory location. An execution unit is coupled with the decode unit. The execution unit, in response to the atomic store to memory instruction, is to atomically store the indicated data to the memory location.

    Tracking deferred data packets in a debug trace architecture

    Publication number: US09632907B2

    Publication date: 2017-04-25

    Application number: US14566374

    Filing date: 2014-12-10

    CPC classification number: G06F11/3466 G06F9/30 G06F11/3636

    Abstract: A processing device implementing tracking of deferred data packets in a debug trace architecture is disclosed. The processing device is to determine an order number corresponding to an order in which an instruction was executed relative to other executed instructions that correspond to an instruction type within a sequence of executed instructions, identify a first data packet corresponding to a first packet type and sequentially ordered, according to the order number, with respect to data packets of the first packet type within a data trace log, identify a second data packet corresponding to a second packet type and sequentially ordered, according to the order number, with respect to data packets of the second packet type within the data trace log, and map the identified first and second data packets to the instruction, wherein at least one of the first or second data packets was generated post-retirement of the instruction.
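
The order-number mapping in this abstract can be sketched in software as follows. This is a simplified illustration under stated assumptions: a single instruction type emits exactly one packet of each listed packet type, the trace log is complete, and the names `map_packets`, `"store"`, `"addr"`, and `"data"` are hypothetical. The idea is that the n-th executed instruction of a given type is matched with the n-th packet of each packet type in the log, regardless of when those packets were written.

```python
from collections import defaultdict


def map_packets(instructions, trace_log, packet_types):
    """Map trace packets back to the instructions that produced them.

    instructions: list of (name, instr_type) in execution order.
    trace_log: list of (packet_type, payload) in log order.
    packet_types: dict of instr_type -> tuple of packet types it emits.
    """
    # Split the log into one ordered sequence per packet type.
    by_type = defaultdict(list)
    for ptype, payload in trace_log:
        by_type[ptype].append(payload)

    counters = defaultdict(int)  # instr_type -> next order number
    mapping = {}
    for name, itype in instructions:
        if itype not in packet_types:
            continue  # this instruction type emits no tracked packets
        n = counters[itype]  # order number among instructions of this type
        counters[itype] += 1
        # The n-th packet of each packet type belongs to this instruction.
        mapping[name] = {p: by_type[p][n] for p in packet_types[itype]}
    return mapping
```

With two stores interleaved with other instructions, and a log that lists both address packets before both deferred data packets, the first store receives the first address packet and the first data packet, and the second store receives the second of each.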
