System, method, and apparatus for enhanced pointer identification and prefetching

    公开(公告)号:US11693780B2

    公开(公告)日:2023-07-04

    申请号:US17391962

    申请日:2021-08-02

    CPC classification number: G06F12/0862 G06F2212/602

    Abstract: System and method for prefetching pointer-referenced data. A method embodiment includes: tracking a plurality of load instructions which includes a first load instruction to access a first data that identifies a first memory location; detecting a second load instruction which accesses a second memory location for a second data, the second memory location matching the first memory location identified by the first data; responsive to the detecting, updating a list of pointer load instructions to include information identifying the first load instruction as a pointer load instruction; prefetching a third data for a third load instruction prior to executing the third load instruction; identifying the third load instruction as a pointer load instruction based on information from the list of pointer load instructions and responsively prefetching a fourth data from a fourth memory location, wherein the fourth memory location is identified by the third data.

    METHODS AND APPARATUS TO OPTIMIZE INSTRUCTIONS FOR EXECUTION BY A PROCESSOR
    3.
    发明申请
    METHODS AND APPARATUS TO OPTIMIZE INSTRUCTIONS FOR EXECUTION BY A PROCESSOR 有权
    优化处理器执行指令的方法和装置

    公开(公告)号:US20160364240A1

    公开(公告)日:2016-12-15

    申请号:US14737058

    申请日:2015-06-11

    CPC classification number: G06F9/3861 G06F8/41 G06F8/443 G06F9/30058 G06F9/3846

    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed herein. An example apparatus includes an instruction profiler to identify a predicated block within instructions to be executed by a hardware processor. The example apparatus includes a performance monitor to access a mis-prediction statistic based on an instruction address associated with the predicated block. The example apparatus includes a region former to, in response to determining that the mis-prediction statistic is above a mis-prediction threshold, include the predicated block in a predicated region for optimization.

    Abstract translation: 本文公开了方法,装置,系统和制品。 示例性设备包括指令分析器,用于识别要由硬件处理器执行的指令内的预测块。 示例性装置包括:性能监视器,用于基于与所述预测块相关联的指令地址来访问误差预测统计量。 示例性装置包括区域形成器,响应于确定误差预测统计量高于误差预测阈值,将所述预测块包括在用于优化的预测区域中。

    SYSTEM, METHOD, AND APPARATUS FOR ENHANCED POINTER IDENTIFICATION AND PREFETCHING

    公开(公告)号:US20200210339A1

    公开(公告)日:2020-07-02

    申请号:US16234135

    申请日:2018-12-27

    Abstract: System and method for prefetching pointer-referenced data. A method embodiment includes: tracking a plurality of load instructions which includes a first load instruction to access a first data that identifies a first memory location; detecting a second load instruction which accesses a second memory location for a second data, the second memory location matching the first memory location identified by the first data; responsive to the detecting, updating a list of pointer load instructions to include information identifying the first load instruction as a pointer load instruction; prefetching a third data for a third load instruction prior to executing the third load instruction; identifying the third load instruction as a pointer load instruction based on information from the list of pointer load instructions and responsively prefetching a fourth data from a fourth memory location, wherein the fourth memory location is identified by the third data.

    Register reclamation
    5.
    发明授权

    公开(公告)号:US10235177B2

    公开(公告)日:2019-03-19

    申请号:US15201403

    申请日:2016-07-02

    Abstract: In an example, an apparatus includes a binary translator (BT) including circuitry to: analyze a code block; determine that an architectural register mapped to a physical register in the physical register file is available for early reclamation; and insert a reclamation hint into the code block. In another example, a processor reclaims the physical register based at least in part on the reclamation hint.

    REGISTER RECLAMATION
    7.
    发明申请

    公开(公告)号:US20180004524A1

    公开(公告)日:2018-01-04

    申请号:US15201403

    申请日:2016-07-02

    CPC classification number: G06F9/30123 G06F8/41 G06F9/3016 G06F9/384

    Abstract: In an example, there is disclosed an apparatus, including a binary translator (BT) including circuitry to: analyze a code block; determine that an architectural register mapped to a physical register in the physical register file is available for early reclamation; and insert a reclamation hint into the code block. There is also disclosed a processor to reclaim the physical register based at least in part on the reclamation hint.

    SYSTEM, METHOD, AND APPARATUS FOR ENHANCED POINTER IDENTIFICATION AND PREFETCHING

    公开(公告)号:US20230409481A1

    公开(公告)日:2023-12-21

    申请号:US18320780

    申请日:2023-05-19

    CPC classification number: G06F12/0862 G06F2212/602

    Abstract: System and method for prefetching pointer-referenced data. A method embodiment includes: tracking a plurality of load instructions which includes a first load instruction to access a first data that identifies a first memory location; detecting a second load instruction which accesses a second memory location for a second data, the second memory location matching the first memory location identified by the first data; responsive to the detecting, updating a list of pointer load instructions to include information identifying the first load instruction as a pointer load instruction; prefetching a third data for a third load instruction prior to executing the third load instruction; identifying the third load instruction as a pointer load instruction based on information from the list of pointer load instructions and responsively prefetching a fourth data from a fourth memory location, wherein the fourth memory location is identified by the third data.

    Eliminating redundant stores using a protection designator and a clear designator

    公开(公告)号:US10540178B2

    公开(公告)日:2020-01-21

    申请号:US15265587

    申请日:2016-09-14

    Abstract: A processor for redundant stores includes a front end including circuitry to decode instructions from an instruction stream, a data cache unit including circuitry to cache data for the processor, a binary translator, and a memory execution unit. The binary translator includes circuitry to identify a first region of the instruction stream including a redundant store, mark a first starting instruction of the first region with a protection designator, mark a first ending instruction of the first region with a clear designator, and store an amended instruction stream with the markings. The memory execution unit includes circuitry to track the first redundant store based on the protection designator and the clear designator to eliminate the first redundant store.

Patent Agency Ranking