PREFETCH OPTIMIZATION IN SHARED RESOURCE MULTI-CORE SYSTEMS
    1.
    发明申请
    PREFETCH OPTIMIZATION IN SHARED RESOURCE MULTI-CORE SYSTEMS 有权
    共享资源多核系统中的优选优化

    公开(公告)号:US20140136795A1

    公开(公告)日:2014-05-15

    申请号:US13864028

    申请日:2013-04-16

    IPC分类号: G06F12/08

    摘要: An apparatus and method is described herein for optimization to prefetch throttling, which potentially enhances performance, reduces power consumption, and maintains positive gain for workloads that benefit from prefetching. More specifically, the optimizations described herein allow for bandwidth congestion and prefetch accuracy to be taken into account as feedbacks for throttling at the source of prefetch generation. As a result, when there is low congestion, full prefetch generation is allowed, even if the prefetch is inaccurate, since there is available bandwidth. However, when congestion is high, the determination of throttling falls to prefetch accuracy. If accuracy is high—miss rate is low—then less throttling is needed, because the prefetches are being utilized—performance is being enhanced. Yet, if prefetch accuracy is low—miss rate is high—then more prefetch throttling is needed to save power, because the prefetch are not being utilized—performance is not being enhanced by the large number of prefetches.

    摘要翻译: 本文描述了用于优化预取节流的装置和方法,其潜在地增强了性能,降低了功耗,并为从预取中受益的工作负载保持了正增益。 更具体地说,这里描述的优化允许考虑带宽拥塞和预取精度作为用于在预取生成源处节流的反馈。 结果,当拥塞低时,即使预取不准确,由于存在可用带宽,因此允许完全预取生成。 然而,当拥塞较高时,节流的确定下降到预取精度。 如果精度高,错失率低,则需要较少的节流,因为预取已被利用 - 性能正在提高。 然而,如果预取精度低错过率高,则需要更多的预取节流来节省功率,因为​​预取不被利用 - 性能并没有被大量预取提高。

    MULTI-CORE MEMORY THERMAL THROTTLING ALGORITHMS FOR IMPROVING POWER/PERFORMANCE TRADEOFFS
    2.
    发明申请
    MULTI-CORE MEMORY THERMAL THROTTLING ALGORITHMS FOR IMPROVING POWER/PERFORMANCE TRADEOFFS 有权
    用于改善功率/性能传输的多核心存储器热转移算法

    公开(公告)号:US20090248976A1

    公开(公告)日:2009-10-01

    申请号:US12055417

    申请日:2008-03-26

    IPC分类号: G06F12/08

    摘要: Embodiments of the invention are generally directed to systems, methods, and apparatuses for improving power/performance tradeoffs associated with multi-core memory thermal throttling algorithms. In some embodiments, the priority of shared resource allocation is changed on one or more points in a system, while the system is in dynamic random access memory (DRAM) throttling mode. This may enable the forward progress of cache bound workloads while still throttling DRAM for memory bound workloads.

    摘要翻译: 本发明的实施例一般涉及用于改善与多核存储器热节流算法相关联的功率/性能权衡的系统,方法和装置。 在一些实施例中,在系统处于动态随机存取存储器(DRAM)限制模式的情况下,在系统中的一个或多个点上改变共享资源分配的优先级。 这可能使缓存绑定工作负载的正向进展,同时仍然限制DRAM用于内存绑定工作负载。