Prefetching to a cache based on buffer fullness
    1.
    发明授权
    Prefetching to a cache based on buffer fullness 有权
    基于缓冲区丰满度预取到缓存

    公开(公告)号:US08909866B2

    公开(公告)日:2014-12-09

    申请号:US13669502

    申请日:2012-11-06

    IPC分类号: G06F12/08

    CPC分类号: G06F12/0862 G06F12/0897

    摘要: A processor transfers prefetch requests from their targeted cache to another cache in a memory hierarchy based on a fullness of a miss address buffer (MAB) or based on confidence levels of the prefetch requests. Each cache in the memory hierarchy is assigned a number of slots at the MAB. In response to determining the fullness of the slots assigned to a cache is above a threshold when a prefetch request to the cache is received, the processor transfers the prefetch request to the next lower level cache in the memory hierarchy. In response, the data targeted by the access request is prefetched to the next lower level cache in the memory hierarchy, and is therefore available for subsequent provision to the cache. In addition, the processor can transfer a prefetch request to lower level caches based on a confidence level of a prefetch request.

    摘要翻译: 处理器根据缺失地址缓冲区(MAB)的丰满度或基于预取请求的置信水平,将预取请求从目标缓存传输到存储器层次结构中的另一高速缓存。 内存层次结构中的每个高速缓存在MAB上分配了多个插槽。 响应于当接收到高速缓存的预取请求时,分配给高速缓存的时隙的丰满度高于阈值,则处理器将预取请求传送到存储器层级中的下一个较低级别的高速缓存。 作为响应,访问请求所针对的数据被预取到存储器层次结构中的下一个较低级缓存,因此可用于后续的缓存提供。 此外,处理器可以基于预取请求的置信水平将预取请求传送到较低级别的高速缓存。

    Throttling computational units according to performance sensitivity
    2.
    发明授权
    Throttling computational units according to performance sensitivity 有权
    根据性能灵敏度调节计算单位

    公开(公告)号:US08443209B2

    公开(公告)日:2013-05-14

    申请号:US12508935

    申请日:2009-07-24

    IPC分类号: G05B13/02 G06F1/00

    摘要: A power allocation strategy limits performance of a subset of a plurality of computational units in a computer system according to performance sensitivity of each of the plurality of computational units to a change performance capability, e.g., frequency change. The performance of the subset of computational units may be limited by setting a power state in which the subset may be operated and/or reducing a current power state of the subset to a lower power state. The subset whose performance is limited includes computational units that are least performance sensitive according to stored sensitivity data. The subset may include one or more processing cores and performance of the one or more processing cores may be limited in response to a CPU-bounded application or graphics processing unit (GPU)-bounded application being executed.

    摘要翻译: 功率分配策略根据多个计算单元中的每个计算单元的性能灵敏度来限制计算机系统中的多个计算单元的子集的性能,以改变性能能力,例如频率变化。 计算单元子集的性能可以通过设置子集可以被操作的功率状态和/或将该子集的当前功率状态降低到较低功率状态来限制。 其性能受限的子集包括根据存储的灵敏度数据对性能敏感度最低的计算单元。 子集可以包括一个或多个处理核心,并且一个或多个处理核心的性能可能受到响应于被执行的CPU限制的应用程序或图形处理单元(GPU)绑定应用程序的限制。

    DETERMINING PERFORMANCE SENSITIVITIES OF COMPUTATIONAL UNITS
    3.
    发明申请
    DETERMINING PERFORMANCE SENSITIVITIES OF COMPUTATIONAL UNITS 审中-公开
    确定计算单位的绩效敏感度

    公开(公告)号:US20110022356A1

    公开(公告)日:2011-01-27

    申请号:US12508902

    申请日:2009-07-24

    IPC分类号: G06F15/00

    摘要: Performance sensitivities to a change in performance capabilities of computational units of a computer system are determined based on measured utilization metrics for each of the computational units. In order to determine the performance sensitivities, in one approach, the computational units are operated at a first performance level, and respective first utilization metrics are determined. The computational units are then operated at a second performance level and respective second utilization metrics are determined. The sensitivity to performance capability change, e.g., a frequency change, is determined based on the respective first and second utilization metrics. The performance sensitivities of the computational units to a change in performance capability are continually updated in response to, e.g., a process context change of a computational unit or in response to a predetermined period of time elapsing since the last sensitivity to a performance capability change was determined for a computational unit.

    摘要翻译: 基于用于每个计算单元的测量的利用度量确定对计算机系统的计算单元的性能能力变化的性能敏感度。 为了确定性能灵敏度,在一种方法中,计算单元在第一性能水平下操作,并且确定各自的第一使用度量。 然后,计算单元在第二性能水平下操作并确定相应的第二利用度量。 基于相应的第一和第二利用度量来确定对性能能力改变的敏感性,例如频率变化。 响应于例如计算单元的处理上下文变化或响应于经过的预定时间段,持续地更新计算单元对性能能力变化的性能敏感度,因为对性能能力变化的最后敏感度为 确定一个计算单位。

    HIERARCHICAL MEMORY ARBITRATION TECHNIQUE FOR DISPARATE SOURCES
    4.
    发明申请
    HIERARCHICAL MEMORY ARBITRATION TECHNIQUE FOR DISPARATE SOURCES 有权
    不同来源的分层记忆仲裁技术

    公开(公告)号:US20100281231A1

    公开(公告)日:2010-11-04

    申请号:US12431874

    申请日:2009-04-29

    IPC分类号: G06F12/00 G06F9/46

    CPC分类号: G06F13/161

    摘要: A hierarchical memory request stream arbitration technique merges coherent memory request streams from multiple memory request sources and arbitrates the merged coherent memory request stream with requests from a non-coherent memory request stream. In at least one embodiment of the invention, a method of generating a merged memory request stream from a plurality of memory request streams includes merging coherent memory requests into a first serial memory request stream. The method includes selecting, by a memory controller circuit, a memory request for placement in the merged memory request stream from at least the first serial memory request stream and a merged non-coherent request stream. The merged non-coherent memory request stream is at least partially based on an indicator of a previous memory request selected for placement in the merged memory request stream.

    摘要翻译: 分级存储器请求流仲裁技术将来自多个存储器请求源的相干存储器请求流合并,并且根据来自非相干存储器请求流的请求对合并的相干存储器请求流进行仲裁。 在本发明的至少一个实施例中,从多个存储器请求流生成合并存储器请求流的方法包括将相干存储器请求合并到第一串行存储器请求流中。 该方法包括由存储器控制器电路从至少第一串行存储器请求流和合并的非相干请求流中选择存储器请求,以供放置在合并的存储器请求流中。 合并的非相干存储器请求流至少部分地基于选择用于放置在合并的存储器请求流中的先前存储器请求的指示符。

    Techniques for Accessing a Resource in a Processor System
    5.
    发明申请
    Techniques for Accessing a Resource in a Processor System 有权
    在处理器系统中访问资源的技术

    公开(公告)号:US20090083741A1

    公开(公告)日:2009-03-26

    申请号:US11859044

    申请日:2007-09-21

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5016

    摘要: A technique of accessing a resource includes receiving, at a master scheduler, resource access requests. The resource access requests are translated into respective slave state machine work orders that each include one or more respective commands. The respective commands are assigned, for execution, to command streams associated with respective slave state machines. The respective commands are then executed responsive to the respective slave state machines.

    摘要翻译: 访问资源的技术包括在主调度器处接收资源访问请求。 资源访问请求被转换成各自的从状态机工作单,其各自包括一个或多个相应的命令。 相应的命令被分配用于执行以命令与相应从属状态机相关联的流。 然后响应于相应的从状态机执行各自的命令。

    Hierarchical memory arbitration technique for disparate sources
    6.
    发明授权
    Hierarchical memory arbitration technique for disparate sources 有权
    不同来源的分层内存仲裁技术

    公开(公告)号:US08645639B2

    公开(公告)日:2014-02-04

    申请号:US13600614

    申请日:2012-08-31

    IPC分类号: G06F13/18

    CPC分类号: G06F13/161

    摘要: A hierarchical memory request stream arbitration technique merges coherent memory request streams from multiple memory request sources and arbitrates the merged coherent memory request stream with requests from a non-coherent memory request stream. In at least one embodiment of the invention, a method of generating a merged memory request stream from a plurality of memory request streams includes merging coherent memory requests into a first serial memory request stream. The method includes selecting, by a memory controller circuit, a memory request for placement in the merged memory request stream from at least the first serial memory request stream and a merged non-coherent request stream. The merged non-coherent memory request stream is based on an indicator of a previous memory request selected for placement in the merged memory request stream.

    摘要翻译: 分级存储器请求流仲裁技术将来自多个存储器请求源的相干存储器请求流合并,并且根据来自非相干存储器请求流的请求对合并的相干存储器请求流进行仲裁。 在本发明的至少一个实施例中,从多个存储器请求流生成合并存储器请求流的方法包括将相干存储器请求合并到第一串行存储器请求流中。 该方法包括由存储器控制器电路从至少第一串行存储器请求流和合并的非相干请求流中选择存储器请求,以供放置在合并的存储器请求流中。 合并的非相干存储器请求流基于选择用于放置在合并的存储器请求流中的先前存储器请求的指示符。

    Hierarchical memory arbitration technique for disparate sources
    7.
    发明授权
    Hierarchical memory arbitration technique for disparate sources 有权
    不同来源的分层内存仲裁技术

    公开(公告)号:US08266389B2

    公开(公告)日:2012-09-11

    申请号:US12431874

    申请日:2009-04-29

    IPC分类号: G06F13/18

    CPC分类号: G06F13/161

    摘要: A hierarchical memory request stream arbitration technique merges coherent memory request streams from multiple memory request sources and arbitrates the merged coherent memory request stream with requests from a non-coherent memory request stream. In at least one embodiment of the invention, a method of generating a merged memory request stream from a plurality of memory request streams includes merging coherent memory requests into a first serial memory request stream. The method includes selecting, by a memory controller circuit, a memory request for placement in the merged memory request stream from at least the first serial memory request stream and a merged non-coherent request stream. The merged non-coherent memory request stream is at least partially based on an indicator of a previous memory request selected for placement in the merged memory request stream.

    摘要翻译: 分级存储器请求流仲裁技术将来自多个存储器请求源的相干存储器请求流合并,并且根据来自非相干存储器请求流的请求对合并的相干存储器请求流进行仲裁。 在本发明的至少一个实施例中,从多个存储器请求流生成合并存储器请求流的方法包括将相干存储器请求合并到第一串行存储器请求流中。 该方法包括由存储器控制器电路从至少第一串行存储器请求流和合并的非相干请求流中选择存储器请求,以供放置在合并的存储器请求流中。 合并的非相干存储器请求流至少部分地基于选择用于放置在合并的存储器请求流中的先前存储器请求的指示符。

    Techniques for accessing a resource in a processor system
    8.
    发明授权
    Techniques for accessing a resource in a processor system 有权
    用于访问处理器系统中的资源的技术

    公开(公告)号:US08341344B2

    公开(公告)日:2012-12-25

    申请号:US11859044

    申请日:2007-09-21

    IPC分类号: G06F12/00 G06F9/46

    CPC分类号: G06F9/5016

    摘要: A technique of accessing a resource includes receiving, at a master scheduler, resource access requests. The resource access requests are translated into respective slave state machine work orders that each include one or more respective commands. The respective commands are assigned, for execution, to command streams associated with respective slave state machines. The respective commands are then executed responsive to the respective slave state machines.

    摘要翻译: 访问资源的技术包括在主调度器处接收资源访问请求。 资源访问请求被转换成各自的从状态机工作单,其各自包括一个或多个相应的命令。 相应的命令被分配用于执行以命令与相应从属状态机相关联的流。 然后响应于相应的从状态机执行各自的命令。

    THROTTLING COMPUTATIONAL UNITS ACCORDING TO PERFORMANCE SENSITIVITY
    10.
    发明申请
    THROTTLING COMPUTATIONAL UNITS ACCORDING TO PERFORMANCE SENSITIVITY 有权
    根据性能灵敏度计算出的计算单位

    公开(公告)号:US20110022857A1

    公开(公告)日:2011-01-27

    申请号:US12508935

    申请日:2009-07-24

    IPC分类号: G06F1/26

    摘要: A power allocation strategy limits performance of a subset of a plurality of computational units in a computer system according to performance sensitivity of each of the plurality of computational units to a change performance capability, e.g., frequency change. The performance of the subset of computational units may be limited by setting a power state in which the subset may be operated and/or reducing a current power state of the subset to a lower power state. The subset whose performance is limited includes computational units that are least performance sensitive according to stored sensitivity data. The subset may include one or more processing cores and performance of the one or more processing cores may be limited in response to a CPU-bounded application or graphics processing unit (GPU)-bounded application being executed.

    摘要翻译: 功率分配策略根据多个计算单元中的每个计算单元的性能灵敏度来限制计算机系统中的多个计算单元的子集的性能,以改变性能能力,例如频率变化。 计算单元子集的性能可以通过设置子集可以被操作的功率状态和/或将该子集的当前功率状态降低到较低功率状态来限制。 其性能受限的子集包括根据存储的灵敏度数据对性能敏感度最低的计算单元。 子集可以包括一个或多个处理核心,并且一个或多个处理核心的性能可能受到响应于被执行的CPU限制的应用程序或图形处理单元(GPU)绑定应用程序的限制。