Dual latency status and coherency reporting for a multiprocessing system
    1.
    发明授权
    Dual latency status and coherency reporting for a multiprocessing system 失效
    多处理系统的双延迟状态和一致性报告

    公开(公告)号:US5608878A

    公开(公告)日:1997-03-04

    申请号:US316980

    申请日:1994-10-03

    IPC分类号: G06F12/08 G06F13/00

    CPC分类号: G06F12/0831

    摘要: A multiprocessing system utilizes a bus protocol having two response windows. The first response window is at a fixed latency from the transmission of a bus request and/or address, while the second response window, utilized for coherency reporting, is placed a configurable number of clock cycles after the bus request and address to allow for longer access, or snoop, times to perform a cache directory look-up within other bus devices. The first response window reports error or flow control and error status. Furthermore, a method had been described, which implements the reporting of response information in a flexible and high performance manner.

    摘要翻译: 多处理系统利用具有两个响应窗口的总线协议。 第一响应窗口处于来自总线请求和/或地址传输的固定延迟时间,而用于一致性报告的第二响应窗口在总线请求和地址之后被放置成可配置数量的时钟周期,以允许更长时间 访问或窥探时间,以执行其他总线设备内的缓存目录查找。 第一个响应窗口报告错误或流量控制和错误状态。 此外,已经描述了以灵活和高性能的方式报告响应信息的方法。

    Environmental control of liquid cooled electronics

    公开(公告)号:US09811097B2

    公开(公告)日:2017-11-07

    申请号:US12425210

    申请日:2009-04-16

    IPC分类号: G05D23/00 G05D23/19

    CPC分类号: G05D23/1931

    摘要: A system and computer program product are provided for controlling liquid-cooled electronics, which includes measuring a first set point temperature, Ta, wherein the Ta is based on a dew point temperature, Tdp of a computer room. A second set point temperature, Tb, is measured, wherein the Tb is based on a facility chilled liquid inlet temperature, Tci, and a rack power, Prack, of an electronics rack. A Modular Cooling Unit (MCU) set point temperature, Tsp, is selected. The Tsp is the higher value of said Ta and said Tb. Responsive to the selected Tsp, a control valve is regulated. The control valve controls a flow of liquid that passes through a heat exchanger.

    Block driven computation using a caching policy specified in an operand data structure
    3.
    发明授权
    Block driven computation using a caching policy specified in an operand data structure 有权
    使用在操作数数据结构中指定的缓存策略进行块驱动计算

    公开(公告)号:US08458439B2

    公开(公告)日:2013-06-04

    申请号:US12336350

    申请日:2008-12-16

    IPC分类号: G06F12/00

    CPC分类号: G06F9/383 G06F2212/6028

    摘要: A processor has an associated memory hierarchy including a cache memory. The processor includes an instruction sequencing unit that fetches instructions for processing, an operand data structure including a plurality of entries corresponding to operands of operations to be performed by the processor, and a computation engine. A first entry among the plurality of entries in the operand data structure specifies a first caching policy for a first operand, and a second entry specifies a second caching policy for a second operand. The computation engine computes and stores operands in the memory hierarchy in accordance with the cache policies indicated within the operand data structure.

    摘要翻译: 处理器具有包括高速缓冲存储器的相关联的存储器层级。 所述处理器包括:指令排序单元,其提取用于处理的指令;操作数数据结构,包括与由所述处理器执行的操作操作数对应的多个条目;以及计算引擎。 操作数数据结构中的多个条目中的第一条目指定第一操作数的第一高速缓存策略,第二条目指定用于第二操作数的第二高速缓存策略。 计算引擎根据操作数数据结构中指示的缓存策略计算并存储存储器层次结构中的操作数。

    Wake-and-go mechanism with data monitoring
    4.
    发明授权
    Wake-and-go mechanism with data monitoring 有权
    具有数据监控的唤醒机制

    公开(公告)号:US08386822B2

    公开(公告)日:2013-02-26

    申请号:US12024540

    申请日:2008-02-01

    IPC分类号: G06F1/00 G06F1/32

    摘要: A wake-and-go mechanism is provided for a data processing system. The wake-and-go mechanism recognizes a programming idiom, specialized instruction, operating system call, or application programming interface call that indicates that a thread is waiting for an event. The wake-and-go mechanism updates a wake-and-go array with a target address, expected data value, and comparison type associated with the event. The thread then goes to sleep until the event occurs. The wake-and-go array may be a content addressable memory (CAM). When a transaction appears on the symmetric multiprocessing (SMP) fabric that modifies the value at a target address in the CAM, logic associated with the CAM performs a comparison based on the data value being written, expected data value, and comparison type.

    摘要翻译: 为数据处理系统提供了一个唤醒机制。 唤醒机制识别编程成语,专用指令,操作系统调用或应用程序编程接口调用,指示线程正在等待事件。 唤醒机制使用目标地址,预期数据值和与事件关联的比较类型来更新唤醒数组。 线程然后进入休眠状态直到事件发生。 唤醒阵列可以是内容可寻址存储器(CAM)。 当在对称多处理(SMP)结构上出现修改CAM中目标地址上的值的事务时,与CAM相关联的逻辑将根据写入的数据值,预期数据值和比较类型进行比较。

    Performing a partial cache line storage-modifying operation based upon a hint
    5.
    发明授权
    Performing a partial cache line storage-modifying operation based upon a hint 失效
    基于提示执行部分缓存行存储修改操作

    公开(公告)号:US08332588B2

    公开(公告)日:2012-12-11

    申请号:US13349315

    申请日:2012-01-12

    IPC分类号: G06F12/04

    CPC分类号: G06F12/0822

    摘要: Analyzing pre-processed code includes identifying at least one storage-modifying construct specifying a storage-modifying memory access to a memory hierarchy of a data processing system and determining if more than one granule of a cache line of data containing multiple granules that is targeted by the storage-modifying construct is subsequently referenced by said pre-processed code. Post-processed code including a storage-modifying instruction corresponding to the at least one storage-modifying construct in the pre-processed code is generated and stored. Generating the post-processed code includes marking the storage-modifying instruction with a partial cache line hint indicating that said storage-modifying instruction targets less than a full cache line of data within a memory hierarchy if the analyzing indicates only one granule of the target cache line will be accessed while the cache line is held in the cache memory and otherwise refraining from marking the storage-modifying instruction with the partial cache line hint.

    摘要翻译: 分析预处理的代码包括识别指定对数据处理系统的存储器层次结构的存储修改存储器访问的至少一个存储修改结构,并且确定是否存在多个颗粒的高速缓存行数据,所述数据包含多个颗粒的高速缓存行是由 存储修改结构随后由所述预处理代码引用。 生成并存储包括与预处理代码中的至少一个存储修改结构对应的存储修改指令的后处理代码。 生成后处理代码包括用部分高速缓存线提示标记存储修改指令,指示所述存储修改指令的目标小于存储器层次结构内的完整高速缓存数据行,如果分析仅指示目标高速缓存的一个颗粒 将高速缓存线保持在高速缓存存储器中,并以其它方式避免使用部分高速缓存线提示来标记存储修改指令。

    Cache management during asynchronous memory move operations
    6.
    发明授权
    Cache management during asynchronous memory move operations 有权
    异步存储器移动操作期间的缓存管理

    公开(公告)号:US08327101B2

    公开(公告)日:2012-12-04

    申请号:US12024526

    申请日:2008-02-01

    IPC分类号: G06F12/02

    摘要: A data processing system includes a mechanism for completing an asynchronous memory move (AMM) operation in which the processor receives an AMM ST instruction and processes a processor-level move of data in virtual address space and an asynchronous memory mover then completes a physical move of the data within the real address space (memory). A status/control field of the AMM ST instruction includes an indication of a requested treatment of the lower level cache(s) on completion of the AMM operation. When the status/control field indicates an update to at least one cache should be performed, the asynchronous memory mover automatically forwards a copy of the data from the data move to the lower level cache, and triggers an update of a coherency state for a cache line in which the copy of the data is placed.

    摘要翻译: 数据处理系统包括用于完成异步存储器移动(AMM)操作的机制,其中处理器接收AMM ST指令并处理虚拟地址空间中的数据的处理器级移动,然后异步存储器移动器完成物理移动 实际地址空间(内存)中的数据。 AMM ST指令的状态/控制字段在完成AMM操作时包括对低级缓存的请求处理的指示。 当状态/控制字段指示应该执行至少一个缓存的更新时,异步存储器移动器自动将数据的副本从数据移动转发到较低级的高速缓存,并触发高速缓存的一致性状态的更新 其中放置数据副本的行。

    Hardware based dynamic load balancing of message passing interface tasks by modifying tasks
    7.
    发明授权
    Hardware based dynamic load balancing of message passing interface tasks by modifying tasks 失效
    基于硬件的动态负载平衡消息传递接口任务通过修改任务

    公开(公告)号:US08312464B2

    公开(公告)日:2012-11-13

    申请号:US11846168

    申请日:2007-08-28

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5083 G06F9/522

    摘要: Mechanisms are provided for providing hardware based dynamic load balancing of message passing interface (MPI) tasks by modifying tasks. Mechanisms for adjusting the balance of processing workloads of the processors executing tasks of an MPI job are provided so as to minimize wait periods for waiting for all of the processors to call a synchronization operation. Each processor has an associated hardware implemented MPI load balancing controller. The MPI load balancing controller maintains a history that provides a profile of the tasks with regard to their calls to synchronization operations. From this information, it can be determined which processors should have their processing loads lightened and which processors are able to handle additional processing loads without significantly negatively affecting the overall operation of the parallel execution system. Thus, operations may be performed to shift workloads from the slowest processor to one or more of the faster processors.

    摘要翻译: 提供了通过修改任务来提供消息传递接口(MPI)任务的基于硬件的动态负载平衡的机制。 提供了用于调整执行MPI作业任务的处理器的处理工作负载的平衡的机制,以便最小化等待所有处理器调用同步操作的等待时间。 每个处理器都有一个相关的硬件实现的MPI负载平衡控制器。 MPI负载平衡控制器维护一个历史记录,提供任务关于其对同步操作的调用的简档。 根据该信息,可以确定哪些处理器应该减轻其处理负载,哪些处理器能够处理额外的处理负载,而不会对并行执行系统的整体操作产生显着的负面影响。 因此,可以执行操作以将工作负载从最慢处理器转移到一个或多个较快处理器。

    PERFORMING A PARTIAL CACHE LINE STORAGE-MODIFYING OPERATION BASED UPON A HINT
    8.
    发明申请
    PERFORMING A PARTIAL CACHE LINE STORAGE-MODIFYING OPERATION BASED UPON A HINT 失效
    根据提示执行部分缓存线存储 - 修改操作

    公开(公告)号:US20120265938A1

    公开(公告)日:2012-10-18

    申请号:US13349315

    申请日:2012-01-12

    IPC分类号: G06F12/08

    CPC分类号: G06F12/0822

    摘要: Analyzing pre-processed code includes identifying at least one storage-modifying construct specifying a storage-modifying memory access to a memory hierarchy of a data processing system and determining if more than one granule of a cache line of data containing multiple granules that is targeted by the storage-modifying construct is subsequently referenced by said pre-processed code. Post-processed code including a storage-modifying instruction corresponding to the at least one storage-modifying construct in the pre-processed code is generated and stored. Generating the post-processed code includes marking the storage-modifying instruction with a partial cache line hint indicating that said storage-modifying instruction targets less than a full cache line of data within a memory hierarchy if the analyzing indicates only one granule of the target cache line will be accessed while the cache line is held in the cache memory and otherwise refraining from marking the storage-modifying instruction with the partial cache line hint.

    摘要翻译: 分析预处理的代码包括识别指定对数据处理系统的存储器层次结构的存储修改存储器访问的至少一个存储修改结构,并且确定是否存在多个颗粒的高速缓存行数据,所述数据包含多个颗粒的高速缓存行是由 存储修改结构随后由所述预处理代码引用。 生成并存储包括与预处理代码中的至少一个存储修改结构对应的存储修改指令的后处理代码。 生成后处理代码包括用部分高速缓存线提示标记存储修改指令,指示所述存储修改指令的目标小于存储器层次结构内的完整高速缓存数据行,如果分析仅指示目标高速缓存的一个颗粒 将高速缓存线保持在高速缓存存储器中,并以其它方式避免使用部分高速缓存线提示来标记存储修改指令。

    Specifying an addressing relationship in an operand data structure
    9.
    发明授权
    Specifying an addressing relationship in an operand data structure 有权
    在操作数数据结构中指定寻址关系

    公开(公告)号:US08281106B2

    公开(公告)日:2012-10-02

    申请号:US12336342

    申请日:2008-12-16

    IPC分类号: G06F12/00

    CPC分类号: G06F9/345

    摘要: A processor includes at least one execution unit that executes instructions, at least one register file, coupled to the at least one execution unit, that buffers operands for access by the at least one execution unit, and an instruction sequencing unit that fetches instructions for execution by the execution unit. The processor further includes an operand data structure and an address generation accelerator. The operand data structure specifies a first relationship between addresses of sequential accesses within a first address region and a second relationship between addresses of sequential accesses within a second address region. The address generation accelerator computes a first address of a first memory access in the first address region by reference to the first relationship and a second address of a second memory access in the second address region by reference to the second relationship.

    摘要翻译: 处理器包括执行指令的至少一个执行单元,耦合到所述至少一个执行单元的至少一个寄存器文件,其缓冲由所述至少一个执行单元访问的操作数,以及指令排序单元,其提取用于执行的指令 由执行单位。 处理器还包括操作数数据结构和地址生成加速器。 操作数数据结构指定第一地址区域内的顺序访问的地址与第二地址区域内的顺序存取的地址之间的第一关系。 参考第二关系,地址生成加速器通过参考第一关系和第二地址区中的第二存储器访问的第二地址来计算第一地址区中的第一存储器访问的第一地址。

    Sourcing differing amounts of prefetch data in response to data prefetch requests
    10.
    发明授权
    Sourcing differing amounts of prefetch data in response to data prefetch requests 失效
    根据数据预取请求采购不同数量的预取数据

    公开(公告)号:US08250307B2

    公开(公告)日:2012-08-21

    申请号:US12024165

    申请日:2008-02-01

    IPC分类号: G06F12/00

    摘要: According to a method of data processing, a memory controller receives a prefetch load request from a processor core of a data processing system. The prefetch load request specifies a requested line of data. In response to receipt of the prefetch load request, the memory controller determines by reference to a stream of demand requests how much data is to be supplied to the processor core in response to the prefetch load request. In response to the memory controller determining to provide less than all of the requested line of data, the memory controller provides less than all of the requested line of data to the processor core.

    摘要翻译: 根据数据处理的方法,存储器控制器从数据处理系统的处理器核心接收预取负载请求。 预取加载请求指定所请求的数据行。 响应于接收到预取加载请求,存储器控制器通过参考需求请求流来确定响应于预取加载请求将多少数据提供给处理器核。 响应于存储器控制器确定提供少于全部所请求的数据行,存储器控制器将少于所有请求的数据行提供给处理器核。