Specialized memory move barrier operations
    61.
    Granted patent
    Status: Not in force

    Publication No.: US07941627B2

    Publication date: 2011-05-10

    Application No.: US12024674

    Filing date: 2008-02-01

    IPC classification: G06F12/00

    Abstract: An instruction set architecture (ISA) includes an asynchronous memory move (AMM) synchronization (SYNC) instruction. When a processor of a data processing system executes the AMM SYNC instruction, the processor prevents an AMM operation generated by a subsequently received/executed AMM ST instruction from proceeding with the data move portion of the AMM operation within the memory subsystem until completion of all ongoing memory access operations within the memory subsystem and fabric. The AMM operation does not wait for a normal barrier operation. The processor forwards the information needed to initiate the AMM operation to asynchronous memory mover logic, and signals the logic not to proceed with the AMM operation until it is signaled that the AMM SYNC has completed.
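
    Illustrative sketch (not taken from the patent): the ordering described in the abstract can be modelled in Python, with a `threading.Event` standing in for completion of the AMM SYNC. The class `AsyncMover`, its method names, and the byte-array "memory" are hypothetical stand-ins for the hardware. The sketch only shows that the data move is held back until the earlier memory accesses have completed.

```python
import threading


class AsyncMover:
    """Hypothetical software model of asynchronous memory mover logic."""

    def __init__(self, memory: bytearray):
        self.memory = memory
        self._sync_done = threading.Event()
        self._sync_done.set()  # no AMM SYNC pending initially

    def begin_sync(self) -> None:
        # Models the processor executing AMM SYNC: later moves must wait.
        self._sync_done.clear()

    def complete_sync(self) -> None:
        # Models the signal that all ongoing memory accesses have finished.
        self._sync_done.set()

    def start_move(self, src: int, dst: int, length: int) -> threading.Thread:
        # The move parameters are handed over immediately, but the data move
        # itself does not proceed until the sync has completed.
        def run() -> None:
            self._sync_done.wait()
            self.memory[dst:dst + length] = self.memory[src:src + length]

        worker = threading.Thread(target=run)
        worker.start()
        return worker
```

    A store that was already in flight before the move (here, a plain slice assignment made before `complete_sync()`) is therefore visible to the move when it finally runs.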

    Allocating a global shared memory
    62.
    Granted patent
    Status: In force

    Publication No.: US07925842B2

    Publication date: 2011-04-12

    Application No.: US11958956

    Filing date: 2007-12-18

    IPC classification: G06F12/00 G06F13/00 G06F13/28

    CPC classification: G06F12/0284 G06F2212/1048

    Abstract: A method of operating a data processing system in which each of multiple tasks within a parallel job executing on multiple nodes of the data processing system issues a system call to request allocation of backing storage in physical memory for global shared memory accessible to all of the multiple tasks within the parallel job, where the global shared memory is in a global address space defined by a range of effective addresses. Each task among the multiple tasks receives an indication that the allocation requested by the system call was successful only if the global address space for that task was previously reserved and backing storage for the global shared memory has not already been allocated.
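
    Illustrative sketch (not taken from the patent): the success condition stated in the abstract reduces to two checks per task. The names `TaskGlobalSpace`, `allocate_backing`, and `allocate_for_job` are hypothetical, and the real mechanism is a system call rather than a Python function.

```python
from dataclasses import dataclass


@dataclass
class TaskGlobalSpace:
    """Hypothetical per-task view of the global address space."""
    reserved: bool = False           # effective-address range reserved earlier
    backing_allocated: bool = False  # physical backing storage already present


def allocate_backing(space: TaskGlobalSpace) -> bool:
    """Report success only when both conditions from the abstract hold."""
    if not space.reserved or space.backing_allocated:
        return False
    space.backing_allocated = True
    return True


def allocate_for_job(tasks: list) -> list:
    # Every task in the parallel job issues the same allocation request.
    return [allocate_backing(task) for task in tasks]
```

    Under these assumptions a second request from the same task fails, because backing storage has already been allocated.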

    USER LEVEL MESSAGE BROADCAST MECHANISM IN DISTRIBUTED COMPUTING ENVIRONMENT
    63.
    Patent application
    Status: Not in force

    Publication No.: US20100269027A1

    Publication date: 2010-10-21

    Application No.: US12424837

    Filing date: 2009-04-16

    Abstract: A data processing system is programmed to provide a method for enabling user-level one-to-all message/messaging (OTAM) broadcast within a distributed parallel computing environment in which multiple threads of a single job execute on different processing nodes across a network. The method comprises: generating one or more messages for transmission to at least one other processing node accessible via a network, where the messages are generated by/for a first thread executing at the data processing system (first processing node) and the other processing node executes one or more second threads of the same parallel job as the first thread. An OTAM broadcast is transmitted via a host fabric interface (HFI) of the data processing system as a one-to-all broadcast on the network, whereby the messages are transmitted to a cluster of processing nodes across the network that execute threads of the same parallel job as the first thread.

    Sharing lock mechanism between protocol layers
    64.
    Granted patent
    Status: Not in force

    Publication No.: US07689992B2

    Publication date: 2010-03-30

    Application No.: US10877095

    Filing date: 2004-06-25

    IPC classification: G06F9/46

    CPC classification: G06F9/526

    Abstract: Shared locks are employed for controlling a thread which extends across more than one protocol layer in a data processing system. A counter is used as part of a data structure that makes it possible to implement shared locks across multiple layers. The use of shared locks avoids the processing overhead usually associated with lock acquisition and release. The thread which is controlled may be initiated in either an upper-layer protocol or a lower layer.
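
    Illustrative sketch (not taken from the patent): one possible reading of "a counter as part of a data structure" is a lock that records its owning thread and a nesting count, so that a thread already holding the lock in one layer does not acquire it again in the next layer. The class `SharedLayerLock`, the `real_acquires` statistic, and the two layer functions are hypothetical.

```python
import threading


class SharedLayerLock:
    """Hypothetical lock shared by protocol layers; the counter tracks nesting."""

    def __init__(self):
        self._lock = threading.Lock()
        self._owner = None
        self._count = 0
        self.real_acquires = 0  # how often the underlying lock was taken

    def enter(self) -> None:
        me = threading.get_ident()
        if self._owner == me:
            # Same thread, another layer: bump the counter, skip the acquire.
            self._count += 1
            return
        self._lock.acquire()
        self._owner = me
        self._count = 1
        self.real_acquires += 1

    def leave(self) -> None:
        if self._owner != threading.get_ident():
            raise RuntimeError("lock not held by this thread")
        self._count -= 1
        if self._count == 0:
            self._owner = None
            self._lock.release()


def lower_layer_send(lock: SharedLayerLock, log: list) -> None:
    lock.enter()
    try:
        log.append("lower")
    finally:
        lock.leave()


def upper_layer_send(lock: SharedLayerLock, log: list) -> None:
    lock.enter()
    try:
        log.append("upper")
        lower_layer_send(lock, log)  # crosses into the lower layer
    finally:
        lock.leave()
```

    A call that starts in the upper layer takes the underlying lock once for both layers, and a call that starts directly in the lower layer also works, matching the abstract's statement that the thread may be initiated in either layer.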

    Mechanisms to Order Global Shared Memory Operations
    65.
    Patent application
    Status: In force

    Publication No.: US20090199200A1

    Publication date: 2009-08-06

    Application No.: US12024367

    Filing date: 2008-02-01

    IPC classification: G06F9/50

    Abstract: A method and data processing system for performing fence operations within a global shared memory (GSM) environment having a local task executing on a processor and providing GSM commands for processing by a host fabric interface (HFI) window that is allocated to the task. The HFI window has one or more registers for use during local fence operations. A first register tracks a first count of task-issued GSM commands, and a second register tracks a second count of GSM operations being processed by the HFI. The processing logic detects a locally-issued fence operation, and responds by performing a series of operations, including: automatically stopping the task from issuing additional GSM commands; monitoring for completion of all the task-issued GSM commands at the HFI; and triggering a resumption of issuance of GSM commands by the task when the completion of all previous task-issued GSM commands is registered by the HFI.
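
    Illustrative sketch (not taken from the patent): the three fence steps can be modelled with two counters. The class `HFIWindow` and its methods are hypothetical, and the second register is assumed here to count operations the HFI has finished processing, which is one interpretation of the abstract.

```python
class HFIWindow:
    """Hypothetical model of an HFI window with two counting registers."""

    def __init__(self):
        self.issued = 0      # first register: task-issued GSM commands
        self.processed = 0   # second register: operations finished by the HFI (assumed)
        self.fenced = False

    def issue(self) -> None:
        # Step 1 of the fence: the task may not issue while the fence is active.
        if self.fenced:
            raise RuntimeError("task is stalled by a fence")
        self.issued += 1

    def complete_one(self) -> None:
        # Models the HFI finishing one outstanding operation.
        if self.processed < self.issued:
            self.processed += 1

    def fence(self) -> None:
        self.fenced = True

    def poll_fence(self) -> bool:
        # Steps 2 and 3: monitor for completion, then allow issuance to resume.
        if self.fenced and self.processed == self.issued:
            self.fenced = False
        return not self.fenced
```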

    ASYNCHRONOUS MEMORY MOVE ACROSS PHYSICAL NODES (DUAL-SIDED COMMUNICATION FOR MEMORY MOVE)
    66.
    Patent application
    Status: In force

    Publication No.: US20090198955A1

    Publication date: 2009-08-06

    Application No.: US12024486

    Filing date: 2008-02-01

    IPC classification: G06F15/167 G06F12/00

    Abstract: A distributed data processing system includes: (1) a first node with a processor, a first memory, and asynchronous memory mover logic; (2) a second node having a second memory; and a connection mechanism that connects the two nodes. The processor includes processing logic for completing a cross-node asynchronous memory move (AMM) operation, wherein the processor performs a move of data in virtual address space from a first effective address to a second effective address, and the asynchronous memory mover logic completes a physical move of the data from a first memory location in the first memory having a first real address to a second memory location in the second memory having a second real address. The data is transmitted via the connection mechanism connecting the two nodes independent of the processor.

    FULLY ASYNCHRONOUS MEMORY MOVER
    67.
    Patent application
    Status: Not in force

    Publication No.: US20090198934A1

    Publication date: 2009-08-06

    Application No.: US12024613

    Filing date: 2008-02-01

    IPC classification: G06F12/02 G06F12/14 G06F9/46

    Abstract: A data processing system has a processor and a memory coupled to the processor and an asynchronous memory mover coupled to the processor. The asynchronous memory mover has registers for receiving a set of parameters from the processor, which parameters are associated with an asynchronous memory move (AMM) operation initiated by the processor in virtual address space, utilizing a source effective address and a destination effective address. The asynchronous memory mover performs the AMM operation to move the data from a first physical memory location having a source real address corresponding to the source effective address to a second physical memory location having a destination real address corresponding to the destination effective address. The asynchronous memory mover has an associated off-chip translation mechanism. The AMM operation thus occurs independent of the processor, and the processor continues processing other operations independent of the AMM operation.
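
    Illustrative sketch (not taken from the patent): the division of work in the abstract can be modelled with a worker thread playing the mover. The processor supplies effective addresses, and the mover translates them to real addresses and copies the data while the caller carries on. The page size, the dictionary page table, and the function names are hypothetical, and the sketch assumes a move does not cross a page boundary.

```python
from concurrent.futures import Future, ThreadPoolExecutor

PAGE = 4096  # hypothetical page size


def translate(page_table: dict, effective_addr: int) -> int:
    """Map an effective address to a real address via a toy page table."""
    page, offset = divmod(effective_addr, PAGE)
    return page_table[page] * PAGE + offset


def amm_move(executor: ThreadPoolExecutor, memory: bytearray, page_table: dict,
             src_ea: int, dst_ea: int, length: int) -> Future:
    """Hand the move parameters to the mover and return immediately."""

    def mover() -> None:
        # Translation and the physical copy both happen off the caller's thread.
        src = translate(page_table, src_ea)
        dst = translate(page_table, dst_ea)
        memory[dst:dst + length] = memory[src:src + length]

    return executor.submit(mover)
```

    The caller can keep doing unrelated work and call `result()` on the returned future only when it needs the move to be complete.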

    SPECIALIZED MEMORY MOVE BARRIER OPERATIONS
    68.
    Patent application
    Status: Not in force

    Publication No.: US20090198917A1

    Publication date: 2009-08-06

    Application No.: US12024674

    Filing date: 2008-02-01

    IPC classification: G06F9/00

    Abstract: An instruction set architecture (ISA) includes an asynchronous memory move (AMM) synchronization (SYNC) instruction. When a processor of a data processing system executes the AMM SYNC instruction, the processor prevents an AMM operation generated by a subsequently received/executed AMM ST instruction from proceeding with the data move portion of the AMM operation within the memory subsystem until completion of all ongoing memory access operations within the memory subsystem and fabric. The AMM operation does not wait for a normal barrier operation. The processor forwards the information needed to initiate the AMM operation to asynchronous memory mover logic, and signals the logic not to proceed with the AMM operation until it is signaled that the AMM SYNC has completed.

    CACHE MANAGEMENT DURING ASYNCHRONOUS MEMORY MOVE OPERATIONS
    69.
    Patent application
    Status: In force

    Publication No.: US20090198897A1

    Publication date: 2009-08-06

    Application No.: US12024526

    Filing date: 2008-02-01

    IPC classification: G06F12/08

    Abstract: A data processing system includes a mechanism for completing an asynchronous memory move (AMM) operation in which the processor receives an AMM ST instruction and processes a processor-level move of data in virtual address space and an asynchronous memory mover then completes a physical move of the data within the real address space (memory). A status/control field of the AMM ST instruction includes an indication of a requested treatment of the lower level cache(s) on completion of the AMM operation. When the status/control field indicates an update to at least one cache should be performed, the asynchronous memory mover automatically forwards a copy of the data from the data move to the lower level cache, and triggers an update of a coherency state for a cache line in which the copy of the data is placed.

    Mechanism to Provide Reliability Through Packet Drop Detection
    70.
    Patent application
    Status: Not in force

    Publication No.: US20090198762A1

    Publication date: 2009-08-06

    Application No.: US12024600

    Filing date: 2008-02-01

    IPC classification: G06F15/16

    CPC classification: G06F9/544

    Abstract: A method and a data processing system for completing checkpoint processing of a distributed job with local tasks communicating with other remote tasks via a host fabric interface (HFI) and an assigned HFI window. Each HFI window has a send count and a receive count, which track GSM messages that are sent from and received at the HFI window. When a checkpoint is initiated by a master task, each local task forwards the send count and the receive count to the master task. The master task sums the respective counts and then compares the totals to each other. When the send count total is equal to the receive count total, the tasks are permitted to continue processing. However, when the send count total is not equal to the receive count total, the master task notifies each task of the job to roll back to a previous checkpoint or to kill the job execution.
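
    Illustrative sketch (not taken from the patent): the master task's comparison amounts to summing two columns and checking them for equality. The function name, the `(send_count, receive_count)` tuple layout, and the returned strings are hypothetical.

```python
def checkpoint_decision(window_counts: list) -> str:
    """Decide the checkpoint outcome from per-task (send_count, receive_count) pairs."""
    total_sent = sum(sent for sent, _ in window_counts)
    total_received = sum(received for _, received in window_counts)
    # Equal totals mean every message sent within the job was also received.
    if total_sent == total_received:
        return "continue"
    # A mismatch suggests a dropped packet somewhere in the job.
    return "rollback_or_kill"
```

    For example, two tasks reporting `(3, 1)` and `(2, 4)` give totals of 5 sent and 5 received, so processing continues. If the second task had reported `(2, 3)`, the totals would differ and the job would roll back or be killed.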
