Mechanism for Guaranteeing Delivery of Multi-Packet GSM Message
    1.
    发明申请
    Mechanism for Guaranteeing Delivery of Multi-Packet GSM Message 失效
    保证多分组GSM消息传递的机制

    公开(公告)号:US20090199209A1

    公开(公告)日:2009-08-06

    申请号:US12024678

    申请日:2008-02-01

    IPC分类号: G06F3/00

    CPC分类号: H04L1/1642 G06F9/542

    摘要: A target task ensures complete delivery of a global shared memory (GSM) message from an originating task to the target task. The target task's HFI receives a first of multiple GSM packets generated from a single GSM message sent from the originating task. The HFI logic assigns a sequence number and corresponding tuple to track receipt of the complete GSM message. The sequence number is unique relative to other sequence numbers assigned to GSM messages that have not been completely received from the initiating task. The HFI updates a count value within the tuple, which comprises the sequence number and the count value for the first GSM packet and for each subsequent GSM packet received for the GSM message. The HFI determines when receipt of the GSM message is complete by comparing the count value with a count total retrieved from the packet header.

    摘要翻译: 目标任务确保从始发任务到目标任务的全局共享存储器(GSM)消息的完全传递。 目标任务的HFI接收从发起任务发送的单个GSM消息产生的多个GSM分组中的第一个。 HFI逻辑分配序列号和对应的元组来跟踪完整GSM消息的接收。 相对于分配给尚未完全从发起任务接收的GSM消息的其他序列号,序列号是唯一的。 HFI更新元组内的计数值,其包括第一GSM分组的序列号和计数值以及为GSM消息接收的每个后续GSM分组。 通过将计数值与从分组报头检索的计数总数进行比较,HFI确定接收到GSM消息的完成。

    Mechanism to Prevent Illegal Access to Task Address Space by Unauthorized Tasks
    2.
    发明申请
    Mechanism to Prevent Illegal Access to Task Address Space by Unauthorized Tasks 有权
    通过未经授权的任务防止非法访问任务地址空间的机制

    公开(公告)号:US20090199194A1

    公开(公告)日:2009-08-06

    申请号:US12024410

    申请日:2008-02-01

    IPC分类号: G06F9/50

    CPC分类号: G06F9/544 G06F9/468

    摘要: A method and data processing system for tracking global shared memory (GSM) operations to and from a local node configured with a host fabric interface (HFI) coupled to a network fabric. During task/job initialization, the system OS assigns HFI window(s) to handle the GSM packet generation and GSM packet receipt and processing for each local task. HFI processing logic automatically tags each GSM packet generated by the HFI window with a global job identifier (ID) of the job to which the local task is affiliated. The job ID is embedded within each GSM packet placed on the network fabric. On receipt of a GSM packet from the network fabric, the HFI logic retrieves the embedded job ID and compares the embedded job ID with the ID within the HFI window(s). GSM packets are forwarded to an HFI window only when the embedded job ID matches the HFI window's job ID.

    摘要翻译: 一种用于跟踪与配置有耦合到网络结构的主机结构接口(HFI)的本地节点的全局共享存储器(GSM)操作的方法和数据处理系统。 在任务/作业初始化期间,系统OS分配HFI窗口来处理每个本地任务的GSM分组生成和GSM分组接收和处理。 HFI处理逻辑自动将由HFI窗口生成的每个GSM分组标记为本地任务附属于该作业的全局作业标识符(ID)。 作业ID被嵌入到放置在网络结构上的每个GSM分组内。 在从网络结构接收到GSM分组时,HFI逻辑检索嵌入的作业ID,并将嵌入的作业ID与HFI窗口内的ID进行比较。 仅当嵌入的作业ID与HFI窗口的作业ID匹配时,才将GSM数据包转发到HFI窗口。

    Mechanism to provide software guaranteed reliability for GSM operations
    3.
    发明授权
    Mechanism to provide software guaranteed reliability for GSM operations 有权
    机制为GSM操作提供软件保证的可靠性

    公开(公告)号:US07797588B2

    公开(公告)日:2010-09-14

    申请号:US12024637

    申请日:2008-02-01

    IPC分类号: G06F11/00 G06F15/167

    CPC分类号: G06F9/542 G06F9/50 G06F9/546

    摘要: In a global shared memory (GSM) environment, an initiating task at a first node with a host fabric interface (HFI) uses epochs to provide reliability of transmission of packets via a network fabric to a target task. The HFI generates a packet for the initiating task addressed to the target task, and automatically inserts a current epoch of the initiating task into the packet. A copy of the current epoch is maintained by the target task, which accepts for processing only packets having the correct epoch, unless the packet is tagged for guaranteed-once delivery. When a packet delivery is accepted, the target task sends a notification to the initiating task. If the initiating task does not receive the notification of delivery for the issued packet, the initiating task updates the epoch at both the target node and the initiating node and re-transmits the packet.

    摘要翻译: 在全球共享存储器(GSM)环境中,具有主机结构接口(HFI)的第一节点处的发起任务使用时代来提供经由网络结构向目标任务发送分组的可靠性。 HFI生成一个寻址到目标任务的启动任务的数据包,并自动将当前时刻的启动任务插入到数据包中。 目标任务的副本由目标任务维护,目标任务仅接受处理具有正确时期的分组,除非分组被标记为保证一次传递。 当接收到分组传递时,目标任务向发起任务发送通知。 如果发起任务没有接收到所发送的分组的传送通知,则起始任务在目标节点和发起节点两者处更新历元,并重新发送分组。

    Mechanism to prevent illegal access to task address space by unauthorized tasks
    4.
    发明授权
    Mechanism to prevent illegal access to task address space by unauthorized tasks 有权
    通过未经授权的任务防止非法访问任务地址空间的机制

    公开(公告)号:US08275947B2

    公开(公告)日:2012-09-25

    申请号:US12024410

    申请日:2008-02-01

    CPC分类号: G06F9/544 G06F9/468

    摘要: A method and data processing system for tracking global shared memory (GSM) operations to and from a local node configured with a host fabric interface (HFI) coupled to a network fabric. During task/job initialization, the system OS assigns HFI window(s) to handle the GSM packet generation and GSM packet receipt and processing for each local task. HFI processing logic automatically tags each GSM packet generated by the HFI window with a global job identifier (ID) of the job to which the local task is affiliated. The job ID is embedded within each GSM packet placed on the network fabric. On receipt of a GSM packet from the network fabric, the HFI logic retrieves the embedded job ID and compares the embedded job ID with the ID within the HFI window(s). GSM packets are forwarded to an HFI window only when the embedded job ID matches the HFI window's job ID.

    摘要翻译: 一种用于跟踪与配置有耦合到网络结构的主机结构接口(HFI)的本地节点的全局共享存储器(GSM)操作的方法和数据处理系统。 在任务/作业初始化期间,系统OS分配HFI窗口来处理每个本地任务的GSM分组生成和GSM分组接收和处理。 HFI处理逻辑自动将由HFI窗口生成的每个GSM分组标记为本地任务附属于该作业的全局作业标识符(ID)。 作业ID被嵌入到放置在网络结构上的每个GSM分组内。 在从网络结构接收到GSM分组时,HFI逻辑检索嵌入的作业ID,并将嵌入的作业ID与HFI窗口内的ID进行比较。 仅当嵌入的作业ID与HFI窗口的作业ID匹配时,才将GSM数据包转发到HFI窗口。

    Generating and issuing global shared memory operations via a send FIFO
    5.
    发明授权
    Generating and issuing global shared memory operations via a send FIFO 有权
    通过发送FIFO生成和发出全局共享内存操作

    公开(公告)号:US08200910B2

    公开(公告)日:2012-06-12

    申请号:US12024664

    申请日:2008-02-01

    IPC分类号: G06F12/00

    CPC分类号: G06F9/544

    摘要: A method for issuing global shared memory (GSM) operations from an originating task on a first node coupled to a network fabric of a distributed network via a host fabric interface (HFI). The originating task generates a GSM command within an effective address (EA) space. The task then places the GSM command within a send FIFO. The send FIFO is a portion of real memory having real addresses (RA) that are memory mapped to EAs of a globally executing job. The originating task maintains a local EA-to-RA mapping of only a portion of the real address space of the globally executing job. The task enables the HFI to retrieve the GSM command from the send FIFO into an HFI window allocated to the originating task. The HFI window generates a corresponding GSM packet containing GSM operations and/or data, and the HFI window issues the GSM packet to the network fabric.

    摘要翻译: 一种用于通过主机结构接口(HFI)从耦合到分布式网络的网络结构的第一节点上的始发任务发出全局共享存储器(GSM)操作的方法。 始发任务在有效地址(EA)空间内生成GSM命令。 然后任务将GSM命令放在发送FIFO中。 发送FIFO是具有存储器映射到全局执行作业的EA的实际地址(RA)的实际存储器的一部分。 始发任务维护仅全局执行作业的实际地址空间的一部分的本地EA到RA映射。 该任务使HFI能够将GSM命令从发送FIFO检索到分配给始发任务的HFI窗口中。 HFI窗口产生包含GSM操作和/或数据的相应的GSM分组,并且HFI窗口向网络结构发出GSM分组。

    Mechanism to provide reliability through packet drop detection
    6.
    发明授权
    Mechanism to provide reliability through packet drop detection 失效
    通过分组丢包检测提供可靠性的机制

    公开(公告)号:US07877436B2

    公开(公告)日:2011-01-25

    申请号:US12024600

    申请日:2008-02-01

    IPC分类号: G06F15/16

    CPC分类号: G06F9/544

    摘要: A method and a data processing system for completing checkpoint processing of a distributed job with local tasks communicating with other remote tasks via a host fabric interface (HFI) and assigned HFI window. Each HFI window has a send count and a receive count, which tracks GSM messages that are sent from and received at the HFI window. When a checkpoint is initiated by a master task, each local task forwards the send count and the receive count to the master task. The master task sums the respective counts and then compares the totals to each other. When the send count total is equal to the receive count total, the tasks are permitted to continue processing. However, when the send count total is not equal to the receive count total, the master task notifies each task of the job to rollback to a previous checkpoint or kill the job execution.

    摘要翻译: 一种方法和数据处理系统,用于通过主机结构接口(HFI)和分配的HFI窗口完成与其他远程任务通信的本地任务的分布式作业的检查点处理。 每个HFI窗口都有发送计数和接收计数,用于跟踪在HFI窗口发送和接收的GSM消息。 当主任务启动检查点时,每个本地任务将发送计数和接收计数转发给主任务。 主任务对各个计数进行相加,然后将总计相互比较。 当发送计数总数等于接收计数总数时,允许任务继续处理。 但是,当发送计数总数不等于接收计数总数时,主任务会通知作业的每个任务以回滚到先前的检查点或终止作业执行。

    Mechanism to perform debugging of global shared memory (GSM) operations
    7.
    发明授权
    Mechanism to perform debugging of global shared memory (GSM) operations 失效
    执行全局共享内存(GSM)操作调试的机制

    公开(公告)号:US07873879B2

    公开(公告)日:2011-01-18

    申请号:US12024585

    申请日:2008-02-01

    IPC分类号: G06F11/00

    CPC分类号: G06F13/385

    摘要: A host fabric interface (HFI) enables debugging of global shared memory (GSM) operations received at a local node from a network fabric. The local node has a memory management unit (MMU), which provides an effective address to real address (EA-to-RA) translation table that is utilized by the HFI to evaluate when EAs of GSM operations/data from a received GSM packet is memory-mapped to RAs of the local memory. The HFI retrieves the EA associated with a GSM operation/data within a received GSM packet. The HFI forwards the EA to the MMU, which determines when the EA is mapped to RAs within the local memory for the local task. The HFI processing logic enables processing of the GSM packet only when the EA of the GSM operation/data within the GSM packet is an EA that has a local RA translation. Non-matching EAs result in an error condition that requires debugging.

    摘要翻译: 主机结构接口(HFI)可以调试从网络结构在本地节点接收到的全局共享存储器(GSM)操作。 本地节点具有存储器管理单元(MMU),该存储器管理单元(MMU)为HFI用于实际地址(EA-to-RA)转换表提供有效地址,以评估来自接收到的GSM分组的GSM操作/数据的EAs是否为 内存映射到本地内存的RA。 HFI检索与接收的GSM分组内的GSM操作/数据相关联的EA。 HFI将EA转发到MMU,该MMU确定EA何时映射到本地内存中的本地任务的RA。 HFI处理逻辑仅当GSM操作的EA / GSM分组内的数据是具有本地RA转换的EA时才能处理GSM分组。 不匹配的EA会导致需要调试的错误条件。

    Generating and Issuing Global Shared Memory Operations Via a Send FIFO
    8.
    发明申请
    Generating and Issuing Global Shared Memory Operations Via a Send FIFO 有权
    通过发送FIFO生成和发出全局共享内存操作

    公开(公告)号:US20090199195A1

    公开(公告)日:2009-08-06

    申请号:US12024664

    申请日:2008-02-01

    IPC分类号: G06F9/46

    CPC分类号: G06F9/544

    摘要: A method for issuing global shared memory (GSM) operations from an originating task on a first node coupled to a network fabric of a distributed network via a host fabric interface (HFI). The originating task generates a GSM command within an effective address (EA) space. The task then places the GSM command within a send FIFO. The send FIFO is a portion of real memory having real addresses (RA) that are memory mapped to EAs of a globally executing job. The originating task maintains a local EA-to-RA mapping of only a portion of the real address space of the globally executing job. The task enables the HFI to retrieve the GSM command from the send FIFO into an HFI window allocated to the originating task. The HFI window generates a corresponding GSM packet containing GSM operations and/or data, and the HFI window issues the GSM packet to the network fabric.

    摘要翻译: 一种用于通过主机结构接口(HFI)从耦合到分布式网络的网络结构的第一节点上的始发任务发出全局共享存储器(GSM)操作的方法。 始发任务在有效地址(EA)空间内生成GSM命令。 然后任务将GSM命令放在发送FIFO中。 发送FIFO是具有存储器映射到全局执行作业的EA的实际地址(RA)的实际存储器的一部分。 始发任务维护仅全局执行作业的实际地址空间的一部分的本地EA到RA映射。 该任务使HFI能够将GSM命令从发送FIFO检索到分配给始发任务的HFI窗口中。 HFI窗口产生包含GSM操作和/或数据的相应的GSM分组,并且HFI窗口向网络结构发出GSM分组。

    Guaranteeing delivery of multi-packet GSM messages
    9.
    发明授权
    Guaranteeing delivery of multi-packet GSM messages 失效
    保证多分组GSM消息的传送

    公开(公告)号:US08146094B2

    公开(公告)日:2012-03-27

    申请号:US12024678

    申请日:2008-02-01

    CPC分类号: H04L1/1642 G06F9/542

    摘要: A target task ensures complete delivery of a global shared memory (GSM) message from an originating task to the target task. The target task's HFI receives a first of multiple GSM packets generated from a single GSM message sent from the originating task. The HFI logic assigns a sequence number and corresponding tuple to track receipt of the complete GSM message. The sequence number is unique relative to other sequence numbers assigned to GSM messages that have not been completely received from the initiating task. The HFI updates a count value within the tuple, which comprises the sequence number and the count value for the first GSM packet and for each subsequent GSM packet received for the GSM message. The HFI determines when receipt of the GSM message is complete by comparing the count value with a count total retrieved from the packet header.

    摘要翻译: 目标任务确保从始发任务到目标任务的全局共享存储器(GSM)消息的完全传递。 目标任务的HFI接收从发起任务发送的单个GSM消息产生的多个GSM分组中的第一个。 HFI逻辑分配序列号和对应的元组来跟踪完整GSM消息的接收。 相对于分配给尚未完全从发起任务接收的GSM消息的其他序列号,序列号是唯一的。 HFI更新元组内的计数值,其包括第一GSM分组的序列号和计数值以及为GSM消息接收的每个后续GSM分组。 通过将计数值与从分组报头检索的计数总数进行比较,HFI确定接收到GSM消息的完成。

    Mechanism to Provide Software Guaranteed Reliability for GSM Operations
    10.
    发明申请
    Mechanism to Provide Software Guaranteed Reliability for GSM Operations 有权
    提供GSM操作软件保证可靠性的机制

    公开(公告)号:US20090199201A1

    公开(公告)日:2009-08-06

    申请号:US12024637

    申请日:2008-02-01

    IPC分类号: G06F9/50

    CPC分类号: G06F9/542 G06F9/50 G06F9/546

    摘要: In a global shared memory (GSM) environment, an initiating task at a first node with a host fabric interface (HFI) uses epochs to provide reliability of transmission of packets via a network fabric to a target task. The HFI generates a packet for the initiating task addressed to the target task, and automatically inserts a current epoch of the initiating task into the packet. A copy of the current epoch is maintained by the target task, which accepts for processing only packets having the correct epoch, unless the packet is tagged for guaranteed-once delivery. When a packet delivery is accepted, the target task sends a notification to the initiating task. If the initiating task does not receive the notification of delivery for the issued packet, the initiating task updates the epoch at both the target node and the initiating node and re-transmits the packet.

    摘要翻译: 在全球共享存储器(GSM)环境中,具有主机结构接口(HFI)的第一节点处的发起任务使用时代来提供经由网络结构向目标任务发送分组的可靠性。 HFI生成一个寻址到目标任务的启动任务的数据包,并自动将当前时刻的启动任务插入到数据包中。 目标任务的副本由目标任务维护,目标任务仅接受处理具有正确时期的分组,除非分组被标记为保证一次传递。 当接收到分组传递时,目标任务向发起任务发送通知。 如果发起任务没有接收到所发送的分组的传送通知,则起始任务在目标节点和发起节点两者处更新历元,并重新发送分组。