Partial broadcast method in parallel computer and a parallel computer
suitable therefor
    1.
    Granted invention patent (lapsed)

    Publication No.: US5826049A

    Publication date: 1998-10-20

    Application No.: US916630

    Filing date: 1992-07-22

    CPC classification: G06F15/17368

    Abstract: In order to determine a transfer path of a message to a receiving-end processor group, a processor includes a routing bit generation circuit, and an exchange switch includes partial broadcast path control circuits and a path control information alteration circuit. In order to define the range of a receiving-end processor group, a network includes transfer control circuits. A crossbar switch includes transfer control circuits associated with output ports and a boundary register group. When a partial broadcast message is transferred from an input port in the downstream direction of an output port, it is decided whether a processor belonging to the partial broadcast range associated with the processor connected to the particular input port is connected to the particular output port, whereby the partial broadcast message is transferred from that output port.
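
    The output-port decision described in the abstract can be illustrated with a minimal sketch. It is not the patented circuit: it assumes that each boundary register pair can be modelled as an inclusive range of processor numbers, and that each output port knows the range of processors reachable downstream of it. The names `Range`, `output_ports_for_partial_broadcast`, `broadcast_range_of_input`, and `reachable_from_output` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    """Inclusive range of processor numbers (assumed model of a boundary register pair)."""
    low: int
    high: int

    def overlaps(self, other: "Range") -> bool:
        # Two inclusive ranges share at least one processor number.
        return self.low <= other.high and other.low <= self.high


def output_ports_for_partial_broadcast(input_port, broadcast_range_of_input,
                                       reachable_from_output):
    """Return the output ports that should forward a partial broadcast message.

    broadcast_range_of_input: input port -> partial broadcast Range (hypothetical table)
    reachable_from_output:    output port -> Range of processors downstream of it
    """
    target = broadcast_range_of_input[input_port]
    # Forward only through ports that lead to at least one processor in the range.
    return [port
            for port, reachable in sorted(reachable_from_output.items())
            if target.overlaps(reachable)]


if __name__ == "__main__":
    reachable = {0: Range(0, 3), 1: Range(4, 7), 2: Range(8, 11), 3: Range(12, 15)}
    ranges = {0: Range(4, 11)}
    print(output_ports_for_partial_broadcast(0, ranges, reachable))
```

    In this example the broadcast range 4 to 11 overlaps only the processors behind output ports 1 and 2, so the message is not sent toward ports 0 and 3.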

    Inter-processor communication method for transmitting data and processor
dependent information predetermined for a receiving process of another
processor
    5.
    Granted invention patent (lapsed)

    Publication No.: US5386566A

    Publication date: 1995-01-31

    Application No.: US853427

    Filing date: 1992-03-18

    IPC classification: G06F9/46 H04L29/00 G06F13/00

    CPC classification: G06F9/544 H04L29/00

    Abstract: In a parallel computer, in order to reduce the overhead of data transmission between processes, data is transmitted from the virtual space of a process in one cluster to the virtual space of a process in another cluster without being copied to a buffer provided within the operating system. A real communication area resident in real memory is provided in a part of the virtual space of each process, and an identifier unique within the cluster is given to the communication area. When the transmitting process issues a transmission instruction, the cluster address of the cluster in which the destination process exists and the identifier of its communication area are determined from the name of the destination process. The data is then transmitted directly between the real communication areas of the originating process and the destination process. Because no copy is made between the user space and an operating-system buffer at the time of transmission, the overhead of data transmission between the processes is reduced.
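
    The name-based lookup and direct transfer described in the abstract can be sketched as follows. This is an illustrative model only: clusters and communication areas are plain Python objects, the transfer is an in-memory slice assignment standing in for the network transfer, and the names `Cluster`, `Directory`, and `send` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """A cluster holding communication areas keyed by a cluster-unique identifier."""
    address: int
    areas: dict = field(default_factory=dict)  # area id -> bytearray

    def register_area(self, area_id: int, size: int) -> None:
        self.areas[area_id] = bytearray(size)


class Directory:
    """Maps a process name to (cluster address, communication area id)."""

    def __init__(self):
        self._by_name = {}

    def register(self, process_name: str, cluster_address: int, area_id: int) -> None:
        self._by_name[process_name] = (cluster_address, area_id)

    def resolve(self, process_name: str):
        return self._by_name[process_name]


def send(clusters, directory, source, destination_name, length):
    """Move `length` bytes from the sender's area straight into the receiver's area.

    source is a (cluster address, area id) pair; no operating-system staging
    buffer is modelled between the two communication areas.
    """
    src_cluster, src_area = source
    dst_cluster, dst_area = directory.resolve(destination_name)
    src = clusters[src_cluster].areas[src_area]
    dst = clusters[dst_cluster].areas[dst_area]
    if length > len(src) or length > len(dst):
        raise ValueError("transfer does not fit in the communication area")
    dst[:length] = src[:length]


if __name__ == "__main__":
    clusters = {0: Cluster(0), 1: Cluster(1)}
    clusters[0].register_area(7, 16)
    clusters[1].register_area(3, 16)
    directory = Directory()
    directory.register("receiver", cluster_address=1, area_id=3)
    clusters[0].areas[7][:5] = b"hello"
    send(clusters, directory, (0, 7), "receiver", 5)
    print(bytes(clusters[1].areas[3][:5]))
```

    The sender names only the destination process; the cluster address and area identifier are derived from that name before the transfer starts.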

    Fault handling and recovery for system having plural processors
    6.
    Granted invention patent (lapsed)

    Publication No.: US5758053A

    Publication date: 1998-05-26

    Application No.: US189683

    Filing date: 1994-02-01

    Abstract: Parallel processors communicate with each other over a network by transmitting messages that include destination processor information. A message controller for each processor in the network receives the messages and checks them for faults, particularly in the destination processor number contained in the first word of the message. If a fault occurs in the destination processor number, the faulty message is transmitted to an appropriate processor for handling the fault. In this way, network operation is not suspended because of the fault, and the message is not left in the network as a result of the error in the destination processor number. The processor to which the faulty message is directed is determined by a substitute destination processor number contained in the message, or is predetermined and set in another way, such as by a service processor. To recover from the fault, the processor receiving the faulty message can request that the message be retransmitted, or the error can be corrected using an ECC, for example. If the faulty message cannot be retransmitted, the processor or the host processor can request that the job to which the faulty message pertains be canceled by all of the processors executing that job, without affecting the simultaneous execution of other jobs by the same processors.
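
    The redirection rule in the abstract can be sketched as a small routing function. This is an assumption-laden illustration, not the patented message controller: the fault check on the first word is reduced to a boolean flag plus a range check, and the names `Message`, `route`, and `default_handler` (standing in for a destination preset by a service processor) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Message:
    destination: int             # destination processor number (first word)
    substitute_destination: int  # fallback processor carried in the message
    payload: bytes
    destination_valid: bool      # stands in for a parity/ECC check on the first word


def route(message: Message, processor_count: int, default_handler: int = 0):
    """Return (processor to deliver to, whether the message is flagged as faulty).

    A faulty destination never blocks the network: the message is redirected to
    the substitute destination, or to a preset handler if that is unusable too.
    """
    destination_ok = (message.destination_valid
                      and 0 <= message.destination < processor_count)
    if destination_ok:
        return message.destination, False
    if 0 <= message.substitute_destination < processor_count:
        return message.substitute_destination, True
    return default_handler, True


if __name__ == "__main__":
    good = Message(3, 1, b"data", destination_valid=True)
    bad = Message(3, 1, b"data", destination_valid=False)
    print(route(good, processor_count=4))
    print(route(bad, processor_count=4))
```

    The processor that receives a message flagged as faulty would then decide on recovery, for example by requesting retransmission or by asking that the affected job be canceled.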
