Abstract:
A data transmission method, device and system to improve reliability of a data link. When the sender side detects erroneous data, the erroneous data is discarded and a data retransmission request is sent to the sender side to ensure correctness of received data and improve reliability of the data link; and, when the sender side detects the erroneous data and a bit error rate is greater than a preset bit error rate threshold, the data link gets into auto recovery, and data transmission is resumed after the recovery succeeds, thereby avoiding an excessively high bit error rate, preventing an excessively high probability of omitted checks (the higher the bit error rate is, the higher probability of omitted checks is), and further improving reliability of the data link.
Abstract:
In a data processing method, a timeout aggregation node of a cluster obtains first data that is partial aggregated data in a data-intensive computing task. The first data carries a first identifier of a timeout node indicating that a timeout occurs on the timeout node. The timeout aggregation node obtains second data of the timeout node based on the first identifier, where the second data is to-be-aggregated data sent by the timeout node. The timeout aggregation node aggregates the first data and the second data according to a preset rule to obtain third data that is complete aggregated data. The timeout aggregation node then notifies each computing node in the cluster of the third data.
Abstract:
An intermediate device in a computer device includes a first agent unit supporting single-root input/output (I/O) virtualization (SR-IOV) and a second agent unit supporting Virtio, and the first agent unit and the second agent unit each are an agent of a function unit in a network adapter such that, a virtual machine in the computer device may use an SR-IOV technology or a Virtio technology, and does not configure two sets of independent resource pools to separately support corresponding virtualization technologies, to implement normalization of the resource pools. In addition, the intermediate device implements hardware offloading of Virtio protocol packet forwarding using hardware.
Abstract:
A system for implementing interconnection fault tolerance between CPUs, a first CPU and a second CPU implements interconnection through a first CPU interconnect device and a second CPU interconnect device. The system adds a data channel between a first SerDes interface of the first CPU interconnect device and a second SerDes interface of the second CPU interconnect device, and transmits link connection state information and a link control signal through the added data channel. The system monitors a link state of any one link in a CPU interconnection system, transmits the link state through the added data channel, recovers any one of the connection links when determining whether any one of the first connection link, the second connection link and the third connection link is faulty.
Abstract:
An intermediate device in a computer device includes a first agent unit supporting single-root input/output (I/O) virtualization (SR-IOV) and a second agent unit supporting Virtio, and the first agent unit and the second agent unit each are an agent of a function unit in a network adapter such that, a virtual machine in the computer device may use an SR-IOV technology or a Virtio technology, and does not configure two sets of independent resource pools to separately support corresponding virtualization technologies, to implement normalization of the resource pools. In addition, the intermediate device implements hardware offloading of Virtio protocol packet forwarding using hardware.
Abstract:
In a data processing method, a timeout aggregation node of a cluster obtains first data that is partial aggregated data in a data-intensive computing task. The first data carries a first identifier of a timeout node indicating that a timeout occurs on the timeout node. The timeout aggregation node obtains second data of the timeout node based on the first identifier, where the second data is to-be-aggregated data sent by the timeout node. The timeout aggregation node aggregates the first data and the second data according to a preset rule to obtain third data that is complete aggregated data. The timeout aggregation node then notifies each computing node in the cluster of the third data.
Abstract:
The present disclosure provides a CPU interconnect device, the CPU interconnect device connects with a first CPU, which includes a quick path interconnect QPI interface and a serial deserial SerDes interface, the quick path interconnect QPI interface receives serial QPI data sent from a CPU, converts the received serial QPI data into a parallel QPI data, and outputs the parallel QPI data to the serial deserial SerDes interface; the serial deserial SerDes interface converts the parallel QPI data output by the QPI interface into a high-speed serial SerDes data and then send the high-speed serial SerDes data to another CPU interconnect device connected with another CPU. The defects of poor scalability, long data transmission delay, and a high cost of an existing interconnect system among CPUs can be solved.
Abstract:
A data transmission method, device and system to improve reliability of a data link. When the sender side detects erroneous data, the erroneous data is discarded and a data retransmission request is sent to the sender side to ensure correctness of received data and improve reliability of the data link; and, when the sender side detects the erroneous data and a bit error rate is greater than a preset bit error rate threshold, the data link gets into auto recovery, and data transmission is resumed after the recovery succeeds, thereby avoiding an excessively high bit error rate, preventing an excessively high probability of omitted checks (the higher the bit error rate is, the higher probability of omitted checks is), and further improving reliability of the data link.
Abstract:
A system for implementing interconnection fault tolerance between CPUs, a first CPU and a second CPU implements interconnection through a first CPU interconnect device and a second CPU interconnect device. The system adds a data channel between a first SerDes interface of the first CPU interconnect device and a second SerDes interface of the second CPU interconnect device, and transmits link connection state information and a link control signal through the added data channel. The system monitors a link state of any one link in a CPU interconnection system, transmits the link state through the added data channel, recovers any one of the connection links when determining whether any one of the first connection link, the second connection link and the third connection link is faulty.
Abstract:
The present disclosure provides a CPU interconnect device, the CPU interconnect device connects with a first CPU, which includes a quick path interconnect QPI interface and a serial deserial SerDes interface, the quick path interconnect QPI interface receives serial QPI data sent from a CPU, converts the received serial QPI data into a parallel QPI data, and outputs the parallel QPI data to the serial deserial SerDes interface; the serial deserial SerDes interface converts the parallel QPI data output by the QPI interface into a high-speed serial SerDes data and then send the high-speed serial SerDes data to another CPU interconnect device connected with another CPU. The defects of poor scalability, long data transmission delay, and a high cost of an existing interconnect system among CPUs can be solved.