Abstract:
Generally described, systems and methods are provided for detecting the impact of network failures. The system collects performance information from a plurality of nodes and links in a network, aggregates the collected performance information across paths in the network, processes the aggregated performance information for detecting failures on the paths, adjusts the set of performance information by removing the performance information for any nodes considered to be associated with performance information that is statistically different from performance information from other nodes at a given location or extrapolates the collected information to other paths, and determines the impact to customers of the network failures detected using the adjusted set of performance information.
Abstract:
Techniques and solutions for distributing routing updates using timing information are described. For example, inter-device messaging protocols can support timing information (e.g., routing protocols be extended to support timing information). The timing information can comprise an indication of a time at which the routing update is to be put into effect. Using the timing information, a number of routing devices can put the routing update into effect at the same time (e.g., at an update reference time) according to a synchronous mode. Routing updates can be managed between routing devices that support timing information and routing devices that do not support timing information.
Abstract:
Systems and methods are provided for highly accurate synchronization of machine instances in a distributed, hosted computing environment to a reference timekeeper. In addition to a general communication network accessible to machine instances, the distributed environment includes a second network dedicated to carrying time information, such as a pulse-per-second (PPS) signal to isolated timing hardware within host computing devices. The isolated timing hardware can use the PPS signal, along with a reference time, to set a hardware clock. The isolated timing hardware can further provide an interface to machine instances that enables the instances to read the time of the hardware clock. This configuration enables many instances can share access to a single reference timekeeper, thus synchronizing those instances to a much higher accuracy than in traditional network-based time protocols.
Abstract:
Optical transceivers include receiver-side and transmitter-side optical switches that selectively route optical signals to be transmitted to a wavelength division multiplexer or demultiplexed received signals to corresponding photodetectors. Other optical signals are transmitted or received via optical fibers and directed by the optical switches without wavelength multiplexing. In some examples, the switches are programmable and data routing can be altered as desired. In other examples, the switches are fixed, and data routing is altered by switch replacement.
Abstract:
Generally described, systems and methods are provided for detecting the impact of network failures. The system collects performance information from a plurality of nodes and links in a network, aggregates the collected performance information across paths in the network, processes the aggregated performance information for detecting failures on the paths, adjusts the set of performance information by removing the performance information for any nodes considered to be associated with performance information that is statistically different from performance information from other nodes at a given location or extrapolates the collected information to other paths, and determines the impact to customers of the network failures detected using the adjusted set of performance information.
Abstract:
Technologies are provided for preventing transmission of cyclic redundancy check (CRC) errors, and/or other errors, within a computer network that uses cut-through forwarding of network packets. For example, counts of CRC errors can be maintained for network packets processed by network devices. When a CRC error condition is detected based on the counts of CRC errors, cut-through forwarding is disabled on at least a portion of the network (e.g., on one or more specific network ports and/or on one or more specific network devices). While cut-through forwarding is disabled, the affected portion of the network uses store-and-forward for processing network packets. When the CRC error condition no longer exists, then cut-through forwarding is re-enabled on the portion of the network.
Abstract:
Techniques and solutions for distributing routing updates using timing information are described. For example, inter-device messaging protocols can support timing information (e.g., routing protocols be extended to support timing information). The timing information can indicate a time at which the routing update is to be put into effect where the time decreases the further away a given routing device is from the source of the routing update. For example, each routing device can determine a decayed update reference time when the given routing device will put the routing update into effect according to a decay mode. Routing updates can be managed between routing devices that support timing information and routing devices that do not support timing information.
Abstract:
Methods and apparatus for transparent multipath utilization through encapsulation are disclosed. Respective encapsulation packets are generated for at least two different baseline packets transmitted between a source and destination linked by multiple network paths. Each encapsulation packet comprises contents of a corresponding baseline packet, and one or more data values selected in accordance with a path balancing policy. The data values added to one encapsulation packet may differ from those added to another. Different network paths to the destination may be selected for different encapsulation packets of a given transmission based at least in part on the added data values.
Abstract:
Various embodiments of apparatuses and methods for multi-cast, multiple unicast, and unicast distribution of messages with time synchronized delivery are described. In some embodiments, the disclosed system and methods include a reference timekeeper providing a reference clock to one or more host computing devices. The one or more host computing devices host compute instances, and also contain respective isolated timing hardware outside the control of the compute instances. The isolated timing hardware of the one or more host computing devices then receive respective packets, and obtain the same time to deliver the respective packets. Each isolated timing hardware provides either the packet, or information to access the packet, to its respective destination compute instance subsequent to determining that the same specified time to deliver the packet has occurred. Thus, the respective packets are delivered near simultaneously to the one or more destination compute instances.
Abstract:
Directional capacity of interfaces for networking devices are dynamically modified. Network traffic utilization of one direction of a network interface may be determined. A modification to a capacity of the networking device to process network traffic in the one direction of the network may be determined. The modification may then be applied to the networking device so that subsequent network traffic is processed according to the modified capacity in the one direction of the interface.