-
公开(公告)号:US20240406058A1
公开(公告)日:2024-12-05
申请号:US18629132
申请日:2024-04-08
Applicant: Nvidia Corporation
Inventor: Elad Alon , Eitan Zahavi , Gaby Diengott , Shie Mannor , Vadim Gechman
IPC: H04L41/0659 , H04L41/147 , H04L43/06 , H04L43/0811
Abstract: A network monitor may execute, or communicate with, one or more stored machine learning models that are trained to predict a failure probability for one or more ports and/or links within a network fabric. Systems and methods may monitor a set of ports and/or links to generate predictions for failure probabilities using a first trained model and low frequency telemetry data. For a subset of ports and/or links with failure probabilities exceeding a first threshold, high speed telemetry data may be used by a second trained model to generate predictions for failure probabilities for the subset of ports. Suspicious ports may then be isolated and undergo various remediation and/or monitoring actions prior to de-isolating the isolated ports.