METHOD AND SYSTEM FOR TRAINING AND VALIDATING MACHINE LEARNING IN NETWORK ENVIRONMENTS

    公开(公告)号:US20190294995A1

    公开(公告)日:2019-09-26

    申请号:US16359336

    申请日:2019-03-20

    Abstract: A system and method for training and validating ML algorithms in real networks, including: generating synthetic traffic and receiving it along with real traffic; aggregating the received traffic into network flows by using metadata and transforming them to generate a first dataset readable by the ML algorithm, comprising features defined by the metadata; labelling the traffic and selecting a subset of the features from the labelled dataset used in an iterative training to generate a trained model; filtering out a part of real traffic to obtain a second labelled dataset; and selecting a subset of features from the second labelled dataset used for validating the trained model by comparing predicted results for the trained model and the labels; repeating the steps with a different subset of features to generate another trained model until results are positive in terms of precision or accuracy.

    METHOD AND SYSTEM FOR OPTIMIZING EVENT PREDICTION IN DATA SYSTEMS

    公开(公告)号:US20190392292A1

    公开(公告)日:2019-12-26

    申请号:US16446649

    申请日:2019-06-20

    Abstract: A system and method for optimizing event prediction in data systems, wherein at least one source (100) comprises: a data collector periodically collecting (101) real data values (300) to generate a stream of data modeled as a time series; a generator (110) of prediction models (M1, M2, M3, . . . , Mx) to which the collected values from the data collector are input; a first forecast module (120) receiving (102) one of the generated prediction models (M1, M2, M3, . . . , Mx) for generating a predicted value (310) and computing a committed error (320) by comparing the predicted value (310) with the real data value (300); and wherein the source (100) sends (105) the committed error (320) within the time series to the destination (200) only if the committed error (320) exceeds a threshold and wherein the destination (200) comprises: a second forecast module (210) receiving (204) the same prediction model (M1, M2, M3, . . . , Mx) from the generator (110) through a communication channel (103); a correction module (220) for obtaining (203) the real data value by the generated prediction model (M1, M2, M3, . . . , Mx) and applying the committed error (320) if received (202) from the source (100).

Patent Agency Ranking