-
1.
公开(公告)号:US20190294995A1
公开(公告)日:2019-09-26
申请号:US16359336
申请日:2019-03-20
Applicant: TELEFONICA, S.A.
Inventor: Antonio Pastor Perales , Diego R. Lopez , Alberto Mozo Velasco , Sandra Gomez Canaval
Abstract: A system and method for training and validating ML algorithms in real networks, including: generating synthetic traffic and receiving it along with real traffic; aggregating the received traffic into network flows by using metadata and transforming them to generate a first dataset readable by the ML algorithm, comprising features defined by the metadata; labelling the traffic and selecting a subset of the features from the labelled dataset used in an iterative training to generate a trained model; filtering out a part of real traffic to obtain a second labelled dataset; and selecting a subset of features from the second labelled dataset used for validating the trained model by comparing predicted results for the trained model and the labels; repeating the steps with a different subset of features to generate another trained model until results are positive in terms of precision or accuracy.
-
公开(公告)号:US20190392292A1
公开(公告)日:2019-12-26
申请号:US16446649
申请日:2019-06-20
Applicant: Telefonica, S.A
Inventor: Alberto Mozo Velasco , Sandra Gómez Canaval , Antonio Pastor Perales , Diego R. Lopez
IPC: G06N3/04
Abstract: A system and method for optimizing event prediction in data systems, wherein at least one source (100) comprises: a data collector periodically collecting (101) real data values (300) to generate a stream of data modeled as a time series; a generator (110) of prediction models (M1, M2, M3, . . . , Mx) to which the collected values from the data collector are input; a first forecast module (120) receiving (102) one of the generated prediction models (M1, M2, M3, . . . , Mx) for generating a predicted value (310) and computing a committed error (320) by comparing the predicted value (310) with the real data value (300); and wherein the source (100) sends (105) the committed error (320) within the time series to the destination (200) only if the committed error (320) exceeds a threshold and wherein the destination (200) comprises: a second forecast module (210) receiving (204) the same prediction model (M1, M2, M3, . . . , Mx) from the generator (110) through a communication channel (103); a correction module (220) for obtaining (203) the real data value by the generated prediction model (M1, M2, M3, . . . , Mx) and applying the committed error (320) if received (202) from the source (100).
-
公开(公告)号:US11301778B2
公开(公告)日:2022-04-12
申请号:US16359336
申请日:2019-03-20
Applicant: TELEFONICA, S.A.
Inventor: Antonio Pastor Perales , Diego R. Lopez , Alberto Mozo Velasco , Sandra Gomez Canaval
IPC: G06N20/00 , G06N5/04 , H04L43/026 , H04L43/028 , H04L43/067 , H04L43/12 , H04L41/16 , H04L41/14 , H04L47/10
Abstract: A system and method for training and validating ML algorithms in real networks, including: generating synthetic traffic and receiving it along with real traffic; aggregating the received traffic into network flows by using metadata and transforming them to generate a first dataset readable by the ML algorithm, comprising features defined by the metadata; labelling the traffic and selecting a subset of the features from the labelled dataset used in an iterative training to generate a trained model; filtering out a part of real traffic to obtain a second labelled dataset; and selecting a subset of features from the second labelled dataset used for validating the trained model by comparing predicted results for the trained model and the labels; repeating the steps with a different subset of features to generate another trained model until results are positive in terms of precision or accuracy.
-
-