Invention publication
EP2770697A8 APPLICATION IDENTIFICATION METHOD, AND DATA MINING METHOD, DEVICE AND SYSTEM
In force
- Patent title: APPLICATION IDENTIFICATION METHOD, AND DATA MINING METHOD, DEVICE AND SYSTEM
- Patent title (Chinese): 应用识别方法和数据提取工艺,设备和系统
- Application number: EP13801453.5
- Filing date: 2013-07-29
- Publication number: EP2770697A8
- Publication date: 2014-12-10
- Inventors: ZHOU, Wei; TANG, Dong; ZHANG, Hongding
- Applicant: Huawei Technologies Co., Ltd.
- Applicant address: Huawei Administration Building, Bantian Longgang District Shenzhen Guangdong 518129 CN
- Patentee: Huawei Technologies Co., Ltd.
- Current patentee: Huawei Technologies Co., Ltd.
- Current patentee address: Huawei Administration Building, Bantian Longgang District Shenzhen Guangdong 518129 CN
- Representative: Haley, Stephen
- Priority: CN201215922035 20121231
- International publication: WO2014101402 20140703
- Main classification: H04L29/08
- IPC classification: H04L29/08; G06F17/30
Abstract:
Embodiments of the present invention disclose a UBA-based data mining method, apparatus, and system. The method includes: obtaining to-be-processed data that includes multiple records, where each record includes application information and the remote end triplet information that corresponds to it; clustering the records that share the same remote end triplet information and the same application information, and calculating from those records a service load amount for that pair, to obtain a clustering result in which remote end triplet information, application information, and service load amount correspond to one another; selecting from the clustering result, according to the service load amount or a proportion of the service load amount, the pairs of remote end triplet information and application information that have high reliability; and sending the selected pairs to a DPI subsystem. This can improve DPI-based identification performance and the application identification rate.
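The clustering and selection steps in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the claimed implementation. Several details are assumptions that the abstract does not state: the remote end triplet is modeled as (IP address, port, transport protocol), the service load amount is measured in bytes, the "proportion" is taken to be an application's share of the total load seen on its triplet, and the `min_share` threshold of 0.8 is hypothetical. The names `Record` and `mine_reliable_mappings` are likewise made up for the example.

```python
from collections import defaultdict
from typing import NamedTuple


class Record(NamedTuple):
    # Assumed shape of one to-be-processed record (not defined in the abstract).
    remote_ip: str
    remote_port: int
    protocol: str
    application: str
    bytes_transferred: int  # assumed unit for "service load amount"


def mine_reliable_mappings(records, min_share=0.8):
    """Return (triplet, application, load) entries whose load share is high.

    min_share is a hypothetical reliability threshold.
    """
    # Clustering step: sum the load of records that share the same
    # remote end triplet and the same application information.
    load = defaultdict(int)
    for r in records:
        triplet = (r.remote_ip, r.remote_port, r.protocol)
        load[(triplet, r.application)] += r.bytes_transferred

    # Total load per triplet, needed to compute each application's proportion.
    triplet_total = defaultdict(int)
    for (triplet, _app), amount in load.items():
        triplet_total[triplet] += amount

    # Selection step: keep pairs whose share of the triplet's load
    # reaches the threshold.
    reliable = []
    for (triplet, app), amount in load.items():
        total = triplet_total[triplet]
        if total > 0 and amount / total >= min_share:
            reliable.append((triplet, app, amount))
    return sorted(reliable)


if __name__ == "__main__":
    sample = [
        Record("203.0.113.10", 443, "TCP", "video", 900),
        Record("203.0.113.10", 443, "TCP", "video", 600),
        Record("203.0.113.10", 443, "TCP", "chat", 100),
        Record("198.51.100.7", 80, "TCP", "web", 50),
        Record("198.51.100.7", 80, "TCP", "mail", 50),
    ]
    for entry in mine_reliable_mappings(sample):
        print(entry)
```

In this sketch the entries returned by `mine_reliable_mappings` stand in for what would be sent to the DPI subsystem. A triplet whose traffic is split evenly between applications yields no entry, because no single application dominates its load.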