Invention Publication
EP2770697A8 APPLICATION IDENTIFICATION METHOD, AND DATA MINING METHOD, DEVICE AND SYSTEM (In force)

  • Title: APPLICATION IDENTIFICATION METHOD, AND DATA MINING METHOD, DEVICE AND SYSTEM
  • Title (Chinese): 应用识别方法和数据提取工艺,设备和系统
  • Application No.: EP13801453.5
    Filing date: 2013-07-29
  • Publication No.: EP2770697A8
    Publication date: 2014-12-10
  • Inventors: ZHOU, Wei; TANG, Dong; ZHANG, Hongding
  • Applicant: Huawei Technologies Co., Ltd.
  • Applicant address: Huawei Administration Building, Bantian Longgang District Shenzhen Guangdong 518129 CN
  • Patentee: Huawei Technologies Co., Ltd.
  • Current patentee: Huawei Technologies Co., Ltd.
  • Current patentee address: Huawei Administration Building, Bantian Longgang District Shenzhen Guangdong 518129 CN
  • Agent: Haley, Stephen
  • Priority: CN201215922035 20121231
  • International publication: WO2014101402 20140703
  • Main classification: H04L29/08
  • IPC classification: H04L29/08; G06F17/30
Abstract:
Embodiments of the present invention disclose a data mining method, apparatus, and system. The UBA-based data mining method includes: obtaining to-be-processed data, where the to-be-processed data includes multiple records, and each record includes application information and remote end triplet information that have a correspondence relationship; clustering the records in the to-be-processed data that have the same remote end triplet information and the same application information, and calculating, from those records, a service load amount corresponding to the remote end triplet information and the application information, to obtain a clustering result that includes the remote end triplet information, the application information, and the service load amount in correspondence with one another; selecting from the clustering result, according to the service load amount or a proportion of the service load amount, remote end triplet information and corresponding application information that have high reliability; and sending the selected remote end triplet information and application information to a DPI subsystem. DPI-based identification performance and the application identification rate can thereby be improved.
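The clustering and selection steps of the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. Several details are assumptions made only for illustration: the remote end triplet is modeled as an (IP address, port, transport protocol) tuple, the service load amount is modeled as a byte count, the proportion is taken relative to the total load observed for the same triplet, and the names `cluster_records`, `select_reliable`, `min_load`, and `min_share` are hypothetical.

```python
from collections import defaultdict

# Assumed record shape: (remote_triplet, application, load), where
# remote_triplet = (ip, port, protocol) and load is a byte count.

def cluster_records(records):
    """Sum the load of all records sharing the same (triplet, application)."""
    loads = defaultdict(int)
    for triplet, app, load in records:
        loads[(triplet, app)] += load
    return dict(loads)

def select_reliable(clusters, min_load=0, min_share=0.0):
    """Keep (triplet, application) pairs whose load and whose share of the
    triplet's total load both reach the given (hypothetical) thresholds."""
    totals = defaultdict(int)
    for (triplet, _app), load in clusters.items():
        totals[triplet] += load
    selected = []
    for (triplet, app), load in clusters.items():
        share = load / totals[triplet] if totals[triplet] else 0.0
        if load >= min_load and share >= min_share:
            selected.append((triplet, app))
    return sorted(selected)

records = [
    (("203.0.113.10", 443, "TCP"), "AppA", 900),
    (("203.0.113.10", 443, "TCP"), "AppA", 600),
    (("203.0.113.10", 443, "TCP"), "AppB", 100),
    (("198.51.100.7", 80, "TCP"), "AppC", 50),
]

clusters = cluster_records(records)
# Pairs that would be sent to the DPI subsystem in this sketch.
reliable = select_reliable(clusters, min_load=200, min_share=0.9)
print(reliable)
```

Setting `min_load` or `min_share` to zero disables that criterion, which mirrors the abstract's wording that selection may rely on the service load amount or on its proportion.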