METHOD AND DEVICE FOR RECOGNIZING SPAM SHORT MESSAGES
    Invention application
    Status: under examination (published)

    Publication number: US20160232452A1

    Publication date: 2016-08-11

    Application number: US15022604

    Filing date: 2014-06-24

    CPC classification number: G06N7/005 G06F16/353 G06F17/2705 H04L51/12

    Abstract: Provided are a method and device for recognizing spam short messages. In the method, a first feature word set is obtained from a spam short message sample set, together with a first conditional probability for each feature word in that set; a second feature word set is obtained from a non-spam short message sample set, together with a second conditional probability for each feature word in that set; and a spam short message set is recognized from a short message set to be processed according to the number of words contained in each short message in that set, the number of times each short message is repeated in that set, the first feature word set, the second feature word set, the first conditional probability and the second conditional probability.
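The abstract names the inputs to the recognition step but does not say how they are combined. The following is a minimal Python sketch of one plausible reading, not the claimed method. It assumes that feature words are the most frequent words in each sample set, that conditional probabilities use add-one smoothing, that word count and repetition count act as pre-filters, and that the final decision is a naive-Bayes-style log-likelihood ratio. The function names, the thresholds `min_words` and `min_repeats`, the `floor` probability, and whitespace tokenization are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def conditional_probabilities(samples, top_k=1000):
    """Return {feature_word: P(word | class)} for one sample set.

    Assumptions (not from the abstract): the top_k most frequent words
    form the feature word set, and add-one smoothing is applied.
    """
    counts = Counter(word for msg in samples for word in msg.split())
    features = dict(counts.most_common(top_k))
    total = sum(features.values())
    return {w: (c + 1) / (total + len(features)) for w, c in features.items()}


def recognize_spam(messages, spam_probs, ham_probs,
                   min_words=3, min_repeats=2, floor=1e-6):
    """Return the set of distinct messages judged to be spam.

    min_words, min_repeats and floor are hypothetical parameters: a message
    is scored only if it is long enough and repeated often enough, and a
    feature word missing from one class gets the floor probability there.
    """
    repeats = Counter(messages)  # repetition count of each distinct message
    spam = set()
    for msg, n in repeats.items():
        words = msg.split()
        if len(words) < min_words or n < min_repeats:
            continue  # too short or too rare to be treated as a candidate
        # Sum of log-likelihood ratios over words found in either feature set.
        score = sum(
            math.log(spam_probs.get(w, floor)) - math.log(ham_probs.get(w, floor))
            for w in words
            if w in spam_probs or w in ham_probs
        )
        if score > 0:
            spam.add(msg)
    return spam
```

Filtering on repetition count before scoring reflects the intuition that bulk spam is sent many times unchanged, but whether the method uses the counts as filters, as weights, or in some other way cannot be determined from the abstract alone. Text in languages without whitespace word boundaries would need a proper word segmenter in place of `split()`.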

