SYSTEMS AND METHODS FOR GENERATING MACHINE LEARNING-BASED CLASSIFIERS FOR DETECTING SPECIFIC CATEGORIES OF SENSITIVE INFORMATION
    12.
    发明申请
    SYSTEMS AND METHODS FOR GENERATING MACHINE LEARNING-BASED CLASSIFIERS FOR DETECTING SPECIFIC CATEGORIES OF SENSITIVE INFORMATION 有权
    用于生成机器学习型分类器的系统和方法,用于检测特定信息类型的敏感信息

    公开(公告)号:US20120303558A1

    公开(公告)日:2012-11-29

    申请号:US13191018

    申请日:2011-07-26

    Applicant: Sumesh Jaiswal

    Inventor: Sumesh Jaiswal

    CPC classification number: G06N99/005

    Abstract: A computer-implemented method may include (1) identifying a plurality of specific categories of sensitive information to be protected by a DLP system, (2) obtaining a training data set for each specific category of sensitive information that includes a plurality of positive and a plurality of negative examples of the specific category of sensitive information, (3) using machine learning to train, based on an analysis of the training data sets, at least one machine learning-based classifier that is capable of detecting items of data that contain one or more of the plurality of specific categories of sensitive information, and then (4) deploying the machine learning-based classifier within the DLP system to enable the DLP system to detect and protect items of data that contain one or more of the plurality of specific categories of sensitive information in accordance with at least one DLP policy of the DLP system.

    Abstract translation: 计算机实现的方法可以包括(1)识别要由DLP系统保护的多个特定类别的敏感信息,(2)获得针对每个特定类别的敏感信息的训练数据集,其包括多个正的和 多个敏感信息的特定类别的负面例子,(3)使用机器学习训练,基于训练数据集的分析,至少一个基于机器学习的分类器能够检测包含一个 或多个特定类别的敏感信息,然后(4)在DLP系统内部署基于机器学习的分类器,以使DLP系统能够检测和保护包含多个特定类别中的一个或多个的数据项 根据DLP系统的至少一个DLP策略的敏感信息类别。

Patent Agency Ranking