Securing machine learning models against adversarial samples through backdoor misclassification
Abstract:
A method for securing a genuine machine learning model against adversarial samples includes the steps of attaching a trigger to a sample to be classified and classifying the sample with the trigger attached using a backdoored model that has been backdoored using the trigger. In a further step, it is determined whether an output of the backdoored model is the same as a backdoor class of the backdoored model, and/or an outlier detection method is applied to the resulting logits, comparing them against honest logits that were computed using a genuine sample. These steps are repeated using different triggers and the backdoored models respectively associated therewith. The number of times that an output of the backdoored models is not the same as the respective backdoor class, and/or a difference determined by applying the outlier detection method, is then compared against one or more thresholds so as to determine whether the sample is adversarial.
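The following is a minimal illustrative sketch of one way the claimed detection loop could be realized, not the patented implementation itself. All names are hypothetical: each backdoored model is assumed to be represented by a trigger-attachment function, a logit-prediction function, and its backdoor class, and a plain Euclidean distance stands in for the unspecified outlier detection method.

    # Hypothetical sketch of the detection procedure described in the abstract.
    import numpy as np

    def is_adversarial(sample, backdoored_models, honest_logits,
                       mismatch_threshold=1, outlier_threshold=5.0):
        """Flag `sample` as adversarial based on repeated trigger checks.

        backdoored_models: list of (apply_trigger, predict_logits, backdoor_class)
                           tuples, one entry per trigger/backdoored model pair.
        honest_logits:     logit vectors previously computed on a genuine sample
                           with the same triggers attached (assumed available).
        """
        mismatches = 0
        outlier_scores = []
        for (apply_trigger, predict_logits, backdoor_class), ref in zip(
                backdoored_models, honest_logits):
            # Attach the trigger associated with this backdoored model.
            triggered = apply_trigger(sample)
            # Classify the triggered sample with the backdoored model.
            logits = predict_logits(triggered)
            # Check whether the output matches the model's backdoor class.
            if int(np.argmax(logits)) != backdoor_class:
                mismatches += 1
            # Outlier score of the logits relative to the honest logits
            # (Euclidean distance used here purely as a stand-in).
            outlier_scores.append(float(np.linalg.norm(np.asarray(logits) - np.asarray(ref))))
        # Compare the mismatch count and/or the outlier scores to thresholds.
        return (mismatches >= mismatch_threshold
                or max(outlier_scores) > outlier_threshold)

Under this reading, a genuine sample tends to be pushed into the backdoor class by every trigger, so a high mismatch count or an unusually large logit deviation indicates an adversarial sample.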