DETECTING ADVERSARIAL EXAMPLES
    1.
    发明申请

    公开(公告)号:US20200250304A1

    公开(公告)日:2020-08-06

    申请号:US16778213

    申请日:2020-01-31

    Abstract: Systems and methods for detecting adversarial examples are provided. The method includes generating encoder direct output by projecting, via an encoder, input data items to a low-dimensional embedding vector of reduced dimensionality with respect to the one or more input data items to form a low-dimensional embedding space. The method includes regularizing the low-dimensional embedding space via a training procedure such that the input data items produce embedding space vectors whose global distribution is expected to follow a simple prior distribution. The method also includes identifying whether each of the input data items is an adversarial or unnatural input. The method further includes classifying, during the training procedure, those input data items which have not been identified as adversarial or unnatural into one of multiple classes.

Patent Agency Ranking