EFFICIENT UNSUPERVISED ANOMALY DETECTION ON HOMOMORPHICALLY ENCRYPTED DATA

    公开(公告)号:US20210092137A1

    公开(公告)日:2021-03-25

    申请号:US16577038

    申请日:2019-09-20

    摘要: Aspects of the present disclosure describe techniques for detecting anomalous data in an encrypted data set. An example method generally includes receiving a data set of encrypted data points. A tree data structure having a number of levels is generated for the data set. Each level of the tree data structure generally corresponds to a feature of the encrypted plurality of features, and each node in the tree data structure at a given level represents a probability distribution of a likelihood that each data point is less than or greater than a split value determined for a given feature. An encrypted data point is received for analysis, and anomaly score is calculated based on a probability identified for each of the plurality of encrypted features. Based on determining that the calculated anomaly score exceeds a threshold value, the encrypted data point is identified as potentially anomalous.

    MULTI-PHASE PRIVACY-PRESERVING INFERENCING IN A HIGH VOLUME DATA ENVIRONMENT

    公开(公告)号:US20220374904A1

    公开(公告)日:2022-11-24

    申请号:US17315409

    申请日:2021-05-10

    IPC分类号: G06Q20/40 G06N20/00 G06N5/04

    摘要: A method, apparatus and computer program product that provides multi-phase privacy-preserving inferencing in a high throughput data environment, e.g., to facilitate fraud prediction, detection and prevention. In one embodiment, two (2) machine learning models are used, a first model that is trained in the clear on first transaction data, and a second model that is trained in the clear but on the first transaction data, and user data. The first model is used to perform inferencing in the clear on the high throughput received data. In this manner, the first model provides a first level evaluation of whether a particular transaction might be fraudulent. If a transaction is flagged in this first phase, a second more secure inference is then carried out using the second model. The inferencing performed by the second model is done on homomorphically encrypted data. Thus, only those transactions marked by the first model are passed to the second model for secure evaluation.

    PRIVACY-ENHANCED DECISION TREE-BASED INFERENCE ON HOMOMORPHICALLY-ENCRYPTED DATA

    公开(公告)号:US20210376995A1

    公开(公告)日:2021-12-02

    申请号:US16884567

    申请日:2020-05-27

    摘要: A technique for computationally-efficient privacy-preserving homomorphic inferencing against a decision tree. Inferencing is carried out by a server against encrypted data points provided by a client. Fully homomorphic computation is enabled with respect to the decision tree by intelligently configuring the tree and the real number-valued features that are applied to the tree. To that end, and to the extent the decision tree is unbalanced, the server first balances the tree. A cryptographic packing scheme is then applied to the balanced decision tree and, in particular, to one or more entries in at least one of: an encrypted feature set, and a threshold data set, that are to be used during the decision tree evaluation process. Upon receipt of an encrypted data point, homomorphic inferencing on the configured decision tree is performed using a highly-accurate approximation comparator, which implements a “soft” membership recursive computation on real numbers, all in an oblivious manner.