SMART TEXT PARTITIONING FOR DETECTING SENSITIVE INFORMATION

    公开(公告)号:US20230222288A1

    公开(公告)日:2023-07-13

    申请号:US17571761

    申请日:2022-01-10

    IPC分类号: G06F40/205 G06F40/284

    CPC分类号: G06F40/205 G06F40/284

    摘要: Systems for partitioning text are disclosed. The system can receive a text string. A delimiter can be identified based on the text string. Based on identifying the delimiter, a character sequence to the left and/or right of the delimiter can be identified. The identification can occur up to a predetermined number/length of characters. Using a trained model, the system can determine whether the character sequence indicates the delimiter is part of a continuous string of text. Based on determining whether or not the delimiter is part of the continuous string of text, the system can generate a token representing the continuous string of text or the delimiter.

    SECURE AND PRIVACY AWARE MONITORING WITH DYNAMIC RESILIENCY FOR DISTRIBUTED SYSTEMS

    公开(公告)号:US20230006908A1

    公开(公告)日:2023-01-05

    申请号:US17364344

    申请日:2021-06-30

    IPC分类号: H04L12/26 H04L29/06

    摘要: Provided herein are systems and methods for sanitizing logged data packets in a distributed system prior to storing them in a remote or third-party data server. Interactions with an application are monitored and values in a data packet are extracted from the interaction. The values are classified based on a classification configuration and respective labels of the values. The values are then sanitized based on the classification to prevent exposure of secure or private data. The sanitized data packets are then logged into the remote data server. The logged data can be used to help resolve events occurring in the application. The classification configuration can be iteratively updated and the interactions repeated to capture data that was previously sanitized to aid in resolution of events. The logged data can also be used in research or analysis, such as for identifying potential improvements to the application.

    SECURE AND PRIVACY AWARE MONITORING WITH DYNAMIC RESILIENCY FOR DISTRIBUTED SYSTEMS

    公开(公告)号:US20230275826A1

    公开(公告)日:2023-08-31

    申请号:US18143258

    申请日:2023-05-04

    IPC分类号: H04L43/10 H04L43/50 H04L9/40

    摘要: Provided herein are systems and methods for sanitizing logged data packets in a distributed system prior to storing them in a remote or third-party data server. Interactions with an application are monitored and values in a data packet are extracted from the interaction. The values are classified based on a classification configuration and respective labels of the values. The values are then sanitized based on the classification to prevent exposure of secure or private data. The sanitized data packets are then logged into the remote data server. The logged data can be used to help resolve events occurring in the application. The classification configuration can be iteratively updated and the interactions repeated to capture data that was previously sanitized to aid in resolution of events. The logged data can also be used in research or analysis, such as for identifying potential improvements to the application.

    Multi-Modal Regression to Predict Customer Intent to Contact a Merchant

    公开(公告)号:US20220414684A1

    公开(公告)日:2022-12-29

    申请号:US17356084

    申请日:2021-06-23

    摘要: Provided herein are systems and methods for using multi-modal regression to predict customer intent to contact a merchant. Multi-modal data including numerical data and unstructured data are extracted from customer interactions with the merchant. Features of the numerical data and the unstructured data are separately extracted and classified using techniques specific to the data types. The features for each type are then separately used to predict probabilities of customer intent. A neural network is used to combine the predictions into a single set of estimates of customer intent. This set of estimates of customer intents is used to estimate a probability that the customer will contact the merchant. The customer is then contacted based on the estimate.

    SENSITIVE DATA DETECTION USING DOMAIN-ENHANCED ATTENTION NEURAL NETWORKS

    公开(公告)号:US20240232695A1

    公开(公告)日:2024-07-11

    申请号:US18094644

    申请日:2023-01-09

    IPC分类号: G06N20/00

    CPC分类号: G06N20/00

    摘要: Provided herein are systems and methods for detecting and sanitizing sensitive data using domain-enhanced attention neural networks. In some embodiments, a processor of a client retrieves training data comprising tuples. Each tuple comprises a first parameter and a second parameter. For each tuple the processor matches a substring in a first parameter of the respective tuple to a keyword of a plurality of keywords, identifies a security category corresponding to the at least one keyword, and expands the first parameter of the respective tuple to comprise a respective string associated with the application event and the security category. The processor trains a model to detect and sanitize the sensitive data from the application events using the tuples, including an expanded first parameter for each tuple. The processor sanitizes the sensitive data using the trained model.