Systems and methods for expanding data classification using synthetic data generation in machine learning models

    公开(公告)号:US11562252B2

    公开(公告)日:2023-01-24

    申请号:US16946426

    申请日:2020-06-22

    IPC分类号: G06N3/08 G06N3/04

    摘要: Systems and methods for classifying data are disclosed. For example, a system may include at least one memory storing instructions and at least one processor configured to execute the instructions to perform operations. The operations may include receiving training data comprising a class. The operations may include training a data classification model using the training data to generate a trained data classification model. The operations may include receiving additional data comprising labeled samples of an additional class not contained in the training data. The operations may include creating a synthetic data generator. The operations may include training the synthetic data generator to generate synthetic data corresponding to the additional class. The operations may include generating a synthetic classified dataset comprising the additional class. The operations may include retraining the trained data classification model using the synthetic classified dataset.

    Representing confidence in natural language processing

    公开(公告)号:US11501074B2

    公开(公告)日:2022-11-15

    申请号:US17004728

    申请日:2020-08-27

    IPC分类号: G06F40/295

    摘要: Methods, systems, and computing devices for visualizing natural language processing algorithm processes are described herein. A plurality of categories may be determined. Each color of a plurality of colors may correspond to the categories. Text content may be processed using a natural language processing algorithm. Confidence values indicating, for each of a plurality of portions of the text content, a degree of confidence corresponding to one or more of the plurality of categories may be determined. Display colors may be determined based on the confidence values. A user interface comprising a visualization of the text content may be displayed, and the user interface may be configured to show each portion of the text content using a display color such that the user interface indicates changes in confidence across the plurality of characters.

    SYSTEMS AND METHODS FOR DYNAMICALLY CONCEALING SENSITIVE INFORMATION

    公开(公告)号:US20220353467A1

    公开(公告)日:2022-11-03

    申请号:US17868948

    申请日:2022-07-20

    IPC分类号: H04N7/15 H04L9/40

    摘要: Systems and methods for dynamically concealing sensitive information in a shared screen session of a video conference are disclosed. The system may establish communication with one or more computing devices active in a video conference in which each computing device may switch between a screen share mode and a video mode. The system may determine that one or more articles of sensitive information are visible in a graphical user interface associated with a first computing device of the plurality of computing devices. The system may receive a first signal from the first computing device that indicates a first intent of a host associated with the first computing device to switch the screen share mode which includes sharing the first graphical user interface with the one or more computing devices during the video conference. In response to the first signal, the system may execute one or more privacy actions.

    Systems and methods for identifying ordered sequence data

    公开(公告)号:US11461403B2

    公开(公告)日:2022-10-04

    申请号:US16920870

    申请日:2020-07-06

    摘要: A system includes one or more processors configured to execute the instructions to perform a method for determining the ordered sequence. In the method, a dataset is retrieved from a database. The dataset comprises a data matrix comprising a plurality of elements or cells arranged in a set of rows and columns. The dataset is partitioned into a plurality of frames comprising a first subset of the set of rows and columns, the plurality of frames being in a sequential order. A machine learning algorithm to the dataset to predict contents of a next frame in the sequential order. Comparing the predicted contents of the next frame with actual contents of the next frame to determine a prediction accuracy value, and if the prediction accuracy value of the predicted contents exceeds a first threshold level, storing the predicted contents of the next frame.

    Communication Analysis for Financial Transaction Tracking

    公开(公告)号:US20220253951A1

    公开(公告)日:2022-08-11

    申请号:US17173909

    申请日:2021-02-11

    摘要: Methods, systems, and apparatuses for correlating electronic communications related to financial transactions. A computing device may receive a first communication related to an update to a past financial transaction. The computing device may identify a second communication by querying, based on the first communication, a communications database. The first communication and second communication may be correlated using one or more natural language processing algorithms. Based on correlating the first communication and second communication, the computing device may identify a portion of the second communication corresponding to the at least one good or service of the past financial transaction by processing, using the one or more natural language processing algorithms, the second communication. The computing device may then cause output of data indicating a correlation between the first communication and the second communication, and the indication of the change to the at least one good or service.