SEMI-SUPERVISED MACHINE LEARNING MODEL FRAMEWORK FOR UNLABELED LEARNING

    公开(公告)号:US20240346322A1

    公开(公告)日:2024-10-17

    申请号:US18328514

    申请日:2023-06-02

    申请人: Paypal, Inc.

    IPC分类号: G06N3/0895

    CPC分类号: G06N3/0895

    摘要: Methods and systems are presented for providing a semi-supervised machine learning framework for training a machine learning model using partly mislabeled training data sets. Using the semi-supervised machine learning framework, an iterative training process is performed on the machine learning model, wherein the training data is being adjusted continuously in each iteration for training the machine learning model. During each iteration, the machine learning model is evaluated based on its ability to identify training data that has been mislabeled. The labeling of identified mislabeled training data is corrected before feeding back to the machine learning model in the next training iteration.

    FINDING SEMANTICALLY RELATED SECURITY INFORMATION

    公开(公告)号:US20240330446A1

    公开(公告)日:2024-10-03

    申请号:US18335064

    申请日:2023-06-14

    IPC分类号: G06F21/55 G06N3/0895

    摘要: Methods and apparatuses for improving the performance and energy efficiency of machine learning systems that generate security specific machine learning models and generate security related information using security specific machine learning models are described. A security specific machine learning model may comprise a security specific large language model (LLM). The security specific LLM may be trained and deployed to generate semantically related security information. The security specific LLM may be pretrained with a security specific data set that was generated using similarity deduplication and long line handling, and with security specific objectives, such as next log line prediction based on host, system, application, and cyber attacker behavior. The security specific large language model may be fine-tuned using a security specific similarity dataset that may be generated to align the security specific LLM to capture similarity between different security events.

    AUTOMATIC AVATAR GENERATION USING SEMI-SUPERVISED MACHINE LEARNING

    公开(公告)号:US20240290022A1

    公开(公告)日:2024-08-29

    申请号:US18176267

    申请日:2023-02-28

    申请人: ADOBE INC.

    摘要: Avatar generation from an image is performed using semi-supervised machine learning. An image space model undergoes unsupervised training from images to generate latent image vectors responsive to image inputs. An avatar parameter space model undergoes unsupervised training from avatar parameter values for avatar parameters to generate latent avatar parameter vectors responsive to avatar parameter value inputs. A cross-modal mapping model undergoes supervised training on image-avatar parameter pair inputs corresponding to the latent image vectors and the latent avatar parameter vectors. The trained image space model generates a latent image vector from an image input. The trained cross-modal mapping model translates the latent image vector to a latent avatar parameter vector. The trained avatar parameter space model generates avatar parameter values from the latent avatar parameter vector. The latent avatar parameter vector can be used to render an avatar having features corresponding to the input image.

    SEMI-SUPERVISED FRAMEWORK FOR EFFICIENT TIME-SERIES ORDINAL CLASSIFICATION

    公开(公告)号:US20240232638A1

    公开(公告)日:2024-07-11

    申请号:US18545042

    申请日:2023-12-19

    IPC分类号: G06N3/0895 G06N3/0442

    CPC分类号: G06N3/0895 G06N3/0442

    摘要: A computer-implemented method for ordinal prediction is provided. The method includes encoding time series data with a temporal encoder to obtain latent space representations. The method includes optimizing the temporal encoder using semi-supervised learning to distinguish different classes in the labeled space using labeled data, and augment the latent space representations using unlabeled training data, to obtain semi-supervised representations. The method further includes discarding a linear layer after the temporal encoder and fixing the temporal encoder. The method also includes training k-1 binary classifiers on top of the semi-supervised representations to obtain k-1 binary predictions. The method additionally includes identifying and correcting inconsistent ones of the k-1 binary predictions by matching the inconsistent ones to consistent ones of the k-1 binary predictions. The method further includes aggregating the k-1 binary predictions to obtain an ordinal prediction.

    SYSTEMS AND METHODS FOR INSURANCE PROCESSES
    7.
    发明公开

    公开(公告)号:US20240212055A1

    公开(公告)日:2024-06-27

    申请号:US18239737

    申请日:2023-08-29

    申请人: SAFB Inc.

    摘要: Disclosed herein are methods and systems related to verifying insurance. An example method may comprise receiving user information associated with a user. The example method may comprise receiving a vehicle identification number (VIN) associated with a vehicle. The example method may comprise transmitting the received VIN to a VIN database. The example method may comprise receiving information about the vehicle from the VIN database. The example method may comprise creating a customized page for the user accessible by a link. The example method may comprise transmitting the link to the user. The example method may comprise receiving a selected insurance carrier associated with the user via the page accessed by the link. The example method may comprise transmitting the user information to a database associated with the selected insurance carrier. The example method may comprise receiving verification of insurance for the user from the database associated with the insurance carrier.

    METHOD AND APPARATUS WITH MACHINE LEARNING MODEL

    公开(公告)号:US20240144021A1

    公开(公告)日:2024-05-02

    申请号:US18341892

    申请日:2023-06-27

    IPC分类号: G06N3/0895

    CPC分类号: G06N3/0895

    摘要: An apparatus includes: one or more processors configured to: randomly split a training data set into a first training data set comprising a first label assigned to first data and a second training data set comprising a second label assigned to second data; train a first neural network using a semi-supervised learning scheme based on the first training data set comprising the first label, and an unlabeled second training data set; and train a second neural network using the semi-supervised learning scheme based on the second training data set comprising the second label, and an unlabeled first training data set.