LEARNING TOKEN IMPORTANCE USING MULTI-MODEL STOCHASTIC SPARSITY INDUCING REGULARIZATION

    公开(公告)号:US20240330762A1

    公开(公告)日:2024-10-03

    申请号:US18293638

    申请日:2021-09-03

    Applicant: Google LLC

    CPC classification number: G06N20/00

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for improving the representation of items of a vocabulary in an embedding space for use in machine learning models. An embedding matrix is generated wherein each row in the embedding matrix is a vector of elements and corresponds to an item of a vocabulary. A score is assigned to each vector in the embedding matrix indicating a probability of its corresponding vector being used in the machine learning model. The scores are iteratively updated by sampling a proper subset of vectors and updating the elements of each respective vector in the proper subset of vectors based on the respective scores of vectors. The score of each vector are then updated based on a loss function of the machine learning model. The embedding matrix is then re-structured based on the updated scores of the vectors.

    Portion-Specific Model Compression for Optimization of Machine-Learned Models

    公开(公告)号:US20240232686A1

    公开(公告)日:2024-07-11

    申请号:US18012292

    申请日:2022-07-29

    Applicant: Google LLC

    CPC classification number: G06N20/00

    Abstract: Systems and methods of the present disclosure are directed to portion-specific compression and optimization of machine-learned models. For example, a method for portion-specific compression and optimization of machine-learned models includes obtaining data descriptive of one or more respective sets of compression schemes for one or more model portions of a plurality of model portions of a machine-learned model. The method includes evaluating a cost function to respectively select one or more candidate compression schemes from the one or more sets of compression schemes. The method includes respectively applying the one or more candidate compression schemes to the one or more model portions to obtain a compressed machine-learned model comprising one or more compressed model portions that correspond to the one or more model portions.

Patent Agency Ranking