FLEXIBLE MACHINE LEARNING MODEL COMPRESSION

    Publication No.: US20250148357A1

    Publication Date: 2025-05-08

    Application No.: US18504016

    Application Date: 2023-11-07

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for compressing a machine learning model having a plurality of parameters. In one aspect, one of the methods includes obtaining trained values of a set of parameters for at least a portion of a machine learning model; identifying one or more dense ranges for the trained values; determining a least number of bits required to represent each trained value within the one or more dense ranges; identifying a second format having a range that is smaller than a range of the first format; and generating a compressed version of the at least a portion of the machine learning model.
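
    Illustrative sketch (not taken from the patent): the steps named in the abstract could look roughly like the Python below. The abstract does not say how a dense range is found or how the bit count is derived, so everything here is an assumption made for illustration: a single dense range chosen as the narrowest interval covering a `coverage` fraction of the values, a fixed quantization `step`, and the hypothetical helper names `dense_range`, `min_bits`, `compress`, and `decompress`. Values outside the dense range are simply kept unchanged, standing in for the wider first format.

```python
def dense_range(values, coverage=0.9):
    """Assumed heuristic: narrowest interval holding `coverage` of the values."""
    ordered = sorted(values)
    n = len(ordered)
    k = max(1, -(-int(coverage * n * 1000) // 1000))  # ceil, tolerant of float noise
    best = (ordered[0], ordered[k - 1])
    for i in range(1, n - k + 1):
        lo, hi = ordered[i], ordered[i + k - 1]
        if hi - lo < best[1] - best[0]:
            best = (lo, hi)
    return best


def min_bits(lo, hi, step):
    """Fewest bits able to index every `step`-sized level in [lo, hi]."""
    max_code = round((hi - lo) / step)      # largest code that compress() can emit
    return max(1, max_code.bit_length())    # bits needed for codes 0..max_code


def compress(values, coverage=0.9, step=0.01):
    lo, hi = dense_range(values, coverage)
    codes, outliers = {}, {}
    for i, v in enumerate(values):
        if lo <= v <= hi:
            codes[i] = round((v - lo) / step)   # small integer: the narrower format
        else:
            outliers[i] = v                     # left as-is: the wider format
    # Codes stay as Python ints here; a real encoder would pack them at `bits` each.
    return {"n": len(values), "lo": lo, "step": step,
            "bits": min_bits(lo, hi, step), "codes": codes, "outliers": outliers}


def decompress(packed):
    out = [0.0] * packed["n"]
    for i, c in packed["codes"].items():
        out[i] = packed["lo"] + c * packed["step"]
    for i, v in packed["outliers"].items():
        out[i] = v
    return out


# Made-up parameter values: nine clustered weights and one far outlier.
values = [0.10, 0.12, 0.11, 0.13, 0.09, 0.10, 0.12, 0.11, 0.10, 5.0]
packed = compress(values, coverage=0.9, step=0.01)
restored = decompress(packed)
```

    In this toy input the clustered values fit in a few bits each, while the single outlier would otherwise force a format wide enough to reach 5.0. The trade-off in the sketch is that in-range values are only recovered to within half a `step`.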
