Robustness Aware Norm Decay for Quantization Aware Training and Generalization

    公开(公告)号:US20240347043A1

    公开(公告)日:2024-10-17

    申请号:US18632237

    申请日:2024-04-10

    Applicant: Google LLC

    CPC classification number: G10L15/063

    Abstract: A method includes obtaining a plurality of training samples, determining a minimum integer fixed-bit width representing a maximum quantization of an automatic speech recognition (ASR) model, and training the ASR model on the plurality of training samples using a quantity of random noise. The ASR model includes a plurality of weights that each include a respective float value. The quantity of random noise is based on the minimum integer fixed-bit value. After training the ASR model, the method also includes selecting a target integer fixed-bit width greater than or equal to the minimum integer fixed-bit width, and for each respective weight of the plurality of weights, quantizing the respective weight from the respective float value to a respective integer associated with a value of the selected target integer fixed-bit width. The operations also include providing the quantized trained ASR model to a user device.

Patent Agency Ranking