Modular Training for Flexible Attention Based End-to-End ASR

    Publication number: US20240185839A1

    Publication date: 2024-06-06

    Application number: US18526148

    Filing date: 2023-12-01

    Applicant: Google LLC

    CPC classification number: G10L15/063 G10L2015/0635

    Abstract: A method for training a modular neural network model includes, during an initial training stage, training only a backbone model to provide a first model configuration of the modular neural network model. The first model configuration includes only the trained backbone model. The method also includes adding an intrinsic sub-model to the trained backbone model. During a fine-tuning training stage, the method includes freezing the parameters of the trained backbone model and fine-tuning the parameters of the added intrinsic sub-model while the backbone parameters remain frozen. This provides a second model configuration that includes the backbone model trained during the initial training stage and the intrinsic sub-model whose parameters were fine-tuned during the fine-tuning training stage.
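
    The two-stage procedure in the abstract can be illustrated with a minimal sketch. It is not the patented ASR system: it assumes a toy scalar regression task, a hypothetical linear "backbone" (w, b), a hypothetical one-parameter "sub-model" (s) added on top of the backbone output, and finite-difference gradients in place of backpropagation. All names (backbone, with_sub_model, fit, config1, config2) are invented for illustration.

```python
def backbone(p, x):
    # Hypothetical backbone: a linear model with parameters w and b.
    return p["w"] * x + p["b"]

def with_sub_model(p, x):
    # Hypothetical sub-model: a quadratic term added to the backbone output.
    return backbone(p, x) + p["s"] * x * x

def mse(predict, p, data):
    # Mean squared error of `predict` over (x, y) pairs.
    return sum((predict(p, x) - y) ** 2 for x, y in data) / len(data)

def fit(predict, p, trainable, data, lr=0.05, steps=500, eps=1e-6):
    # Gradient descent that updates only the parameters named in `trainable`;
    # every other parameter is frozen. Gradients are central finite differences.
    p = dict(p)
    for _ in range(steps):
        grads = {}
        for name in trainable:
            hi = dict(p)
            hi[name] += eps
            lo = dict(p)
            lo[name] -= eps
            grads[name] = (mse(predict, hi, data) - mse(predict, lo, data)) / (2 * eps)
        for name in trainable:
            p[name] -= lr * grads[name]
    return p

# Toy data (an assumption): y = 2x + 1 + 0.5x^2 for x in [-1, 1].
xs = [k / 10.0 for k in range(-10, 11)]
data = [(x, 2.0 * x + 1.0 + 0.5 * x * x) for x in xs]

# Initial training stage: train only the backbone (first configuration).
config1 = fit(backbone, {"w": 0.0, "b": 0.0}, ["w", "b"], data)

# Add the sub-model, freeze the backbone, and fine-tune only the sub-model
# parameter (second configuration).
config2 = fit(with_sub_model, {**config1, "s": 0.0}, ["s"], data)

print("backbone only loss:", mse(backbone, config1, data))
print("backbone + sub-model loss:", mse(with_sub_model, config2, data))
```

    Because w and b are never listed as trainable in the second call, the backbone inside config2 is identical to config1, so both configurations can be served from one set of backbone weights. That is the point of freezing in this sketch.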
