MULTI-LINGUAL AUTOMATIC SPEECH RECOGNITION FOR CONVERSATIONAL AI SYSTEMS AND APPLICATIONS

    公开(公告)号:US20250022457A1

    公开(公告)日:2025-01-16

    申请号:US18349716

    申请日:2023-07-10

    Abstract: Disclosed are systems and techniques for training machine learning models. The techniques include generating, using a first automatic speech recognition (ASR) model, a first text output based on a vector representation of a first speech data and generating, using a second ASR model, a second text output, wherein the second ASR model adds noise to a vector representation of the first text output to obtain a noisy vector representation of the first text output and is trained to remove the noise from the noisy vector representation of the first text output. The techniques include calculating a first loss of the second ASR model based at least on a comparison between the second text output and the first text output and modifying learnable parameters of the second ASR model to improve an accuracy of the second ASR model.

Patent Agency Ranking