MULTILINGUAL AND CODE-SWITCHING ASR USING LARGE LANGUAGE MODEL GENERATED TEXT

    Publication No.: US20250095637A1

    Publication Date: 2025-03-20

    Application No.: US18886581

    Filing Date: 2024-09-16

    Applicant: Google LLC

    Abstract: A method includes receiving a textual prompt in a first language and obtaining a fine-tuned prompt embedding configured to guide a large language model (LLM) to generate text in a target language from textual prompts in the first language. The method also includes processing, using the LLM, the textual prompt conditioned on the fine-tuned prompt embedding to generate output text in the target language and concatenating the textual prompt and the generated output text to provide an unspoken textual utterance. The method also includes training a multilingual automatic speech recognition (ASR) model to learn how to recognize speech in the target language by injecting the unspoken textual utterance into a text encoder associated with the multilingual ASR model.
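The data flow in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: `build_unspoken_utterance`, `toy_generate`, the separator, and the lookup table are hypothetical names and values introduced here for illustration only. The LLM is replaced by a toy lookup that ignores the prompt embedding, and the step that injects text into the ASR model's text encoder is omitted.

```python
from typing import Callable, Sequence

# Hypothetical stand-in types; none of these names come from the patent.
PromptEmbedding = Sequence[float]
GenerateFn = Callable[[str, PromptEmbedding], str]


def build_unspoken_utterance(
    prompt: str,
    prompt_embedding: PromptEmbedding,
    generate: GenerateFn,
    sep: str = " ",
) -> str:
    """Generate target-language text from the prompt, then concatenate.

    The result is text-only training data (no paired audio), which the
    abstract calls an "unspoken textual utterance".
    """
    output_text = generate(prompt, prompt_embedding)
    return f"{prompt}{sep}{output_text}"


def toy_generate(prompt: str, prompt_embedding: PromptEmbedding) -> str:
    """Toy placeholder for an LLM conditioned on a fine-tuned prompt embedding.

    Assumption: a real system would condition generation on the embedding;
    this stub ignores it and uses a fixed English-to-Spanish lookup.
    """
    lookup = {"thank you": "gracias", "good morning": "buenos dias"}
    return lookup.get(prompt.lower(), "")


if __name__ == "__main__":
    embedding = [0.0, 0.0]  # placeholder values, not a real embedding
    for text in ("thank you", "good morning"):
        print(build_unspoken_utterance(text, embedding, toy_generate))
```

Because each concatenated string mixes the first language and the target language, a collection of such strings could serve as code-switched text-only input for the text encoder described in the abstract.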
