LANGUAGE-MODEL SUPPORTED SPEECH EMOTION RECOGNITION

    公开(公告)号:US20250061917A1

    公开(公告)日:2025-02-20

    申请号:US18235372

    申请日:2023-08-18

    Applicant: Google LLC

    Abstract: The technology relates to enhancing speech emotion recognition models with methods that enable the use of unlabeled data by inferring weak emotion labels. This is done by pre-trained large language models through weakly-supervised learning. For inferring weak labels constrained to a taxonomy, a textual entailment approach selects an emotion label with the highest entailment score for a speech transcript extracted via automatic speech recognition. The system may employ a method that generates, by one or more processors, a text transcript for a snippet of input speech, and then applies the text transcript to a pre-trained language model. The system can generate, using the pre-trained language model according to an engineered prompt and a predetermined taxonomy, a textual entailment from the text transcript. Based on this, the system may generate, by the one or more processors using the textual entailment, a predicted emotion corresponding to the input speech.

Patent Agency Ranking