SPEECH EMOTION RECOGNITION METHOD AND SYSTEM BASED ON FUSED POPULATION INFORMATION

    公开(公告)号:US20220328065A1

    公开(公告)日:2022-10-13

    申请号:US17845908

    申请日:2022-06-21

    Applicant: Zhejiang Lab

    Abstract: The present invention discloses a speech emotion recognition method and system based on fused population information. The method includes the following steps: S1: acquiring a user's audio data; S2: preprocessing the audio data, and obtaining a Mel spectrogram feature; S3: cutting off a front mute segment and a rear mute segment of the Mel spectrogram feature; S4: obtaining population depth feature information through a population classification network; S5: obtaining Mel spectrogram depth feature information through a Mel spectrogram preprocessing network; S6: fusing the population depth feature information and the Mel spectrogram depth feature information through SENet to obtain fused information; and S7: obtaining an emotion recognition result from the fused information through a classification network.

Patent Agency Ranking