-
公开(公告)号:US20220328065A1
公开(公告)日:2022-10-13
申请号:US17845908
申请日:2022-06-21
Applicant: Zhejiang Lab
Inventor: Taihao LI , Shukai ZHENG , Yulong LIU , Guanxiong PEI , Shijie MA
Abstract: The present invention discloses a speech emotion recognition method and system based on fused population information. The method includes the following steps: S1: acquiring a user's audio data; S2: preprocessing the audio data, and obtaining a Mel spectrogram feature; S3: cutting off a front mute segment and a rear mute segment of the Mel spectrogram feature; S4: obtaining population depth feature information through a population classification network; S5: obtaining Mel spectrogram depth feature information through a Mel spectrogram preprocessing network; S6: fusing the population depth feature information and the Mel spectrogram depth feature information through SENet to obtain fused information; and S7: obtaining an emotion recognition result from the fused information through a classification network.