发明授权
- 专利标题: Speech emotion recognition method and system based on fused population information
-
申请号: US17845908申请日: 2022-06-21
-
公开(公告)号: US11837252B2公开(公告)日: 2023-12-05
- 发明人: Taihao Li , Shukai Zheng , Yulong Liu , Guanxiong Pei , Shijie Ma
- 申请人: Zhejiang Lab
- 申请人地址: CN Zhejiang
- 专利权人: Zhejiang Lab
- 当前专利权人: Zhejiang Lab
- 当前专利权人地址: CN Zhejiang
- 代理机构: JCIPRNET
- 优先权: CN 2110322720.X 2021.03.26
- 主分类号: G10L25/63
- IPC分类号: G10L25/63 ; G10L25/18 ; G10L25/21 ; G10L25/30
摘要:
The present invention discloses a speech emotion recognition method and system based on fused population information. The method includes the following steps: S1: acquiring a user's audio data; S2: preprocessing the audio data, and obtaining a Mel spectrogram feature; S3: cutting off a front mute segment and a rear mute segment of the Mel spectrogram feature; S4: obtaining population depth feature information through a population classification network; S5: obtaining Mel spectrogram depth feature information through a Mel spectrogram preprocessing network; S6: fusing the population depth feature information and the Mel spectrogram depth feature information through SENet to obtain fused information; and S7: obtaining an emotion recognition result from the fused information through a classification network.
公开/授权文献
信息查询