- Patent title: Real-time dynamic noise reduction using convolutional networks
- Application number: US17033605
- Filing date: 2020-09-25
- Publication number: US12062369B2
- Publication date: 2024-08-13
- Inventors: Adam Kupryjanow, Tomasz Noczynski, Lukasz Pindor, Sebastian Rosenkiewicz
- Applicant: Intel Corporation
- Applicant address: US CA Santa Clara
- Patentee: Intel Corporation
- Current patentee: Intel Corporation
- Current patentee address: US CA Santa Clara
- Agent: Akona IP PC
- Main classification: G10L15/20
- IPC classification: G10L15/20; G10L15/16

Abstract:
A system, method and computer readable medium for dynamic noise reduction in a voice call. The system includes an encoder having a short-time Fourier transform module to determine a magnitude spectrum and a phase spectrum of an input audio signal, including speech and dynamic noise. A separator coupled to the encoder comprises a temporal convolution network (TCN) used to develop a separation mask using the magnitude spectrum as input. The TCN is trained using a frequency SNR function used to calculate loss during training. A mixer is coupled to the separator to multiply the separation mask with the magnitude spectrum to separate the speech from the dynamic noise to obtain a denoise magnitude spectrum. A decoder coupled to the mixer and the encoder includes an inverse short-time Fourier transform module to reconstruct the input audio signal without the dynamic noise using the denoise magnitude spectrum and the phase spectrum.
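
The encoder, mixer and decoder stages described in the abstract can be illustrated with a minimal single-frame sketch. It is not the patented implementation. It assumes a naive DFT of one frame stands in for one column of the short-time Fourier transform, and it replaces the trained TCN separator with a hypothetical hand-written mask function (`oracle_mask`). All function names here are illustrative and do not come from the patent.

```python
import cmath
import math


def dft(frame):
    """Naive DFT of one frame (assumed stand-in for a single STFT column)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT, returning the real part of each output sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def encode(frame):
    """Encoder: split the frame's spectrum into magnitude and phase."""
    spectrum = dft(frame)
    return [abs(c) for c in spectrum], [cmath.phase(c) for c in spectrum]


def mix(mask, magnitude):
    """Mixer: multiply the separation mask with the magnitude spectrum, bin by bin."""
    return [m * a for m, a in zip(mask, magnitude)]


def decode(magnitude, phase):
    """Decoder: recombine the masked magnitude with the original phase, then invert."""
    return idft([cmath.rect(a, p) for a, p in zip(magnitude, phase)])


def denoise_frame(frame, mask_fn):
    """Run encoder -> separator (mask_fn) -> mixer -> decoder on one frame."""
    magnitude, phase = encode(frame)
    mask = mask_fn(magnitude)  # in the patent this mask comes from a trained TCN
    return decode(mix(mask, magnitude), phase)


# Toy input: a "speech" tone in bin 1 plus a "noise" tone in bin 5.
N = 16
speech = [math.cos(2 * math.pi * t / N) for t in range(N)]
noisy = [s + 0.5 * math.cos(2 * math.pi * 5 * t / N) for t, s in enumerate(speech)]


def oracle_mask(magnitude):
    """Hypothetical mask: keep only the bins holding the speech tone (1 and N-1)."""
    n = len(magnitude)
    return [1.0 if k in (1, n - 1) else 0.0 for k in range(n)]


denoised = denoise_frame(noisy, oracle_mask)
```

In this toy case the mask is chosen by hand because the location of the speech tone is known in advance. In the system described above, the mask would instead be predicted from the magnitude spectrum by the TCN, and a real STFT would process overlapping windowed frames rather than a single block.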