Real-time dynamic noise reduction using convolutional networks

发明授权

US12062369B2 Real-time dynamic noise reduction using convolutional networks 有权

请登陆查看更多内容

专利标题： Real-time dynamic noise reduction using convolutional networks
申请号： US17033605

申请日： 2020-09-25
公开(公告)号： US12062369B2

公开(公告)日： 2024-08-13
发明人: Adam Kupryjanow , Tomasz Noczynski , Lukasz Pindor , Sebastian Rosenkiewicz
申请人： Intel Corporation
申请人地址： US CA Santa Clara
专利权人： Intel Corporation
当前专利权人： Intel Corporation
当前专利权人地址： US CA Santa Clara
代理机构： Akona IP PC
主分类号： G10L15/20
IPC分类号： G10L15/20 ; G10L15/16

Real-time dynamic noise reduction using convolutional networks

摘要：

A system, method and computer readable medium for dynamic noise reduction in a voice call. The system includes an encoder having a short-time Fourier transform module to determine a magnitude spectrum and a phase spectrum of an input audio signal, including speech and dynamic noise. A separator coupled to the encoder comprises a temporal convolution network (TCN) used to develop a separation mask using the magnitude spectrum as input. The TCN is trained using a frequency SNR function used to calculate loss during training. A mixer is coupled to the separator to multiply the separation mask with the magnitude spectrum to separate the speech from the dynamic noise to obtain a denoise magnitude spectrum. A decoder coupled to the mixer and the encoder includes an inverse short-time Fourier transform module to reconstruct the input audio signal without the dynamic noise using the denoise magnitude spectrum and the phase spectrum.

公开/授权文献

US20210012767A1 REAL-TIME DYNAMIC NOISE REDUCTION USING CONVOLUTIONAL NETWORKS 公开/授权日：2021-01-14

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/20	.专门适用于不利环境（例如，噪音环境）中保持鲁棒性或增强语音强度的语音识别技术（G10L21/02优先）