System and method for performing speech enhancement using a deep neural network-based signal

Invention Grant

US10074380B2 System and method for performing speech enhancement using a deep neural network-based signal 有权

Please log in to see more content

Patent Title: System and method for performing speech enhancement using a deep neural network-based signal
Application No.: US15227885

Application Date: 2016-08-03
Publication No.: US10074380B2

Publication Date: 2018-09-11
Inventor: Jason Wung , Ramin Pishehvar , Daniele Giacobello , Joshua D. Atkins
Applicant: Apple Inc.
Applicant Address: US CA Cupertino
Assignee: Apple Inc.
Current Assignee: Apple Inc.
Current Assignee Address: US CA Cupertino
Agency: Womble Bond Dickinson (US) LLP
Main IPC: G10L21/02
IPC: G10L21/02 ; G10L21/0232 ; G10L25/30 ; G10L25/87 ; G10L21/0208

System and method for performing speech enhancement using a deep neural network-based signal

Abstract:

Method for performing speech enhancement using a Deep Neural Network (DNN)-based signal starts with training DNN offline by exciting a microphone using target training signal that includes signal approximation of clean speech. Loudspeaker is driven with a reference signal and outputs loudspeaker signal. Microphone then generates microphone signal based on at least one of: near-end speaker signal, ambient noise signal, or loudspeaker signal. Acoustic-echo-canceller (AEC) generates AEC echo-cancelled signal based on reference signal and microphone signal. Loudspeaker signal estimator generates estimated loudspeaker signal based on microphone signal and AEC echo-cancelled signal. DNN receives microphone signal, reference signal, AEC echo-cancelled signal, and estimated loudspeaker signal and generates a speech reference signal that includes signal statistics for residual echo or for noise. Noise suppressor generates a clean speech signal by suppressing noise or residual echo in the microphone signal based on speech reference signal. Other embodiments are described.

Public/Granted literature

US20180040333A1 SYSTEM AND METHOD FOR PERFORMING SPEECH ENHANCEMENT USING A DEEP NEURAL NETWORK-BASED SIGNAL Public/Granted day:2018-02-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L21/00	为了改变语音或声音信号的质量或其可识度而处理语音或声音信号，以产生另一种可听的或非可听的信号，例如视觉信号或触觉信号（G10L19/00优先）
G10L21/02	.语音增强，例如降低噪声或消除回声（在直线传送系统中减轻回声效应入H04B3/20；免提电话中的回声抑制入H04M9/08）