-
1.
公开(公告)号:US10540984B1
公开(公告)日:2020-01-21
申请号:US15273568
申请日:2016-09-22
Applicant: Apple Inc.
Inventor: Sarmad Aziz Malik , Arvindh Krishnaswamy
IPC: G10L21/0232 , G10L21/0388 , G10L21/0208
Abstract: Method for echo control using adaptive polynomial filters in sub-band domain starts with loudspeaker that is configured to be driven by a reference signal outputting a loudspeaker signal. Microphone receives at least one of: a near-end speaker signal, ambient noise signal, or the loudspeaker signal and generates a microphone signal. Adaptive polynomial filters in sub-band domain included in adaptive echo canceller (AEC) are configured to adaptively filter representation of the reference signal in a plurality of channels in a sub-band domain based on a clean signal to generate the echo estimate. Echo suppressor is configured to remove an echo estimate from the microphone signal to generate the clean signal. Other embodiments are described.
-
公开(公告)号:US20200327887A1
公开(公告)日:2020-10-15
申请号:US16380504
申请日:2019-04-10
Applicant: Apple Inc.
Inventor: Sarmad Aziz Malik , Charles P. Clark , Devang K. Naik , Srikanth Vishnubhotla
Abstract: Audio signals produced by microphones can be processed to remove echo and reverberation. The processed signals can be mapped to each other with adaptively estimated impulse responses. One or more of the processed signals, one or more of the mapped signals, and one or more of the impulse responses can be fed to an automatic speech recognizer (ASR) having a deep neural network (DNN), to train the DNN or recognize speech in the input audio signals. Other aspects are described and claimed.
-
公开(公告)号:US10978086B2
公开(公告)日:2021-04-13
申请号:US16517400
申请日:2019-07-19
Applicant: Apple Inc.
Inventor: Jason Wung , Sarmad Aziz Malik , Ashrith Deshpande , Ante Jukic , Joshua D. Atkins
IPC: G10L21/0208 , G10K11/178 , G10L21/0216
Abstract: An echo canceller is disclosed in which audio signals of the playback content received by one or more of the microphones from a loudspeaker of the device may be used as the playback reference signals to estimate the echo signals of the playback content received by a target microphone for echo cancellation. The echo canceller may estimate the transfer function between a reference microphone and the target microphone based on the playback reference signal of the reference microphone and the signal of the target microphone. To mitigate near-end speech cancellation at the target microphone, the echo canceller may compute a mask to distinguish between target microphone audio signals that are echo-signal dominant and near-end speech dominant. The echo canceller may use the mask to adaptively update the transfer function or to modify the playback reference signal used by the transfer function to estimate the echo signals of the playback content.
-
公开(公告)号:US20210020188A1
公开(公告)日:2021-01-21
申请号:US16517400
申请日:2019-07-19
Applicant: Apple Inc.
Inventor: Jason Wung , Sarmad Aziz Malik , Ashrith Deshpande , Ante Jukic , Joshua D. Atkins
IPC: G10L21/0208 , G10K11/178
Abstract: An echo canceller is disclosed in which audio signals of the playback content received by one or more of the microphones from a loudspeaker of the device may be used as the playback reference signals to estimate the echo signals of the playback content received by a target microphone for echo cancellation. The echo canceller may estimate the transfer function between a reference microphone and the target microphone based on the playback reference signal of the reference microphone and the signal of the target microphone. To mitigate near-end speech cancellation at the target microphone, the echo canceller may compute a mask to distinguish between target microphone audio signals that are echo-signal dominant and near-end speech dominant. The echo canceller may use the mask to adaptively update the transfer function or to modify the playback reference signal used by the transfer function to estimate the echo signals of the playback content.
-
公开(公告)号:US11290834B2
公开(公告)日:2022-03-29
申请号:US16880249
申请日:2020-05-21
Applicant: Apple Inc.
Inventor: Sarmad Aziz Malik , Sreeneel Maddika , Devang K. Naik
Abstract: Systems and processes for operating an intelligent automated assistant are provided. An examples process of operating an intelligent automated assistant includes, at an electronic device with one or more processors and memory, receiving audio input, determining a direct-to-reverberant energy ratio based on the audio input, and determining a head pose of a user based on the direct-to-reverberant energy ratio.
-
6.
公开(公告)号:US10090001B2
公开(公告)日:2018-10-02
申请号:US15225595
申请日:2016-08-01
Applicant: Apple Inc.
Inventor: Lalin S. Theverapperuma , Vasu Iyengar , Sarmad Aziz Malik , Raghavendra Prabhu
IPC: G10L25/30 , G10L21/0232 , G10L25/72 , G10L25/84 , G10L21/028
Abstract: Method of speech enhancement using Neural Network-based combined signal starts with training neural network offline which includes: (i) exciting at least one accelerometer and at least one microphone using training accelerometer signal and training acoustic signal, respectively. The training accelerometer signal and the training acoustic signal are correlated during clean speech segments. Training neural network offline further includes(ii) selecting speech included in the training accelerometer signal and in the training acoustic signal, and (iii) spatially localizing the speech by setting a weight parameter in the neural network based on the selected speech included in the training accelerometer signal and in the training acoustic signal. The neural network that is trained offline is then used to generate a speech reference signal based on an accelerometer signal from the at least one accelerometer and an acoustic signal received from the at least one microphone. Other embodiments are described.
-
7.
公开(公告)号:US20180033449A1
公开(公告)日:2018-02-01
申请号:US15225595
申请日:2016-08-01
Applicant: Apple Inc.
Inventor: Lalin S. Theverapperuma , Vasu Iyengar , Sarmad Aziz Malik , Raghavendra Prabhu
IPC: G10L25/30 , G10L25/72 , G10L21/0232 , G10L25/84 , G10L21/028
CPC classification number: G10L25/30 , G10L21/0232 , G10L21/028 , G10L25/72 , G10L25/84
Abstract: Method of speech enhancement using Neural Network-based combined signal starts with training neural network offline which includes: (i) exciting at least one accelerometer and at least one microphone using training accelerometer signal and training acoustic signal, respectively. The training accelerometer signal and the training acoustic signal are correlated during clean speech segments. Training neural network offline further includes (ii) selecting speech included in the training accelerometer signal and in the training acoustic signal, and (iii) spatially localizing the speech by setting a weight parameter in the neural network based on the selected speech included in the training accelerometer signal and in the training acoustic signal. The neural network that is trained offline is then used to generate a speech reference signal based on an accelerometer signal from the at least one accelerometer and an acoustic signal received from the at least one microphone. Other embodiments are described.
-
公开(公告)号:US09858944B1
公开(公告)日:2018-01-02
申请号:US15206110
申请日:2016-07-08
Applicant: Apple Inc.
Inventor: Sarmad Aziz Malik , Arvindh Krishnaswamy
IPC: G10L21/0364 , G10L21/0264 , G10L25/21 , G10L21/0208 , G10L21/0216
CPC classification number: G10L25/21 , G10L21/0208 , G10L2021/02082 , G10L2021/02166
Abstract: Apparatus for linear and nonlinear acoustic echo control includes loudspeaker, first, second, and third microphone, beamformer, and first echo canceller. The loudspeaker outputs a loudspeaker signal that includes reference signal. The first microphone and the second microphone are collocated with the loudspeaker, receive at least one of: a near-end speaker signal from a near-end speaker and the loudspeaker signal, and generate first and second microphone uplink signals, respectively. The third microphone receives the near-end speaker signal and generates a third microphone uplink signal. The beamformer receives the first and second microphone uplink signals, directs a beam towards the loudspeaker and drives a null towards the near-end speaker, and generates a beamformer output. The first echo canceler receives the third microphone uplink signal and the beamformer output, and cancels echoes in the third microphone uplink signal based on the beamformer output to generate an echo cancelled signal. Other embodiments are described.
-
-
-
-
-
-
-