-
Publication number: US20190066710A1
Publication date: 2019-02-28
Application number: US15688455
Filing date: 2017-08-28
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar, Aram M. Lindahl
IPC: G10L21/02, G10L15/22, G10L21/0272, G10L21/0232, G10L15/30, G10L25/51, G10L21/0316
Abstract: A method for controlling a speech enhancement process in a far-end device, while engaged in a voice or video telephony communication session over a communication link with a near-end device. A near-end user speech signal is produced, using a microphone to pick up speech of a near-end user, and is analyzed by an automatic speech recognizer (ASR) without being triggered by an ASR trigger phrase or button. The recognized words are compared to a library of phrases to select a matching phrase, where each phrase is associated with a message that represents an audio signal processing operation. The message associated with the matching phrase is sent to the far-end device, which is used to configure the far-end device to adjust the speech enhancement process that produces the far-end speech signal. Other embodiments are also described.
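
The phrase-matching step in this abstract can be illustrated with a minimal sketch. The abstract does not give the library contents or the message format, so the phrases, the message names, and the helper names `normalize` and `select_message` below are all hypothetical, and the substring comparison is only one possible way to compare recognized words with library phrases.

```python
from typing import Optional

# Hypothetical phrase library: each phrase maps to a message that names an
# audio signal processing operation. Phrases and message names are assumptions.
PHRASE_LIBRARY = {
    "i can't hear you": "INCREASE_GAIN",
    "there is a lot of noise": "INCREASE_NOISE_SUPPRESSION",
    "you sound muffled": "REDUCE_NOISE_SUPPRESSION",
}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not block a match."""
    return " ".join(text.lower().split())


def select_message(recognized_words: str) -> Optional[str]:
    """Return the message of the first library phrase found in the ASR output.

    Returns None when no phrase matches, in which case nothing would be sent
    to the far-end device.
    """
    text = normalize(recognized_words)
    for phrase, message in PHRASE_LIBRARY.items():
        if phrase in text:
            return message
    return None


if __name__ == "__main__":
    print(select_message("Sorry, I can't hear you at all"))
    print(select_message("See you tomorrow"))
```

In a real system the returned message would be sent over the communication link to the far-end device; that transport is outside the scope of this sketch.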
-
Publication number: US10269369B2
Publication date: 2019-04-23
Application number: US15610500
Filing date: 2017-05-31
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar
IPC: H04R3/00, G10L25/84, G10L21/028, G10L21/0208, G10L21/0216, G10L21/0232, G10L21/0272
Abstract: System of noise reduction for mobile devices includes blind source separator (BSS) and noise suppressor. BSS receives signals from at least two audio pickup channels. BSS includes sound source separator, voice source detector, equalizer, and auto-disabler. Sound source separator generates signals representing first sound source and second sound source based on signals from the first and the second channels. Voice source detector determines whether the signals representing the first and second sound sources are voice signal or noise signal, respectively. Equalizer scales noise signal to match a level of the voice signal, and generates scaled noise signal. Auto-disabler determines whether to disable BSS. Auto-disabler outputs signals from the at least two audio pickup channels when the BSS is disabled and outputs the voice signal and the scaled noise signal when the BSS is not disabled. Noise suppressor generates clean signal based on outputs from auto-disabler. Other embodiments are also described.
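
As a rough illustration of the equalizer and auto-disabler stages described in this abstract, the sketch below scales a noise signal to the level of the voice signal and routes either the raw channels or the separated signals onward. The abstract does not say how level is measured or how the disable decision is made, so the RMS level measure, the `disable_bss` flag, and all function names here are assumptions; the separation and voice detection stages are taken as already done.

```python
import math
from typing import List, Tuple

Signal = List[float]


def rms(signal: Signal) -> float:
    """Root-mean-square level of a signal (assumed level measure)."""
    if not signal:
        return 0.0
    return math.sqrt(sum(s * s for s in signal) / len(signal))


def equalize(voice: Signal, noise: Signal) -> Signal:
    """Scale the noise signal so that its RMS level matches the voice signal."""
    noise_level = rms(noise)
    if noise_level == 0.0:
        return list(noise)  # nothing to scale
    gain = rms(voice) / noise_level
    return [gain * s for s in noise]


def auto_disabler(
    channel1: Signal,
    channel2: Signal,
    voice: Signal,
    noise: Signal,
    disable_bss: bool,
) -> Tuple[Signal, Signal]:
    """Pass through the raw pickup channels when BSS is disabled; otherwise
    output the voice signal and the level-matched noise signal."""
    if disable_bss:
        return channel1, channel2
    return voice, equalize(voice, noise)
```

The pair returned by `auto_disabler` corresponds to the inputs of the noise suppressor in the abstract, which is not modeled here.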
-
Publication number: US20180350381A1
Publication date: 2018-12-06
Application number: US15610500
Filing date: 2017-05-31
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar
IPC: G10L21/0232, G10L25/84, G10L21/028, H04R3/00, G10L21/0272
CPC classification number: G10L21/0232, G10L21/0272, G10L21/028, G10L25/84, G10L2021/02082, G10L2021/02166, H04R3/005
Abstract: System of noise reduction for mobile devices includes blind source separator (BSS) and noise suppressor. BSS receives signals from at least two audio pickup channels. BSS includes sound source separator, voice source detector, equalizer, and auto-disabler. Sound source separator generates signals representing first sound source and second sound source based on signals from the first and the second channels. Voice source detector determines whether the signals representing the first and second sound sources are voice signal or noise signal, respectively. Equalizer scales noise signal to match a level of the voice signal, and generates scaled noise signal. Auto-disabler determines whether to disable BSS. Auto-disabler outputs signals from the at least two audio pickup channels when the BSS is disabled and outputs the voice signal and the scaled noise signal when the BSS is not disabled. Noise suppressor generates clean signal based on outputs from auto-disabler. Other embodiments are also described.
-
Publication number: US11114109B2
Publication date: 2021-09-07
Application number: US16785480
Filing date: 2020-02-07
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Qing Yang, Vasu Iyengar
IPC: G10L21/0216, H04R3/00, G10L21/0208
Abstract: A device implementing a system for mitigating noise includes at least one processor configured to receive a first audio signal corresponding to a first microphone, and determine whether wind noise is present based at least in part on the first audio signal. The processor is configured to select, based on the determining, a second audio signal from between second and third microphones. The second microphone is disposed at a location that experiences less echo coupling when the device is in a particular orientation with respect to a user. The third microphone is disposed at another location that experiences less wind noise. The processor is configured to determine voice and noise reference values based on the first and the selected second audio signals, and perform noise suppression with respect to at least one of the first or the selected second audio signal, based on the voice or the noise reference value.
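
The microphone selection described in this abstract might be sketched as follows. The abstract does not state how wind noise is detected, nor which microphone is chosen in which case. The sketch assumes that the third microphone (less wind noise) is chosen when wind is detected and the second microphone (less echo coupling) otherwise. The low-band energy ratio test, its threshold, and all function names are hypothetical.

```python
from typing import List

Signal = List[float]


def is_wind_present(
    low_band_energy: float,
    total_energy: float,
    threshold: float = 0.6,
) -> bool:
    """Hypothetical detector: flag wind when most of the first microphone's
    energy sits in the low band. The threshold is an assumed value."""
    if total_energy <= 0.0:
        return False
    return (low_band_energy / total_energy) > threshold


def select_second_signal(
    wind_present: bool,
    mic2_signal: Signal,
    mic3_signal: Signal,
) -> Signal:
    """Assumed selection rule: use the microphone that experiences less wind
    noise (third) when wind is present, otherwise the one that experiences
    less echo coupling (second)."""
    return mic3_signal if wind_present else mic2_signal
```

The selected signal, together with the first microphone signal, would then feed the voice and noise reference estimation and the noise suppression that the abstract describes.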
-
Publication number: US20190156847A1
Publication date: 2019-05-23
Application number: US16256587
Filing date: 2019-01-24
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar, Aram M. Lindahl
IPC: G10L21/02, G10L17/00, G10L21/0232, G10L21/028
CPC classification number: G10L21/0205, G10L17/00, G10L17/005, G10L21/02, G10L21/0208, G10L21/0232, G10L21/0272, G10L21/028, G10L2015/088, G10L2021/02082, H04M2201/40
Abstract: A method for controlling a speech enhancement process in a far-end device, while engaged in a voice or video telephony communication session over a communication link with a near-end device. A near-end user speech signal is produced, using a microphone to pick up speech of a near-end user, and is analyzed by an automatic speech recognizer (ASR) without being triggered by an ASR trigger phrase or button. The recognized words are compared to a library of phrases to select a matching phrase, where each phrase is associated with a message that represents an audio signal processing operation. The message associated with the matching phrase is sent to the far-end device, which is used to configure the far-end device to adjust the speech enhancement process that produces the far-end speech signal. Other embodiments are also described.
-
Publication number: US10553235B2
Publication date: 2020-02-04
Application number: US16256587
Filing date: 2019-01-24
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar, Aram M. Lindahl
IPC: G10L21/028, G10L21/02, G10L21/0232, G10L17/00
Abstract: A method for controlling a speech enhancement process in a far-end device, while engaged in a voice or video telephony communication session over a communication link with a near-end device. A near-end user speech signal is produced, using a microphone to pick up speech of a near-end user, and is analyzed by an automatic speech recognizer (ASR) without being triggered by an ASR trigger phrase or button. The recognized words are compared to a library of phrases to select a matching phrase, where each phrase is associated with a message that represents an audio signal processing operation. The message associated with the matching phrase is sent to the far-end device, which is used to configure the far-end device to adjust the speech enhancement process that produces the far-end speech signal. Other embodiments are also described.
-
Publication number: US10535362B2
Publication date: 2020-01-14
Application number: US15909513
Filing date: 2018-03-01
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar
IPC: G10L21/0232, G10L21/028, H04R3/00, G10L21/02, G10L21/0216
Abstract: Signals are received from audio pickup channels that contain signals from multiple sound sources. The audio pickup channels may include one or more microphones and one or more accelerometers. Signals representative of multiple sound sources are generated using a blind source separation algorithm. It is then determined which of those signals is deemed to be a voice signal and which is deemed to be a noise signal. The output noise signal may be scaled to match a level of the output voice signal, and a clean speech signal is generated based on the output voice signal and the scaled noise signal. Other aspects are described.
-
Publication number: US20190272842A1
Publication date: 2019-09-05
Application number: US15909513
Filing date: 2018-03-01
Applicant: Apple Inc.
Inventor: Nicholas J. Bryan, Vasu Iyengar
IPC: G10L21/0232, G10L21/028, G10L21/02, H04R3/00
Abstract: Signals are received from audio pickup channels that contain signals from multiple sound sources. The audio pickup channels may include one or more microphones and one or more accelerometers. Signals representative of multiple sound sources are generated using a blind source separation algorithm. It is then determined which of those signals is deemed to be a voice signal and which is deemed to be a noise signal. The output noise signal may be scaled to match a level of the output voice signal, and a clean speech signal is generated based on the output voice signal and the scaled noise signal. Other aspects are described.