摘要:
A voice control system allows a user to control a device through voice commands. The voice control system includes a speech recognition unit that receives a control signal from a mobile device and a speech signal from a user. The speech recognition unit configures speech recognition settings in response to the control signal to improve speech recognition.
摘要:
The invention relates to speech signal processing that detects a speech signal from more than one microphone and obtains microphone signals that are processed by a beamformer to obtain a beamformed signal that is post-filtered signal with a filter that employs adaptable filter weights to obtain an enhanced beamformed signal with the post-filter adapting the filter weights with previously learned filter weights.
摘要:
The invention provides a method for determining a noise reference signal for noise compensation and/or noise reduction. A first audio signal on a first signal path and a second audio signal on a second signal path are received. The first audio signal is filtered using a first adaptive filter to obtain a first filtered audio signal. The second audio signal is filtered using a second adaptive filter to obtain a second filtered audio signal. The first and the second filtered audio signal are combined to obtain the noise reference signal. The first and the second adaptive filter are adapted such as to minimize a wanted signal component in the noise reference signal.
摘要:
Systems, methods, and apparatus for using at least one mobile device to receive a representation of at least one audio signal. In some embodiments, the at least one audio signal comprises speech of at least one of a plurality of first participants of a meeting, the plurality of first participants participating in the meeting from a first location, and the at least one audio signal may be audibly rendered to at least one second participant of the meeting at a second location different from the first location. In some embodiments, the at least one mobile device may further receive an indication of an identity of a leading speaker of the speech in the at least one audio signal, the leading speaker being identified from among the plurality of first participants, and may render the identity of the leading speaker to the at least one second participant.
摘要:
A signal processor uses input devices to detect speech or aural signals. Through a programmable set of weights and/or time delays (or phasing) the output of the input devices may be processed to yield a combined signal. The noise contributions of some or each of the outputs of the input devices may be estimated by a circuit element or a controller that processes the outputs of the respective input devices to yield power densities. A short-term measure or estimate of the noise contribution of the respective outputs of the input devices may be obtained by processing the power densities of some or each of the outputs of the respective input devices. Based on the short-term measure or estimate, the noise contribution of the combined signal may be estimated to enhance the combined signal when processed further. An enhancement device or post-filter may reduce noise more effectively and yield robust speech based on the estimated noise contribution of the combined signal.
摘要:
Embodiments of the invention disclose computer-implemented methods, systems, and computer program products for estimating signal coherence. First, a sound generated by a sound source is detected by a first microphone to obtain a first microphone signal and by a second microphone to obtain a second microphone signal. The first microphone signal is filtered by a first adaptive finite impulse response filter to obtain a first filtered signal. The second microphone signal is filtered by a second adaptive finite impulse response filter, to obtain a second filtered signal. The coherence of the first filtered signal and the second filtered signal is determined based upon the filtered signals. The first and the second microphone signals are filtered such that the difference between the acoustic transfer function for the transfer of the sound from the sound source to the first microphone and the transfer of the sound from the sound source to the second microphone is compensated in the first and second filtered signals.
摘要:
A method of locating a sound source based on sound received at an array of microphones comprises the steps of determining a correlation function of signals provided by microphones of the array and establishing a direction in which the sound source is located based on at least one eigenvector of a matrix having matrix elements which are determined based on the correlation function. The correlation function has first and second frequency components associated with a first and second frequency band, respectively. The first frequency component is determined based on signals from microphones having a first distance, and the second frequency component is determined based on signals from microphones having a second distance different from the first distance.
摘要:
Embodiments of the present invention relate to methods, systems, and computer program products for signal processing. A first plurality of microphone signals is obtained by a first microphone array. A second plurality of microphone signals is obtained by a second microphone array different from the first microphone array. The first plurality of microphone signals is beamformed by a first beamformer comprising beamforming weights to obtain a first beamformed signal. The second plurality of microphone signals is beamformed by a second beamformer comprising the same beamforming weights as the first beamformer to obtain a second beamformed signal. The beamforming weights are adjusted such that the power density of echo components and/or noise components present in the first and second plurality of microphone signals is substantially reduced.
摘要:
A signal processor uses input devices to detect speech or aural signals. Through a programmable set of weights and/or time delays (or phasing) the output of the input devices may be processed to yield a combined signal. The noise contributions of some or each of the outputs of the input devices may be estimated by a circuit element or a controller that processes the outputs of the respective input devices to yield power densities. A short-term measure or estimate of the noise contribution of the respective outputs of the input devices may be obtained by processing the power densities of some or each of the outputs of the respective input devices. Based on the short-term measure or estimate, the noise contribution of the combined signal may be estimated to enhance the combined signal when processed further. An enhancement device or post-filter may reduce noise more effectively and yield robust speech based on the estimated noise contribution of the combined signal.
摘要:
Embodiments disclosed herein may include determining a signal parameter of a first microphone and a second microphone associated with a computing device. Embodiments may include generating a reference parameter based upon at least one of the parameter of the first microphone and the parameter of the second microphone. Embodiments may include adjusting a tolerance of at least one of the first microphone and the second microphone, based upon the reference parameter. Embodiments may include receiving, at the first microphone, a first speech signal, the first speech signal having a first speech signal magnitude and receiving, at the second microphone, a second speech signal, the second speech signal having a second speech signal magnitude. Embodiments may include comparing at least one of the first speech signal magnitude and the second speech signal magnitude with a third speech signal magnitude and detecting an obstructed microphone based upon the comparison.