Abstract:
The present invention discloses a speech enhancement method and device for mobile phones. By the method and device provided by the present invention, the mobile phone holding state of a user is detected when the user is talking on the phone, so that different denoising solutions will be employed according to the state of the user in holding the mobile phone. When the user holds the mobile phone normally, a solution integrating multi-microphone denoising and single-microphone denoising will be employed to effectively suppress both the steady noise and the non-steady noise; and when the user holds the mobile phone abnormally, a solution of single-microphone denoising will be employed only to suppress the steady noise. The distortion of speech by multi-microphone denoising is avoided, and the speech quality is ensured.
Abstract:
The present invention relates to a method and device for dereverberation of single-channel speech. The method includes the following steps of framing an input single channel speech signal, and processing the frame signals as follows according to a time sequence: performing short-time Fourier transform on a current frame to obtain a power spectrum and a phase spectrum of the current frame; selecting several frames previous to the current frame and having a distance from the current frame within a set duration range, and performing linear superposition on the power spectra of these frames to estimate the power spectrum of a late reflection sound of the current frame; removing the estimated power spectrum of the late reflection sound of the current frame from the power spectrum of the current frame by a spectral subtraction method to obtain the power spectra of a direct sound and an early reflection sound of the current frame; and performing inverse short-time Fourier transform on the power spectra of the direct sound and the early reflection sound of the current frame and the phase spectrum of the current frame together to obtain a signal of the current frame after dereverberation. The dereverberation method and device can solve the problem that the estimation of a transfer function of a reverberation environment or the estimation of reverberation time is difficult in the dereverberation of single-channel speech.
Abstract:
Disclosed in the invention is a method and system for sampling rate mismatch correction of transmitting and receiving terminals, which can obtain a high-precision sampling rate mismatch in real time, carry out sampling rate correction on transmitting and receiving terminal signals, and send the transmitting terminal signal and the receiving terminal signal that have the same sampling rate after corrected to an echo cancellation system to carry out echo cancellation. The present invention can improve the quality of echo cancellation, simplify the computation and reduce the cost. The method for sampling rate mismatch correction of transmitting and receiving terminals provided in the embodiments of the invention comprises: calculating a transfer function of a receiving terminal signal relative to a transmitting terminal signal at each sampling timing according to the transmitting and receiving terminal signals; obtaining a transmission time delay of the transmitting and receiving terminals at each sampling timing using the transfer function; obtaining a sampling rate mismatch of the transmitting and receiving terminals at each sampling timing by means of parameter fitting using the transmission time delay and the linear relationship between the transmission time delay and the sampling rate mismatch; and adjusting the sampling rate of the transmitting terminal signal or the receiving terminal signal at each sampling timing according to the sampling rate mismatch.
Abstract:
The present invention relates to a method and device for dereverberation of single-channel speech. The method includes the following steps of framing an input single channel speech signal, and processing the frame signals as follows according to a time sequence: performing short-time Fourier transform on a current frame to obtain a power spectrum and a phase spectrum of the current frame; selecting several frames previous to the current frame and having a distance from the current frame within a set duration range, and performing linear superposition on the power spectra of these frames to estimate the power spectrum of a late reflection sound of the current frame; removing the estimated power spectrum of the late reflection sound of the current frame from the power spectrum of the current frame by a spectral subtraction method to obtain the power spectra of a direct sound and an early reflection sound of the current frame; and performing inverse short-time Fourier transform on the power spectra of the direct sound and the early reflection sound of the current frame and the phase spectrum of the current frame together to obtain a signal of the current frame after dereverberation. The dereverberation method and device can solve the problem that the estimation of a transfer function of a reverberation environment or the estimation of reverberation time is difficult in the dereverberation of single-channel speech.
Abstract:
A wearing state detection method for a wearable device and a device. The detection method comprises: providing in the wearable device a sensor in an area capable of contacting the skin of a user, wherein the sensor outputs different measurement values when the user wears or takes off the wearable device; acquiring a base value indicating whether the wearable device is being worn; after wearing detection has been started up, collecting measurement values from the sensor at a preset sampling frequency; judging whether the wearable device is currently in a wearing state according to the measurement values and the base value; and controlling the wearable device to turn off a corresponding function that is running when the wearable device is in a non-wearing state. The method and the device can reduce the power consumption and simplify user operation.
Abstract:
The present invention discloses a howling suppression method and device applied to an ANR earphone. The method comprises: collecting signals by using a first microphone and a second microphone; wherein the first microphone is arranged in a position outside an auditory meatus when said ANR earphone is worn, and the second microphone is arranged in a position inside the auditory meatus when the ANR earphone is worn; according to a relation between signals collected by the first microphone and the second microphone, judging whether the current state of said ANR earphone is a state unable to produce a howling or a state able to produce a howling; and when the current state of said ANR earphone is a state able to produce a howling, starting processing for preventing howling production. The technical scheme can achieve that the ANR earphone does not produce a howling all the time.
Abstract:
A method and a system for achieving a self-adaptive surround sound. The method comprises: recognizing specific positions of a room and a user in the room by using an object recognition technology, capturing focusing images of recognized objects by controlling a camera using a focusing control technology, and recording corresponding focusing parameters (S110); calculating position information of the room relative to the camera and position information of the user relative to the camera according to the images and the parameters (S120); calculating sound beams that can achieve the surround sound at the position of the user in said room according to aforesaid calculated position information of the room and the user (S130); obtaining parameters of a filter group according to the calculated sound beams, and adjusting the filter group of a loudspeaker array according to the parameters (S140); and playing an audio signal via the loudspeaker array after the audio signal is filtered by the filter group that has been adjusted according to the parameters to form surround sound at the position of the user in the room (S150).
Abstract:
The present invention discloses an echo elimination device and method for a miniature hands-free voice communication system. The system comprises a receiver, a primary transmitter and an auxiliary transmitter, a distance from the primary transmitter to the receiver being greater than that from the auxiliary transmitter to the receiver. The device comprises an array echo elimination unit, a self-adaptive echo elimination unit and a residual echo elimination unit, which are structurally cascaded in turn. The array echo elimination unit, with inputs being a signal of the primary transmitter and a signal of the auxiliary transmitter, performs array filtering to obtain one path of output signals; the self-adaptive echo elimination unit, with the input signals being a signal of the receiver, the output signal of the array echo elimination unit and a signal of the auxiliary transmitter, performs self-adaptive filtering to obtain two paths of output signals; the residual echo elimination unit, with the input signals being the two paths of output signals of the self-adaptive echo elimination unit, performs voice probability estimation and echo matching to obtain an echo-eliminated voice signal. Thus, the duplex performance can be enhanced, and the phase consistency of the transmitters is not strictly required.
Abstract:
The invention discloses a method and a device for reducing voice reverberation based on double microphones. The method comprises the steps of calculating a transfer function h(t) from a secondary microphone to a primary microphone according to an input signal x2(t) of the primary microphone and an input signal x1(t) of the secondary microphone; judging the strength of reverberation according to h(t) and calculating a regulatory factor β of a gain function by taking a tail section hr(t) of the h(t); obtaining a late reverberation estimation signal {circumflex over (r)}(t) of x2(t) with the convolution of x1(t) and hr(t); calculating the gain function according to the frequency spectrum of x2(t), β and frequency spectrum of {circumflex over (r)}(t); obtaining the reverberation removed frequency spectrum of x2(t) by multiplying the frequency spectrum of x2(t) by the gain function; and obtaining a late reverberation removed time-domain signal of x2(t) by frequency-time conversion. Thus, the late reverberation can be removed from the input signal of the primary microphone, early reverberation can be preserved, processed voice is not caused to be thin, and the voice quality is improved. Meanwhile, spectral subtraction intensity is adjusted according to the strength of the reverberation so as to ensure that the voice is not damaged on the condition that the reverberation is weak and the voice intelligibility is originally high. Accurate estimation of DOA of direct sound is not needed, and therefore the microphones are not required to have high consistency.
Abstract:
The present invention discloses an echo elimination device and method for a miniature hands-free voice communication system. The system comprises a receiver, a primary transmitter and an auxiliary transmitter, a distance from the primary transmitter to the receiver being greater than that from the auxiliary transmitter to the receiver. The device comprises an array echo elimination unit, a self-adaptive echo elimination unit and a residual echo elimination unit, which are structurally cascaded in turn. The array echo elimination unit, with inputs being a signal of the primary transmitter and a signal of the auxiliary transmitter, performs array filtering to obtain one path of output signals; the self-adaptive echo elimination unit, with the input signals being a signal of the receiver, the output signal of the array echo elimination unit and a signal of the auxiliary transmitter, performs self-adaptive filtering to obtain two paths of output signals; the residual echo elimination unit, with the input signals being the two paths of output signals of the self-adaptive echo elimination unit, performs voice probability estimation and echo matching to obtain an echo-eliminated voice signal. Thus, the duplex performance can be enhanced, and the phase consistency of the transmitters is not strictly required.