Abstract:
A method of processing an audio signal is disclosed. The present invention includes receiving, by an audio processing apparatus, an input signal; receiving user gain input; generating a linear gain factor and a non-linear gain factor using the user gain input; modifying the non-linear gain factor using absolute threshold of hearing and power of the input signal to generate a modified non-linear gain factor; and, applying the linear gain factor and the modified non-linear gain factor to the audio signal.
Abstract:
A method of processing an audio signal is disclosed. The present invention includes receiving, by an audio processing apparatus, an input signal; estimating indicator function using a signal power of the input signal; obtaining an adapted filter using the indicator function and an equalization filter; and, generating an output signal by applying the adapted filter to the input signal.
Abstract:
A method of processing an audio signal is disclosed. The present invention comprises receiving downmix signal including object signals, transforming the downmix signal per frequency band, determining a direction of an object signal from the transformed downmix signal, and determining blind information by estimating a level of the object signal corresponding to the direction. Accordingly, the present invention generates blind information in case of using an encoder incapable of generating object information, thereby enabling a gain and panning of object to be controlled using the blind information.
Abstract:
A system and method for use in filtering of an acoustic signal are provided for producing an output signal of attenuated amount of diffuse sound in accordance with predetermined parameters of desired output directional response and required attenuation of diffuse sound. The system includes a filtration module and a filter generation module including a directional analysis module and filter construction module.
Abstract:
Surround sound recording is a tedious task requiring the use of many microphones. The invention aims at enabling the use of two-channel microphones (or stereo microphones) for multi-channel surround recording. A conventional stereo microphone, or a two-channel microphone specifically optimized for use with the proposed algorithm, is used to generate two signals. A post-processor is applied to the microphone generated signals to convert them to multi-channel surround.This aim is achieved through a method to generate multiple output audio channels (y1, . . . , yM) from two microphone generated audio channels (x1, x2), in which the number of output channels is equal or higher than two, this method comprising the steps of: determine directions of sound components related to the microphone characteristics determine compensation gains of sound components related to the microphone characteristics generating the output audio channels, y1, . . . , yM, by using the microphone generated audio channels, x1, x2, directions, and compensation gains.
Abstract:
Embodiments of the present invention are directed to a binaural cue coding (BCC) scheme in which an externally provided audio signal (e.g., a studio engineering audio signal) is transmitted, along with derived cue codes, to a receiver instead of an automatically downmixed audio signal. The cue codes are (adaptively) synchronized with the externally provided audio signal to compensate for time lags (and changes in those time lags) between the externally downmixed audio signal and the multi-channel signal used to generate the cue codes. If the receiver is a legacy receiver, then the studio engineered audio signal will typically provide a higher-quality playback than would be provided by the automatically downmixed audio signal. If the receiver is a BCC-capable receiver, then the synchronization of the cue codes with the externally provided audio signal will typically improve the quality of the synthesized playback.
Abstract:
A method and apparatus are disclosed for controlling a buffer in a digital audio broadcasting (DAB) communication system. The decoder buffer level limits are specified in terms of a maximum number of encoded frames (or duration). The transmitter can predict the number of encoded frames, Fpred, in the decoder buffer and transmit the value, Fpred, to the receiver with the audio data. If the transmitter determines that the decoder buffer level is becoming too high, the frames being generated by the encoder are too small and additional bits are allocated to each frame for each of the N programs. Likewise, if the transmitter determines that the decoder buffer level is becoming too low, the frames being generated by the encoder are too big and fewer bits are allocated to each frame for each of the N programs. The transmitted predicted buffer level, Fpred, can also be employed to (i) determine when the decoder should commence decoding frames; and (ii) synchronize the transmitter and the receiver. The receiver fills the decoder buffer until Fpred frames are received before commencing decoding frames when the decoder first starts up or possibly when a new audio program is selected. The transmitter and receiver clocks may be synchronized by adjusting the clock at the receiver by using a feedback loop that compares the actual level of the decoder buffer to the predicted value, Fpred, received from the transmitter (a higher number of encoded frames in the buffer indicates that the clock of the receiver is too slow and should be increased, and a lower number of encoded frames in the buffer indicates that the clock of the receiver is too fast and needs to be slowed down).
Abstract:
Acoustic echo control and noise suppression is an important part of any “handsfree” telecommunication system, such as telephony or audio or video conferencing systems. Bandwidth and computational complexity constraints have prevented that stereo or multi-channel telecommunication systems have been widely applied. The advantages are very low complexity, high robustness, scalability to multi-channel audio without a need for loudspeaker signal distortion, and efficient integration of echo and noise control in the same algorithm. The proposed method for processing audio signals, comprises the steps of: —receiving an input signal, wherein the input signal is applied to a loudspeaker; —receiving a microphone signal generated by a microphone; —estimating the delay between the loudspeaker and the microphone signals and obtaining a delayed loudspeaker signal, —estimating a coloration correction values of the echo path on the delayed loudspeaker signal, —using information of the delayed loudspeaker signal, microphone signal, and coloration correction values to determine gain filter values, —apply the gain filter values to the microphone signal to remove the echo.
Abstract:
Acoustic echo control and noise suppression is an important part of any “handsfree” telecommunication system, such as telephony or audio or video conferencing systems. Bandwidth and computational complexity constraints have prevented that stereo or multi-channel telecommunication systems have been widely applied. The advantages are very low complexity, high robustness, scalability to multi-channel audio without a need for loudspeaker signal distortion, and efficient integration of echo and noise control in the same algorithm. The proposed method of processing multi-channels audio loudspeakers signals and at least one microphone signal, comprises the steps of: transforming the input microphone signals (y1(n), y2(n), . . . , yM(n)) into input microphone short-time spectra, computing a combined loudspeaker signal short-time spectrum [X(i,k)] from the loudspeaker signals, (x1(n), x2(n), . . . , xL(n)), computing a combined microphone signal short-time spectrum [Y(i,k)] from the input microphone signal, (y1(n), y2(n), . . . , yM(n)), estimating a magnitude or power spectrum of the echo in the combined microphone signal short-time spectrum, computing a gain filter (G(i,k)) for magnitude modification of the input microphone short-time spectra, applying the gain filter to at least one of the input microphone spectra, converting the filtered input microphone spectra into the time domain (e1(n), e2(n), . . . , eM(n)).
Abstract:
An apparatus for generating a multi-channel output signal performs a center channel cancellation to obtain improved base channels for reconstructing left-side output channels or right-side output channels. In particular, the apparatus includes a cancellation channel calculator for calculating a cancellation channel using information related to the original center channel available at the decoder. The device furthermore includes a combiner for combining a transmission channel with the cancellation channel. Finally, the apparatus includes a reconstructor for generating the multi-channel output signal. Due to the center channel cancellation, the channel reconstructor not only uses a different base channel for reconstructing the center channel but also uses base channels different from the transmission channels for reconstructing left and right output channels which have a reduced or even completely cancelled influence of the original center channel.