摘要:
In encoding, pitch periods for time series signals in a predetermined time interval are calculated, and a code corresponding thereto is output. In that encoding, the resolutions for expressing the pitch periods and/or a pitch period encoding mode are switched according to whether an index indicating a periodicity and/or stationarity level of the time series signals satisfies a condition indicating high or low in periodicity and/or stationarity. In that decoding, according to whether an index indicating a periodicity and/or stationarity level, the index being included in or obtained from an input code corresponding to the predetermined time interval, satisfies a condition indicating high periodicity and/or stationarity, a decoding mode for a code, included in the input code, corresponding to pitch periods is switched to decode the code corresponding to the pitch periods to obtain the pitch periods corresponding to the predetermined time interval.
摘要:
The range of disclosed configurations includes methods in which subbands of a speech signal are separately encoded, with the excitation of a first subband being derived from a second subband. Gain factors are calculated to indicate a time-varying relation between envelopes of the original first subband and of the synthesized first subband. The gain factors are quantized, and quantized values that exceed the pre-quantized values are re-coded.
摘要:
A speech intelligibility enhancement (SIE) system and method is described that improves the intelligibility of a speech signal to be played back by an audio device when the audio device is located in an environment with loud acoustic background noise. In an embodiment, the audio device comprises a near-end telephony terminal and the speech signal comprises a speech signal received over a communication network from a far-end telephony terminal for playback at the near-end telephony terminal.
摘要:
Systems, methods, apparatus, and machine-readable media for voice activity detection in a single-channel or multichannel audio signal are disclosed.
摘要:
A decoding apparatus decodes a first encoded data that is encoded from a low-frequency component of an audio signal, and a second encoded data that is used when creating a high-frequency component of an audio signal from a low-frequency component and encoded in accordance with a certain bandwidth, into the audio signal. In the decoding apparatus, a high-frequency component detecting unit divides the high-frequency component into bands with a certain interval range correspondingly to the certain bandwidth, and detects magnitude of the high-frequency components corresponding to each of the bands. A high-frequency component compensating unit compensates the high-frequency components based on the magnitude of the high-frequency components corresponding to each of the bands detected by the high-frequency component detecting unit. A decoding unit that decodes the low-frequency component decoded from the first encoded data, and the high-frequency components compensated by the high-frequency component compensating unit, into the audio signal.
摘要:
An audio encoder for encoding an audio signal includes an impulse extractor for extracting an impulse-like portion from the audio signal. This impulse-like portion is encoded and forwarded to an output interface. Furthermore, the audio encoder includes a signal encoder which encodes a residual signal derived from the original audio signal so that the impulse-like portion is reduced or eliminated in the residual audio signal. The output interface forwards both, the encoded signals, i.e., the encoded impulse signal and the encoded residual signal for transmission or storage. On the decoder-side, both signal portions are separately decoded and then combined to obtain a decoded audio signal.
摘要:
Embodiments of the present invention provide a signal classification method and device, and encoding and decoding methods and devices. The encoding method includes: dividing a current frame into a low-frequency band signal and a high-frequency band signal; attenuating the high-frequency band signal or a to-be-encoded characteristic parameter of the high-frequency band signal according to an energy attenuation value of the low-frequency band signal, where the energy attenuation value indicates energy attenuation of the low-frequency band signal caused by encoding of the low-frequency band signal; and encoding the attenuated high-frequency band signal or the attenuated to-be-encoded characteristic parameter of the high-frequency band signal. The technical solutions according to the embodiments of the present invention can improve the effect of combining the low-frequency band signal and the high-frequency band signal at the decoder.
摘要:
Disclosed herein is a quantization method and apparatus of an audio encoder. The quantization method comprises calculating an absolute value of a maximum frequency spectrum of a first frame, externally received, by analyzing frequency spectrum data of the first frame, setting an initial value of a common scale factor to be used to quantize the first frame based on the absolute value of the maximum frequency spectrum of the first frame and an absolute value of a maximum frequency spectrum of a second frame, which has previously been calculated, and quantizing the frequency spectrum data of the first frame based on the set initial value of the common scale factor. Accordingly, before quantization is performed, an initial value of a common scale factor which is almost close to a value of an actual common scale factor can be previously set.
摘要:
Methods and apparatus for signal processing are disclosed. Source separation can be performed to extract source signals from mixtures of source signals by way of independent component analysis. Source separation described herein involves mixed multivariate probability density functions that are mixtures of component density functions having different parameters corresponding to frequency components of different sources, different time segments, or some combination thereof.
摘要:
Provided is a method and apparatus for encoding or decoding a signal corresponding to a high frequency band in an audio signal. The method and apparatus for encoding a high frequency band detects and encodes frequency component(s) according to a pre-set criterion from a signal corresponding to a frequency band higher than a pre-set frequency and encodes energy value(s) of a signal to reconstruct band(s) in which the detected frequency component(s) are included. The method and apparatus for decoding a high frequency band decodes the signal by adjusting a signal to reconstruct a band in which important frequency component(s) are included by considering an energy value of the important frequency component(s). Accordingly, even though encoding or decoding is performed using a small number of bits, there is no degradation in sound quality of a signal corresponding to a high frequency band, and thus coding efficiency can be maximized.