摘要:
A method for performing variable rate speech coding in the speech codec comprising a plurality of speech codec modes operating at different bit rates, the speech encoded by said speech codec being arranged for transmission in a telecommunications network. Information on an active speech codec mode set to be supported is received from the telecommunications network, in response to which the supported speech codec modes that correspond to the active codec mode set determined in the telecommunications network will be activated. Thereafter, speech signals to be applied to the speech codec are encoded with the activated speech codec modes such that the speech codec mode of the substantially lowest bit rate is adapted to the speech frames comprised by the speech signals such that in view of the channel conditions in the telecommunications network the level of residual error in coding will be substantially minimized at the same time.
摘要:
The effects of bad frames received over a communications channel by a speech decoder are concealed by replacing the values of the spectral parameters of the bad frames (a bad frame being either a corrupted frame or a lost frame) with values based on an at least partly adaptive mean of recently received good frames, but in case of a corrupted frame (as opposed to a lost frame), using the bad frame itself if the bad frame meets a predetermined criterion. The aim of concealment is to find the most suitable parameters for the bad frame so that subjective quality of the synthesized speech is as high as possible.
摘要:
A method and system for concealing errors in one or more bad frames in a speech sequence as part of an encoded bit stream received in a decoder. When the speech sequence is voiced, the LTP-parameters in the bad frames are replaced by the corresponding parameters in the last frame. When the speech sequence is unvoiced, the LTP-parameters in the bad frames are replaced by values calculated based on the LTP history along with an adaptively-limited random term.
摘要:
The invention relates to a method for supporting an encoding of an audio signal, wherein at least a first and a second coder mode are available for encoding a section of the audio signal. The first coder mode enables a coding based on two different coding models. A selection of a coding model is enabled by a selection rule which is based on signal characteristics which have been determined for a certain analysis window. In order to avoid a misclassification of a section after a switch to the first coder mode, it is proposed that the selection rule is activated only when sufficient sections for the analysis window have been received. The invention relates equally to a module 2,3 in which this method is implemented, to a device 1 and a system comprising such a module 2,3, and to a software program product including a software code for realizing the proposed method.
摘要:
The invention relates to an encoder (1) comprising an input (1.2) for inputting frames of an audio signal in a frequency band, an analysis filter (1.3) for dividing the frequency band into at least a lower frequency band and a higher frequency band, a first encoding block (1.4.1) for encoding the audio signals of the lower frequency band, a second encoding block (1.4.2) for encoding the audio signals of the higher frequency band, and a mode selector for selecting operating mode for the encoder among at least a first mode and a second mode. In the first mode signals only on the lower frequency band are encoded, and in the second mode signals on both the lower and higher frequency band are encoded. The encoder (1) further comprises a scaler to control the second encoding block (1.4.2) to gradually change the encoding properties of the second encoding block (1.4.2) in connection with a change in the operating mode of the encoder. The invention also relates to a device, a decoder, a method, a module, a computer program product, and a signal.
摘要:
The effects of bad frames received over a communications channel by a speech decoder are concealed by replacing the values of the spectral parameters of the bad frames (a bad frame being either a corrupted frame or a lost frame) with values based on an at least partly adaptive mean of recently received good frames, but in case of a corrupted frame (as opposed to a lost frame), using the bad frame itself if the bad frame meets a predetermined criterion. The aim of concealment is to find the most suitable parameters for the bad frame so that subjective quality of the synthesized speech is as high as possible.