摘要:
A transcoder for use between speech codecs using different Code-Excited Linear Prediction (CELP) type and a method therefor are disclosed. The transcoder includes a decoding unit of an input CELP codec, a transcoding filter, a transcoding filter design unit, and an encoding unit of an output CELP codec. By substituting a post-filter and a perceptual weighting filter of a prior art with one transcoding filter, the calculation amount of the transcoder is reduced, and speech quality decoded at a receiving end is improved.
摘要:
There is provided a speech coding/decoding apparatus and method, in which the input speech signals are classified into several classes in accordance with characteristics of the input speech signals and the input speech signals are coded using frame sizes, quantizer structures, and bit assignment methods corresponding to the determined classes, or in which the frame sizes can be adjusted in accordance with network conditions or codec type of a counter part. Therefore, by optimally adjusting the frame size, the quantizer structure, and the bit assignment method in accordance with the characteristics of input speech, it is possible to improve the performance of the speech coding apparatus, and by adjusting the frame size in accordance with the speech codec type of a counter part, it is also possible to reduce the total end-to-end delay.
摘要:
A wide-band speech coder and a method thereof and a wide-band speech decoder and a method thereof are provided. The wide-band speech coder includes a speech characteristic classification unit, which stipulates a characteristic of speech corresponding to a current frame statistically using an open-circuit pitch value and a linear prediction coefficient in which a wide-code speech signal to be coded is perceptual weigh filtered, an adaptive codebook retrieving unit, which retrieves a pitch delay value around the open-circuit pitch value, calculates a pitch gain value, generates an adaptive codebook contribution signal corresponding to the retrieved pitch delay value, and outputs a difference between the generated adaptive codebook contribution signal and the perceptual weigh filtered signal as a first fixed codebook target signal, a first fixed codebook retrieving unit, which obtains a first fixed codebook index that can express the first fixed codebook target signal most properly, and a first fixed codebook gain value, generates a first fixed codebook contribution signal corresponding to the retrieved index, and outputs a difference between the first generated fixed codebook contribution signal and the first fixed codebook target signal as a second fixed codebook target signal, a second fixed codebook retrieving unit, which includes at least two second fixed codebooks according to a speech characteristic, selects a second fixed codebook according to the speech characteristic, and retrieves second fixed codebook indices that can express the second fixed codebook target signal most properly, and second fixed codebook gain values, and a parameter multiplexer, which quantizes and multiplexes the speech characteristic information, the pitch delay value, the pitch gain value, the first fixed codebook index, the first fixed codebook gain value, the second fixed codebook indices, and the second fixed codebook gain values, makes them as a bit stream, and transmits the bit stream to an external speech decoding terminal.
摘要:
An audio coding method and apparatus capable of improving efficiency of a MPEG-4 AAC (Moving Picture Expert Group-4 Advanced Audio Coding) process are disclosed. The audio coding method and apparatus reduce the number of calculations of an audio coding algorithm to improve efficiency of an audio coding process. Specifically, the audio coding method and apparatus reduce the number of calculations required for a Psychoacoustic model process of the MPEG-4 AAC algorithm capable of coding an audio signal.
摘要:
Provided are a speech restoration system and method for concealing packet losses. The system includes a demultiplexer that demultiplexes an input bit stream and divides the input bit stream into several packets; a packet loss concealing unit that produces and outputs a linear spectrum pair (LSP) coefficient representing the vocal tract of voice and an excitation signal corresponding to a lost frame, when a packet loss occurs; and a speech restoring unit that synthesizes voice using the packets input from the demultiplexer, outputs the result as restored voice, and synthesizes voice corresponding to a lost packet using the LSP coefficient and the excitation signal input from the packet loss concealing unit and outputs the result as restored voice when the lost packet is detected, wherein the packet loss concealing unit repeats linear prediction coefficients (LPCs) of a last-received valid frame, produces a first excitation signal for the lost frame using a time scale modification (TSM) method, when the lost frame is voiceless, and produces a second excitation signal by re-estimating a gain parameter based on the first excitation signal, when the lost frame is voiced.