摘要:
There is disclosed a scalable encoding device capable of increasing the conversion performance from a narrow-band LSP to a wide-band LSP (prediction accuracy when predicting the wide-band LSP from the narrow-band LSP) and realizing a high-performance band scalable LSP encoding. The device includes a conversion coefficient calculation unit (109) for calculating a conversion coefficient by using a narrow-band quantization LSP which has been outputted from a narrow-band LSP encoding unit (103) and a wide-band quantization LSP which has been outputted from a wide-band LSP encoding unit (107). The wide-band LSP encoding unit (107) multiplies the narrow-band quantization LSP with the conversion coefficient inputted from the conversion coefficient calculation unit (109) so as to convert it into a wide-band LSP. The wide-band LSP is multiplied by a weight coefficient to calculate a prediction wide-band LSP. The wide-band LSP encoding unit (107) encodes an error signal between the obtained prediction wide-band LSP and the wide-band LSP so as to obtain a wide-band quantization LSP.
摘要:
An encoding apparatus and method for generating low-frequency-band encoding information and high-frequency-band encoding information from an original signal. The encoding apparatus includes a first spectrum calculator that calculates a first spectrum of a low frequency band from a decoded signal of the low-frequency-band encoding information, a second spectrum calculator that calculates a second spectrum from the original signal, an estimator that divides a high frequency band of the second spectrum into a plurality of bands and estimates the second spectrum included in each band, using the first spectrum, and a first error component encoder that encodes a first error component between the high frequency band of the second spectrum and an estimated spectrum. A corresponding decoding apparatus and method provides decoding.
摘要:
An audio encoding device for correcting a component having insufficient encoding capability in a core layer by an extended layer. A core layer encoder encodes an audio signal. An extended layer encoder encodes an encoding residual of the core layer encoder. A characteristic correction inverse filter arranged at a pre-stage of an LPC synthesis filter subjects the component having insufficient encoding capability in the core layer to an inverse characteristic correction process, and a characteristic correction filter arranged at a post-stage of the LPC synthesis filter performs a process for characteristic correction of the synthesis signal inputted from the LPC synthesis filter.
摘要:
A fixed codebook searching apparatus which slightly suppresses an increase in the operation amount, even if the filter applied to the excitation pulse has the characteristic that it cannot be represented by a lower triangular matrix and realizes a quasi-optimal fixed codebook search. This fixed codebook searching apparatus is provided with an algebraic codebook that generates a pulse excitation vector; a convolution operation section that convolutes an impulse response of auditory weighted synthesis filter into an impulse response vector that has a value at negative times, to generate a second impulse response vector that has a value at second negative times; a matrix generating section that generates a Toeplitz-type convolution matrix by means of the second impulse response vector; and a convolution operation section that convolutes the matrix generated by matrix generating section into the pulse excitation vector generated by algebraic codebook.
摘要:
Fluctuation in decoded signal localization is suppressed to maintain the feel of stereo. A selection unit (220) selects balance parameters when the balance parameters are input from a gain coefficient decoding unit (210), or selects balance parameters input from a gain coefficient calculation unit (223) when there is no balance parameter input from the gain coefficient decoding unit (210), and outputs the selected balance parameters to a multiplication unit (221). The multiplication unit (221) multiplies a gain coefficient input from the selection unit (220) with a decoded monaural signal input from a monaural decoding unit (202) to perform balance adjustment processing.
摘要:
Provided is an audio decoding device capable of suppressing an information amount for a lost frame compensation process and encoding efficiency. In this device, a decoded sound source generation unit (203) generates a lost frame decoded sound source signal; a pitch pulse information decoding unit (204) decodes the pitch pulse position information and the pitch pulse amplitude information; a pitch pulse waveform learning unit (205) learns a pitch pulse learning waveform in the past frame in advance from the lost frame; a convolution unit (206) amplitude-adjusts the pitch pulse learning waveform according to the pitch pulse amplitude information, and convolutes the pitch pulse waveform into a time axis which has been amplitude-adjusted according to the pitch pulse position information; a sound source signal correction unit (207) adds or replaces the pitch pulse waveform convoluted into the time axis to the lost frame decoded sound source signal.
摘要:
A prediction performance between the individual channels of a stereo signal is improved to improve the sound quality of a decoded signal. An LPF (101-1) interrupts the high-range component of an S1, and outputs an S1′ (a low-range component). An LPF (101-2) interrupts the high-range component of an S2, and outputs an S2′ (a low-range component). A prediction unit (102) predicts the S2′ from the S1′, and outputs a prediction parameter composed of a delay time difference (t) and an amplitude ratio (g). A first channel encoding unit (103) encodes the S1. A prediction parameter encoding unit (104) encodes the prediction parameter. The encoded parameters of the encoded parameter of the S1 and the prediction parameter are finally outputted.
摘要:
When an error is detected in coded data in the current frame, data separation section 201 separates the data into coding parameters first. Then, mode information decoding section 202 outputs decoding mode information in the previous frame and uses this as the mode information of the current frame. Furthermore, using the lag parameter code and gain parameter code of the current frame obtained at data separation section 201 and the mode information, lag parameter decoding section 204 and gain parameter decoding section 205 adaptively calculate a lag parameter and gain parameter to be used in the current frame according to the mode information.
摘要:
A fixed codebook searching apparatus, includes a convolution operator, implemented by at least one processor, that convolves an impulse response of a perceptually weighted synthesis filter with an impulse response vector that has values at negative times, to generate a second impulse response vector that has values at negative times. A matrix generator, implemented by at least one processor, generates a Toeplitz-type convolution matrix using the second impulse response vector generated by the convolution operator. A searcher, implemented by at least one processor, performs a codebook search by maximizing a term using the Toeplitz-type convolution matrix.
摘要:
A sound decoding device is capable of improving the lost frame compensation performance and improving quality of the decoded sound. A rise frame sound source compensation unit generates a compensation sound source signal when the current frame is a lost frame and a rise frame. An average sound source pattern update unit updates the average sound source pattern held in an average sound source pattern holding unit over a plurality of frames. When a frame is lost, an LPC synthesis unit performs LPC synthesis on a decoded sound source signal by using the compensation sound source signal inputted via a switching unit and a decoded LPC parameter from an LPC decoding unit and outputs the compensation decoded sound signal.