Abstract:
A vector quantizer improves the accuracy of vector quantization when the first-stage vector quantization codebook is switched according to the type of a feature correlated with the quantization target vector. In the vector quantizer, a classifier generates classification information representing, out of plural types, the type of the narrowband LSP vector correlated with the wideband LSP (Line Spectral Pairs). A first codebook selects, from plural sub-codebooks corresponding to the respective types of narrowband LSP vectors, the one sub-codebook corresponding to the classification information as the codebook used for first-stage quantization. A multiplier multiplies the first-stage quantization residual vector input from an adder by the scaling factor that corresponds to the classification information, out of plural scaling factors stored in a scaling factor determiner, and outputs the result to an adder as the quantization target of a second stage.
Abstract:
Disclosed is a tone determination device that determines the tonality of an input signal using correlations between the frequency components of the current frame and those of the preceding frame, while reducing computational complexity. In the device, a vector coupling unit (104) couples some of the SDFT coefficients of the preceding frame with some of the down-sampled SDFT coefficients of the preceding frame to generate new SDFT coefficients, and likewise couples some of the SDFT coefficients of the current frame with some of the down-sampled SDFT coefficients of the current frame to generate new SDFT coefficients. A correlation analysis unit (105) finds the correlations of the SDFT coefficients between frames and also finds the power of the current frame for each specific band. A band determination unit (106) determines the band with the greatest power and outputs the location information for the determined band as shift information, and a tone determination unit (107) determines the tonality of the input signal according to the correlation values input from the correlation analysis unit (105).
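A minimal sketch of the per-band power search, the coupling step, and the inter-frame correlation may help here. The function names, the fixed band size, the down-sampling step of 2, and the choice to keep the strongest band at full resolution are illustrative assumptions, not details taken from the abstract. The final thresholding step of the tone determination unit is left out because the abstract does not state the decision rule.

```python
import math

def strongest_band(coeffs, band_size):
    """Return the index of the band with the greatest power (sum of squares)."""
    powers = [sum(c * c for c in coeffs[i:i + band_size])
              for i in range(0, len(coeffs), band_size)]
    return max(range(len(powers)), key=powers.__getitem__)

def couple(coeffs, band, band_size, step=2):
    """Assumed coupling: keep one band at full resolution and down-sample
    the coefficients on either side of it, yielding a shorter vector."""
    start = band * band_size
    end = start + band_size
    return coeffs[:start:step] + coeffs[start:end] + coeffs[end::step]

def normalized_correlation(prev, cur):
    """Normalized correlation between two coupled coefficient vectors."""
    energy = sum(p * p for p in prev) * sum(c * c for c in cur)
    if energy == 0.0:
        return 0.0  # no signal in at least one frame
    return sum(p * c for p, c in zip(prev, cur)) / math.sqrt(energy)
```

Because the coupled vectors are shorter than the full coefficient vectors, the correlation sum runs over fewer terms, which is where the saving in computation would come from under these assumptions.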
Abstract:
A vector quantizer improves the accuracy of vector quantization when the first-stage vector quantization codebook is switched according to the type of a feature correlated with the quantization target vector. In the vector quantizer, a classifier (101) generates classification information representing, out of plural types, the type of the narrowband LSP vector correlated with the wideband LSP (Line Spectral Pairs). A first codebook (103) selects, from plural sub-codebooks (CBa1 to CBan) corresponding to the respective types of narrowband LSP vectors, the one sub-codebook corresponding to the classification information as the codebook used for first-stage quantization. A multiplier (107) multiplies the first-stage quantization residual vector input from an adder (104) by the scaling factor that corresponds to the classification information, out of plural scaling factors stored in a scaling factor determining section (106), and outputs the result to an adder (109) as the quantization target of a second stage.
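The two-stage flow can be sketched as follows. This is an illustration under stated assumptions: the squared-error search, the dictionary layout of the sub-codebooks and scaling factors, and the single shared second-stage codebook are hypothetical choices, and the classifier itself is not shown (its output is passed in as `class_id`).

```python
def nearest(codebook, target):
    """Return (index, code vector) with the smallest squared error to target."""
    def error(i):
        return sum((t - c) ** 2 for t, c in zip(target, codebook[i]))
    best = min(range(len(codebook)), key=error)
    return best, codebook[best]

def two_stage_encode(target, class_id, sub_codebooks, scales, second_codebook):
    """Quantize target in two stages, switching stage 1 by class_id."""
    # Stage 1: use the sub-codebook selected by the classification information.
    i1, c1 = nearest(sub_codebooks[class_id], target)
    residual = [t - c for t, c in zip(target, c1)]
    # Multiply the residual by the class-dependent scaling factor.
    scaled = [scales[class_id] * r for r in residual]
    # Stage 2: quantize the scaled residual.
    i2, _ = nearest(second_codebook, scaled)
    return i1, i2

def two_stage_decode(i1, i2, class_id, sub_codebooks, scales, second_codebook):
    """Invert the encoder: undo the scaling before adding the stage-2 vector."""
    c1 = sub_codebooks[class_id][i1]
    c2 = second_codebook[i2]
    return [a + b / scales[class_id] for a, b in zip(c1, c2)]
```

Scaling the residual per class lets one second-stage codebook cover residuals whose spread differs from class to class, which is one plausible reading of why the scaling factor depends on the classification information.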
Abstract:
There is provided a wide-band LSP prediction device and the like capable of predicting a wide-band LSP from a narrow-band LSP with high quantization efficiency and high accuracy while limiting the size of the conversion table that maps the narrow-band LSP to the wide-band LSP. In this device, a non-linear prediction unit (102) performs non-linear prediction using a converted wide-band LSP input from a narrow-band/wide-band conversion unit (101) and inputs the non-linear prediction result to an amplifier (103). The converted wide-band LSP is also input to an amplifier (104). An adder (122) adds the multiplication results (vectors) input from the amplifiers (103, 104).
Abstract:
A CELP speech decoder includes an adaptive codebook that generates an adaptive code vector and a random codebook that generates a random code vector. The random codebook includes an input vector provider that provides an input vector including at least one pulse, each pulse having a position and a polarity, a fixed waveform storage that stores at least one fixed waveform, and a selector that selects at least one of a first process and a second process based on a value of an adaptive codebook gain. The random codebook further includes a convolution section that generates the random code vector by convolving the at least one fixed waveform with the input vector when the first process is selected. A synthesis filter outputs synthesized speech by performing linear prediction coefficient synthesis on a signal based on the adaptive code vector and the random code vector.
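As an illustration of the first process only, the sketch below builds a pulse input vector and convolves a fixed waveform with it. The function names and the truncation to the frame length are assumptions. The selector and the second process are omitted because the abstract does not specify how the adaptive codebook gain drives the choice.

```python
def pulse_vector(length, pulses):
    """Build an input vector from (position, polarity) pairs; polarity is +1 or -1."""
    vector = [0.0] * length
    for position, polarity in pulses:
        vector[position] += polarity
    return vector

def convolve_truncated(waveform, vector):
    """Convolve a fixed waveform with the input vector, truncated to the
    length of the input vector (assumed frame length)."""
    n = len(vector)
    out = [0.0] * n
    for i, x in enumerate(vector):
        if x == 0.0:
            continue  # only pulse positions contribute
        for j, w in enumerate(waveform):
            if i + j < n:
                out[i + j] += x * w
    return out
```

Each pulse simply places a signed copy of the waveform at its position, so the cost of this step grows with the number of pulses rather than with the frame length.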
Abstract:
A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.
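The memory saving can be illustrated with a short sketch in which a table of seeds replaces a table of stored vectors. Python's `random.Random` is used here purely as a stand-in for the oscillator; the seed values, the uniform output range, and the function names are hypothetical and are not the oscillator described in the source.

```python
import random

# Hypothetical seed storage: one small integer per codebook entry
# instead of one full vector per entry.
SEEDS = [11, 23, 37, 41]

def oscillate(seed, length):
    """Stand-in oscillator: the same seed always yields the same vector."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(length)]

def random_code_vector(index, length, seeds=SEEDS):
    """Regenerate the code vector for a codebook index from its stored seed."""
    return oscillate(seeds[index], length)
```

The encoder and decoder only need to agree on the seed table and the oscillator; the vectors themselves are regenerated on demand.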
Abstract:
A noise canceller removes a noise component from an input speech signal. The noise canceller includes a noise cancellation coefficient adjuster that adjusts a noise cancellation coefficient to determine an amount of noise cancellation. A noise spectrum storage device stores an estimated noise spectrum. A noise estimator estimates a noise spectrum by comparing an input spectrum with a noise spectrum stored in the noise spectrum storage device. A noise canceling/spectrum compensator subtracts the noise spectrum stored in the noise spectrum storage device from the input spectrum based on a coefficient acquired by the noise cancellation coefficient adjuster.
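A rough sketch of the estimate-then-subtract loop follows. The per-bin minimum used as the comparison rule and the flooring of negative results are assumptions chosen for illustration; the abstract does not say how the comparison or the spectrum compensation is carried out.

```python
def update_noise(stored, spectrum):
    """Assumed estimator: per bin, keep the smaller of the stored noise
    value and the current input value."""
    return [min(n, s) for n, s in zip(stored, spectrum)]

def cancel_noise(spectrum, noise, coefficient, floor=0.0):
    """Subtract the scaled noise spectrum from the input spectrum.
    Bins that would fall below the floor are clamped (assumed compensation)."""
    return [max(s - coefficient * n, floor) for s, n in zip(spectrum, noise)]
```

A larger `coefficient` removes more noise at the risk of removing speech energy as well, which is why the abstract treats it as an adjustable quantity.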
Abstract:
Disclosed are a quantizer, an encoder, and methods thereof that reduce the computational load of quantizing values related to the transform coefficients of a principal component analysis transform when that transform is applied to stereo coding. A quantizer includes: a power correlation calculator which calculates the power of the left channel signal, the power of the right channel signal, and the correlation between the left channel signal and the right channel signal; an intermediate value calculator which calculates an intermediate value, which is the difference between the left channel signal power and the right channel signal power; a codebook which holds a plurality of sets of coefficients related to the transform coefficients of the principal component analysis transform, together with their codes; and a quantizer which calculates, as a cost function E, the sum of a first multiplication result obtained by multiplying a coefficient by the correlation value and a second multiplication result obtained by multiplying a coefficient by the intermediate value, selects the coefficients for which the cost function E is maximized, and fetches the code associated with the selected coefficients as the quantized code.
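The search can be sketched as below. The codebook contents are an assumption: for a candidate rotation angle θ, the energy of the principal component is (P_L + P_R)/2 + (P_L − P_R)/2 · cos 2θ + C · sin 2θ, so storing sin 2θ and cos 2θ / 2 per entry reduces the cost to two multiplications and one addition. The source only says the coefficients are "related to" the transform coefficients, so this particular choice and all names are illustrative.

```python
import math

def power_and_correlation(left, right):
    """Return (left power, right power, left-right correlation) for one frame."""
    p_left = sum(x * x for x in left)
    p_right = sum(x * x for x in right)
    corr = sum(l * r for l, r in zip(left, right))
    return p_left, p_right, corr

def build_codebook(angles):
    """Assumed codebook: (sin 2θ, cos 2θ / 2, code) per candidate angle θ."""
    return [(math.sin(2 * t), 0.5 * math.cos(2 * t), code)
            for code, t in enumerate(angles)]

def quantize(left, right, codebook):
    """Pick the code whose coefficients maximize E = a * corr + b * (P_L - P_R)."""
    p_left, p_right, corr = power_and_correlation(left, right)
    intermediate = p_left - p_right
    best = max(codebook, key=lambda e: e[0] * corr + e[1] * intermediate)
    return best[2]
```

With this layout no trigonometric function or division is evaluated during the search itself; the per-frame work is the three sums plus one multiply-add pair per codebook entry.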
Abstract:
Disclosed is an audio encoding device capable of adjusting the spectrum inclination of quantization noise without changing the formant weight. The device includes: an HPF (131) which extracts a high-frequency component of the frequency region from an input audio signal; a high-frequency energy level calculation unit (132) which calculates an energy level of the high-frequency component in a frame unit; an LPF (133) which extracts a low-frequency component of the frequency region from the input audio signal; a low-frequency energy level calculation unit (134) which calculates an energy level of the low-frequency component in a frame unit; and an inclination correction coefficient calculation unit (141) which multiplies the difference between the SNR of the high-frequency component and the SNR of the low-frequency component, input from an adder (140), by a constant and adds a bias component to the product so as to calculate an inclination correction coefficient γ3. The inclination correction coefficient is used for adjusting the spectrum inclination of the quantization noise.
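The last step reduces to a single affine expression, sketched below together with a per-frame energy helper. The function and parameter names are hypothetical, and the SNR values are taken as inputs because the abstract does not describe how they are derived from the band energies.

```python
def frame_energy(samples):
    """Energy level of one frame of a band-limited signal (sum of squares)."""
    return sum(x * x for x in samples)

def inclination_correction(snr_high, snr_low, constant, bias):
    """Multiply the high/low SNR difference by a constant and add a bias."""
    return constant * (snr_high - snr_low) + bias
```

For example, `inclination_correction(10.0, 4.0, 0.5, 1.0)` evaluates to `4.0`; the constant and bias shown are placeholders, not values from the source.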