Abstract:
A noise estimating apparatus estimates two types of noise spectra that are used to remove a noise component. The noise estimating apparatus includes an A/D converter that converts an input speech signal to a digital signal, and a Fourier transformer that performs a discrete Fourier transform on the digital signal having a predetermined time length to obtain an input spectrum and a complex spectrum. The noise estimating apparatus also includes a noise spectrum storage device that stores the two types of noise spectra, namely a mean noise spectrum and a compensation noise spectrum, and a noise estimator that estimates a new compensation noise spectrum and a new mean noise spectrum as the two new types of noise spectra.
Abstract:
The present research can decrease the amount of computation and enhance speech quality by using a global pulse replacement method in a fixed codebook search. The fixed codebook search method, used in a speech encoder and based upon global pulse replacement, includes the steps of: (a) computing absolute values of the pulse-position likelihood-estimator vectors; (b) temporarily obtaining a codebook vector; (c) computing a mathematical equation by replacing a pulse; (d) determining whether a value computed based upon the mathematical equation is increased after pulse replacement; (e) obtaining a new codebook vector by replacing the pulse; and (f) maintaining a previous codebook vector.
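Steps (a) to (f) can be sketched as follows. This is a minimal illustration, not the method as claimed: it assumes one signed unit pulse per track, assumes the "mathematical equation" is the usual ACELP criterion Q = (d^T c)^2 / (c^T Phi c), and takes pulse signs from the likelihood estimator `b`. The names `pulse_sign`, `criterion`, `global_pulse_replacement`, and the toy data are hypothetical.

```python
def pulse_sign(b, n):
    """Sign given to a pulse at position n (assumption: the sign of b[n])."""
    return 1 if b[n] >= 0 else -1


def criterion(positions, b, d, phi):
    """Q = (d^T c)^2 / (c^T Phi c) for a codevector c of signed unit pulses."""
    signs = [pulse_sign(b, p) for p in positions]
    corr = sum(s * d[p] for p, s in zip(positions, signs))
    energy = sum(si * sj * phi[pi][pj]
                 for pi, si in zip(positions, signs)
                 for pj, sj in zip(positions, signs))
    return corr * corr / energy


def global_pulse_replacement(b, d, phi, tracks, max_iter=4):
    # (a), (b): provisional codevector from the largest |b[n]| in each track.
    positions = [max(track, key=lambda n: abs(b[n])) for track in tracks]
    best_q = criterion(positions, b, d, phi)
    for _ in range(max_iter):
        best_trial = None
        # (c): try every single-pulse replacement over all tracks.
        for t, track in enumerate(tracks):
            for n in track:
                if n == positions[t]:
                    continue
                trial = positions[:t] + [n] + positions[t + 1:]
                q = criterion(trial, b, d, phi)
                if q > best_q:          # (d): did the criterion increase?
                    best_q, best_trial = q, trial
        if best_trial is None:          # (f): keep the previous codevector.
            break
        positions = best_trial          # (e): adopt the replaced codevector.
    return positions, best_q


# Toy data: 8 positions, two interleaved tracks, Phi = H^T H for h = [1, 0.5].
h = [1.0, 0.5]
L = 8
H = [[h[i - j] if 0 <= i - j < len(h) else 0.0 for j in range(L)] for i in range(L)]
phi = [[sum(H[n][i] * H[n][j] for n in range(L)) for j in range(L)] for i in range(L)]
d = [0.3, -1.2, 0.8, 0.1, -0.4, 0.9, 0.2, -0.7]
tracks = [[0, 2, 4, 6], [1, 3, 5, 7]]
positions, q = global_pulse_replacement(d, d, phi, tracks)
print(positions, q)
```

Each pass evaluates every single-pulse replacement across all tracks and keeps only the best one, which is one plausible reading of "global" in the abstract. The cost per pass is the number of candidate positions, far below an exhaustive search over all combinations.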
Abstract:
A code excited linear prediction type speech coder includes a seed storage that stores seeds used as an initial state of oscillation, and an oscillator that generates different vector sequences in accordance with the values of the seeds stored in the seed storage and outputs the vector sequences as excitation vectors. The speech coder also includes a linear predictive coding synthesis filter that receives the excitation vectors as input, synthesizes them, and outputs synthesized speech.
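The seed-to-excitation path can be illustrated with a small sketch. The abstract does not specify the oscillator, so a 32-bit linear congruential recurrence stands in for it here purely as an assumption; `oscillate`, `lpc_synthesize`, the seed values, and the filter coefficient are all hypothetical.

```python
def oscillate(seed, length):
    """Generate one vector from a seed (assumed oscillator: a 32-bit LCG)."""
    state = seed & 0xFFFFFFFF
    vector = []
    for _ in range(length):
        # Stand-in recurrence; the abstract does not name the oscillator.
        state = (1664525 * state + 1013904223) & 0xFFFFFFFF
        vector.append(state / 2**31 - 1.0)   # map the state to [-1, 1)
    return vector


def lpc_synthesize(excitation, a):
    """All-pole filter 1/A(z) with A(z) = 1 + sum_k a[k] z^-(k+1)."""
    out = []
    for n, x in enumerate(excitation):
        feedback = sum(a[k] * out[n - 1 - k]
                       for k in range(len(a)) if n - 1 - k >= 0)
        out.append(x - feedback)
    return out


seed_storage = [1, 2, 3, 4]                       # hypothetical stored seeds
excitation_vectors = [oscillate(s, 40) for s in seed_storage]
speech = lpc_synthesize(excitation_vectors[0], a=[-0.5])
```

Storing seeds instead of full vectors trades memory for a small amount of computation: each stored integer expands into a whole excitation vector on demand.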
Abstract:
A speech encoder includes an adaptive codebook that generates an adaptive codevector representing a pitch component, a random codebook that generates a random codevector representing a random component, and a synthesis filter that generates a synthetic speech signal by being excited by the adaptive codevector and the random codevector. The random codebook includes an input vector provider configured to provide an input vector, and an excitation vector generator configured to generate an excitation vector as the random codevector by dispersing the input vector by using a fixed pattern. A length of the fixed pattern is shorter than a length of a sub-frame.
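A minimal sketch of the dispersion step, assuming that "dispersing" means convolving the input vector with the fixed pattern and truncating the result to the sub-frame length. The abstract does not state this, and the function name `disperse` and the sample values are hypothetical.

```python
def disperse(input_vector, fixed_pattern):
    """Convolve input_vector with fixed_pattern, truncated to the sub-frame."""
    subframe_len = len(input_vector)
    if len(fixed_pattern) >= subframe_len:
        raise ValueError("fixed pattern must be shorter than the sub-frame")
    out = [0.0] * subframe_len
    for n, x in enumerate(input_vector):
        if x == 0:
            continue                      # only non-zero samples contribute
        for k, p in enumerate(fixed_pattern):
            if n + k < subframe_len:      # drop samples past the sub-frame end
                out[n + k] += x * p
    return out


# Hypothetical 8-sample sub-frame holding two signed unit pulses.
pulses = [0, 1, 0, 0, 0, -1, 0, 0]
pattern = [1.0, 0.5, 0.25]
excitation = disperse(pulses, pattern)
print(excitation)  # → [0.0, 1.0, 0.5, 0.25, 0.0, -1.0, -0.5, -0.25]
```

Because the pattern is shorter than the sub-frame, each pulse spreads over only a few neighbouring samples, so the cost stays proportional to the number of pulses times the pattern length.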
Abstract:
An audio encoding device is provided that can improve encoding performance while performing a division search on an algebraic codebook in audio encoding. In a distortion minimizing unit (112) of a CELP encoding device: a maximum correlation value calculation unit (221) calculates a correlation value by using each pulse and a target signal in each candidate position for four pulses constituting the fixed codebook so as to acquire a maximum value of the correlation value for each pulse and calculates a maximum correlation value by using the maximum value of the correlation value; a sorting unit (222) divides the four pulses into two subsets each having two pulses; and a search unit (224) performs a division search on the fixed codebook and acquires a code indicating the positions and polarities of the four pulses where the encoding distortion is minimum.
Abstract:
An address generator for searching an algebraic codebook is disclosed. The address generator includes: a multiplier multiplying the dimension and a width value of a correlation matrix; a first adder adding a length value and an offset address of the correlation matrix; and a second adder adding the results of the multiplier and the first adder to generate an address for algebraic codebook searching. The amount of calculation required for an address calculation to search an algebraic codebook can be reduced.
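The arithmetic performed by the three units can be written out directly. This sketch only mirrors the data flow named in the abstract; the function name, parameter names, and sample values are hypothetical.

```python
def codebook_address(dimension, width, length, offset):
    """Combine the multiplier and the two adders described in the abstract."""
    product = dimension * width    # multiplier: dimension x width value
    base = length + offset         # first adder: length value + offset address
    return product + base          # second adder: sum of both results


print(codebook_address(dimension=3, width=40, length=7, offset=0x1000))  # → 4223
```

The multiplier and the first adder have no data dependency on each other, so in hardware they could run in parallel, leaving a single multiply and one adder stage on the critical path.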
Abstract:
Provided are a combined fixed codebook searching method and apparatus used in a code excited linear prediction (CELP) speech codec. The method includes searching for a fixed codebook using a full search method that searches for the fixed codebook at all pulse positions; selecting a fixed codebook searching method by counting the number of users who are accessing a gateway, comparing the number of users with a predetermined threshold, and selecting a proper fixed codebook searching method based on the result of the comparison; searching for the fixed codebook using the selected fixed codebook searching method; and checking whether the search for the fixed codebook is complete for all tracks of the CELP speech codec, terminating the fixed codebook search routine when the search is determined to be complete for all the tracks, and selecting a fixed codebook searching method again in consideration of the number of gateway users when a track remains to be searched. Accordingly, a fixed codebook searching method is selected in consideration of the number of users who are accessing a gateway, thereby enabling an effective adjustment of either the quality of sound or the channel capacity of the gateway.
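The per-track selection loop might look like the sketch below. The abstract does not say which method is chosen on which side of the threshold, so the mapping (heavier load selects a cheaper search) is an assumption, and the initial full-search step is omitted. `select_search_method`, `search_all_tracks`, `count_users`, and the `searchers` dictionary are hypothetical names.

```python
def select_search_method(num_users, threshold):
    """Pick a search method from the gateway load.

    Assumption: more users than the threshold selects the cheaper search.
    """
    return "reduced" if num_users > threshold else "full"


def search_all_tracks(num_tracks, count_users, threshold, searchers):
    """Re-select the method before each track, then search that track."""
    chosen = []
    for track in range(num_tracks):
        method = select_search_method(count_users(), threshold)
        searchers[method](track)       # run the selected search on this track
        chosen.append(method)
    return chosen                      # loop ends once every track is searched


# Hypothetical usage: the user count changes between tracks.
user_counts = iter([1, 10, 2, 8])
searchers = {"full": lambda track: None, "reduced": lambda track: None}
print(search_all_tracks(4, lambda: next(user_counts), 5, searchers))
```

Re-reading the user count before every track lets the codec react to load changes within a single sub-frame instead of fixing the method once per call.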
Abstract:
A wideband audio coding concept is presented that provides good audio quality at bit rates below 3 bits per sample with an algorithmic delay of less than 10 ms. The concept is based on the principle of Linear Predictive Coding (LPC) in an analysis-by-synthesis framework. A spherical codebook is used for quantisation at bit rates that are higher than those of low bit rate speech coding, for improved performance on audio signals. For superior audio quality, noise shaping is employed to mask the coding noise. In order to reduce the computational complexity of the encoder, the analysis-by-synthesis framework has been adapted for the spherical codebook to enable a very efficient excitation vector search procedure. Furthermore, auxiliary information gathered in advance is employed to significantly reduce the computational complexity of encoding and decoding at run time. This auxiliary information can be considered as the SCELP codebook. Because it exploits the characteristics of the apple-peeling-code construction principle, this codebook can be stored very efficiently in a read-only memory.
Abstract:
A method is provided for searching an excitation (or fixed) codebook in a speech coding system. In a speech coding system including a synthesis filter for synthesizing a speech signal, a fixed codebook searcher according to the present invention segments a speech signal frame into a plurality of subframes to generate an excitation signal to be used in the synthesis filter, segments each of the subframes again into a plurality of subgroups, and searches the respective subframes, each comprised of a plurality of pulse position/amplitude combinations for pulses. The fixed codebook searcher searches the respective subgroups for a predetermined number of pulses having non-zero amplitude, and generates the searched pulses as an initial vector. Next, the fixed codebook searcher selects a pulse combination including at least one pulse among the pulses of the initial vector, and then substitutes pulses of the selected pulse combination for pulses in other positions in the subgroups. The selection and the substitution are repeatedly performed on all the pulses of the initial vector.
Abstract:
A method for searching a codebook which predicts a residual element of an input voice signal includes combining each track of the input signal, forming track units including at least two tracks, and determining a pulse code for each track. The method further includes calculating energy for each track using an energy formula including a vector dot product, arranging or selecting codewords in a small track energy order, and searching or selecting an optimal pulse for a single- or double-pulse track of the selected codeword.