摘要:
Provided are a vector quantization device, a voice coding device, a vector quantization method, and a voice coding method which enable a reduction in the calculation amount of voice codec without deterioration of voice quality. In the vector quantization device, a first reference vector calculation unit (201) calculates a first reference vector by multiplying a target vector (x) by an auditory weighting LPC synthesis filter (H), and a second reference vector calculation unit (202) calculates a second reference vector by multiplying an element of the first reference vector by a filter having a high pass characteristic. A polarity preliminary selection unit (205) generates a polar vector by disposing a unit pulse having a positive or negative polarity, which is selected on the basis of the polarity of an element of the second reference vector, in the position of said element.
摘要:
A method (300) and apparatus (100) generate a candidate code-vector to code an information signal. The method can include producing (310) a target vector from a received input signal. The method can include constructing (320) a plurality of inverse weighting functions based on the target vector. The method can include evaluating (330) an error value associated with each of the plurality of inverse weighting functions to produce a fixed codebook code-vector. The method can include generating (340) a codeword representative of the fixed codebook code-vector, where the codeword can be used by a decoder to generate an approximation of the input signal.
摘要:
A method and apparatus to search a codebook including pulses that model a predetermined component of a speech signal. The method includes the operations of selecting a predetermined number of paths corresponding to a predetermined number of pulse locations that are most consistent with the predetermined component, from among paths corresponding to pulse locations of a predetermined pulse location set allocated to at least one branch that connects one state of a predetermined Trellis structure to another state, performing the path selecting operation on each of states other than the one state, and selecting a path corresponding to pulse locations that are most consistent with the predetermined component from among paths including the selected paths, wherein each path corresponds to a union of plural tracks of an algebraic codebook. Accordingly, a number of calculations required during a codebook search is reduced.
摘要:
A fixed code book (FCB) search device simplifies an error minimizing process and reduces a calculation amount so as to prevent deterioration of a coding performance. The FCB search device includes a pulse shape convolution inverse filter having an inverse feature of a pulse diffusion filter and supplied with an ideal residual signal; a pulse candidate preparatory selector that pre-selects a plurality of pulse candidates from the ideal residual signal to which the inverse filter is applied; and a pulse candidate final selector that finally selects one pulse from the selected candidates. Using this configuration, a search is made for an algebra code book to which the pulse diffusion is applied.
摘要:
A CELP speech decoder includes an adaptive codebook that generates an adaptive code vector and a random codebook that generates a random code vector. The random codebook includes an input vector provider that provides an input vector including at least one pulse, each pulse having a position and a polarity, a fixed waveform storage that stores at least one fixed waveform, and a selector that selects at least one of a first process and a second process based on a value of an adaptive codebook gain. The random codebook further includes a convolution section that generates the random code vector by convoluting the at least one fixed waveform with the input vector when the first process is selected. A synthesis filter outputs synthesized speech by performing linear prediction coefficient synthesis on a signal based on the adaptive code vector and the random code vector.
摘要:
A fixed code book (FCB) search device simplifies an error minimizing process and reduces a calculation amount so as to prevent deterioration of a coding performance. The FCB search device (100) includes: a pulse shape convolution inverse filter (104) having an inverse feature of a pulse diffusion filter and supplied with an ideal residual signal; a pulse candidate preparatory selection unit (105) for pre-selecting a plurality of pulse candidates from the ideal residual signal to which the inverse filter is applied; and a pulse candidate final selection unit (109) for finally selecting one pulse from the selected candidates. By using this configuration, search is made for an algebra code book to which the pulse diffusion is applied.
摘要:
A speech coder includes a seed storage that stores a plurality of seeds used as an initial state of oscillation. An oscillator generates different vector sequences in accordance with values of the seeds stored in the seed storage and outputs the vector sequences as excitation vectors. A linear predictive coding synthesis filter receives, as input, the excitation vectors synthesizes the excitation vectors, and outputs a synthesized speech. The seed storage stores the plurality of seeds prepared in advance as the initial state of oscillation such that the vector sequences generated in the oscillator serve as effective excitation vectors from which the synthesized speech can be generated when the vector sequences are input to the linear predictive coding synthesis filter the oscillator receives, the seeds from the seed storage, generates, using the input seeds, vector sequences that serve as the effective excitation vectors from which the synthesized speech can be generated in the linear predictive coding synthesis filter, and outputs the vector sequences.
摘要:
An excitation vector generator includes an input vector providing system that is capable of providing an input vector having at least one pulse, each pulse having a predetermined position and a respective polarity. A fixed waveform storage system is capable of storing at least one fixed waveform. An arranging system is capable of arranging the at least one fixed waveform in accordance with the position and the polarity of the at least one pulse.
摘要:
A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.
摘要:
A noise canceller removes a noise component from an input speech signal. The noise canceller includes a noise cancellation coefficient adjuster that adjusts a noise cancellation coefficient to determine an amount of noise cancellation. A noise spectrum storage device stores an estimated noise spectrum. A noise estimator estimates a noise spectrum by comparing an input spectrum with a noise spectrum stored in the noise spectrum storage device. A noise canceling/spectrum compensator subtracts the noise spectrum stored in the noise spectrum storage device from the input spectrum based on a coefficient acquired by the noise cancellation coefficient adjuster.