摘要:
In order to achieve a speech encoding method and device of high quality, which are small in local occurrence of abnormal noise in decoded speech, the speech encoding method and device include: fixed excitation generating means 13 for generating a plurality of fixed excitations; a first distortion calculating portion 23 for calculating a distortion related to a waveform defined between a signal to be encoded which is obtained from the input speech and a synthetic vector which is obtained from the fixed excitation as a first distortion for each of the fixed excitations; a second distortion calculating portion 24 for calculating a second distortion different from the first distortion which is defined between the signal to be encoded and the synthetic vector determined from the fixed excitation for each of the fixed excitations; an evaluation value calculating portion 29 for calculating a given evaluation value for search by using the first distortion and the second distortion for each of the vectors; and searching means 20 for selecting the fixed excitation that minimizes the evaluation value for search and outputting a code which is associated with the selected fixed excitation in advance.
摘要:
The present invention comprises: first periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a first periodicity emphasis coefficient adaptively determined based on a predetermined rule; and second periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a predetermined second periodicity emphasis coefficient.
摘要:
A method and an apparatus for processing a sound signal are provided, which process an input sound signal including degraded sound such as quantization noise so as to make the degraded sound subjectively unperceptible. A transformation strength controller calculates a spectrum of a decoded speech after perceptually weighting the decoded speech as the input sound signal, and calculates transformation strength based on the extent of the amplitude and the continuity of the spectrum. A signal transformer obtains a spectrum of the decoded speech, smoothes the amplitude and disturbs the phase based on the transformation strength, and the obtained signal is returned back to a signal region as a transformed decoded speech. A signal evaluator obtains background noise likeness by analyzing the decoded speech and the obtained value is made to be an addition control value. In the weighted value adder, when the addition control value appears to be the background noise likeness, the weight for adding to the decoded speech is reduced, the weight for adding to the transformed decoded speech is increased, and an output speech is obtained.
摘要:
Drive sound source coding means, decoding means has a plurality of algebraic sound source coding means, decoding means having sound source position tables different in distribution lean of sound source position candidates in a frame, each algebraic sound source coding means, decoding means for referencing spectrum envelope information and coding the sound source of an input voice based on a sound source position selected from among the sound source position candidates in the sound source position table and a polarity and selection means for selecting the algebraic sound source coding means, decoding means with the smallest coding distortion from among the plurality of algebraic sound source coding means, decoding means and outputting code representing the drive sound source position output by the selected algebraic sound source coding means, and polarity.
摘要:
A signal encoding system A1 includes a bark spectrum calculating device 2 for calculating a bark spectrum as a parameter based on an auditory model, a bark spectrum encoding device 3 for encoding the bark spectrum, a sound source calculating device 4 and a sound source encoding device 5. The bark spectrum calculating device 2 includes a power spectrum calculating device 6, a critical band integrating device 7, an equal loudness compensating device 8 and a loudness converting device 9. These devices are formed by engineering the functions and effects which are similar to those of the auditory model. The decoding process perform the conversion in the opposite direction. As a result, the signals can be encoded and decoded through less calculation in a manner well matching the human auditory characteristics. When speech signals are to be encoded, it can be realized through less calculation and memory while suppressing noise components other than the speech signal.
摘要:
Filters 106-1-106-N divide an input signal 100 into N band-limited signals 107-1-107-N, and multipliers 108-1-108-N carry out dynamic range control of the N band-limited signals 107-1-107-N, respectively. After that, filters 111-1-111-N eliminate odd harmonics caused by the dynamic range control, and a signal synthesis unit 113 combines the signals passing through the filters 111-1-111-N into a single output signal 114.
摘要:
A noise suppression device includes: a power spectrum calculator converting an input signal of time domain into power spectra of frequency domain; a voice/noise determination unit determining whether the power spectra indicate voice or noise; a noise spectrum estimation unit estimating noise spectra of the power spectra; a period component estimation unit analyzing a harmonic structure constituting the power spectra and estimating periodical information about the power spectra; a weighting coefficient calculator calculating a weighting coefficient for weighting the power spectra; a suppression coefficient calculator calculating a suppression coefficient for suppressing noise included in the power spectra; a spectrum suppression unit suppressing amplitude of the power spectra in accordance with the suppression coefficient; and an inverse Fourier transformer converting the power spectra output by the spectrum suppression unit into a signal of time domain to generate a noise-suppressed signal.
摘要:
A noise suppression device includes: a power spectrum calculator converting an input signal of time domain into power spectra of frequency domain; a voice/noise determination unit determining whether the power spectra indicate voice or noise; a noise spectrum estimation unit estimating noise spectra of the power spectra; a period component estimation unit analyzing a harmonic structure constituting the power spectra and estimating periodical information about the power spectra; a weighting coefficient calculator calculating a weighting coefficient for weighting the power spectra; a suppression coefficient calculator calculating a suppression coefficient for suppressing noise included in the power spectra; a spectrum suppression unit suppressing amplitude of the power spectra in accordance with the suppression coefficient; and an inverse Fourier transformer converting the power spectra output by the spectrum suppression unit into a signal of time domain to generate a noise-suppressed signal.
摘要:
A sound encoder multiplexes a plurality of codes into a sound code in an order determined by a multiplexing order determination unit (12), and a sound decoder demultiplexes the sound code into a plurality of codes one by one in an order determined by a demultiplexing order determination unit (14).
摘要:
A speech encoding apparatus calculates encoding distortion of a noise-like fixed code vector and multiplies the encoding distortion by a fixed weight corresponding to the noise-like degree of the noise-like fixed code vector, calculates encoding distortion of a non-noise-like fixed code vector and multiplies the encoding distortion by a fixed weight corresponding to the non-noise-like fixed code vector, and selects the fixed excitation code associated with multiplication result with a smaller value.