摘要:
Speech is data-compressed and coded for transmission using TDHC (Time Domain Harmonic Compression) for the voiced signal, and decimated sampling for the unvoiced signal. Features include voiced/unvoiced detection, pitch period detection, border detection, coding by ADPCM, and optimum quantization coding. Reception involves decoding and data-reconstruction.
摘要:
Several encoders perform a local decoding of a speech signal and extract excitation information and vocal tract information from a speech signal for an encoding operation. The transmission rate ratio between the excitation information and the vocal tract information are different for each encoder. An evaluation/selection unit evaluates the quality of decoded signals subjected to a local decoding in each of the encoders, determines the most suitable encoders from among the several encoders based on the result of the evaluation, and selects the most suitable encoder, thereby outputting the selection result as selection information. The decoder decodes a speech signal based on selection information, vocal tract information and excitation information. The evaluation/selection unit selects the output from the encoder in which the quality of a locally decoded signal is the most preferable. When vocal tract information changes little, the vocal tract information is not output, thereby allowing for increased quality of information. As much of the surplus of unused vocal tract information as possible is assigned to a residual signal. Thus, the quality of a decoded speech signal is improved.
摘要:
Coding transmission equipment wherein an plurality of adaptive coding units having different processing characteristics are provided, and the adaptive coding unit having the optimum processing characteristic for a current input signal is selected from among the plurality of adaptive coding units, and an output signal from the optimum adaptive coding unit and the unit number thereof are transmitted to the receiver side. At the receiver side, the processing characteristic of a decoder is changed to become the optimum processing characteristic indicated by the received unit number, whereby a high quality recovered signal is realized.
摘要:
Disclosed is a voice coding/decoding system having a transmitting part for transmitting a coded signal of an input voice signal at a bit rate lower than a predetermined transmission bit rate and a receiving part for receiving and decoding the coded signal transmitted from the transmission part. To enable the coding and transmitting of an input voice signal in an optimum state without passing through a buffer memory and without having a negative influence on the coder, the transmitting part provides coders for coding the input voice signal and groups of entropy coders. The inputs of the entropy coders in each group are connected to the output of one of the plurality of coders. The transmitting part further provides an evaluation part for evaluating the characteristics of the outputs of the coders and the entropy coders. The evaluation part extracts those entropy coders having output bit rates lower than the transmission bit rate and extracts, from the coders connected to the extracted entropy coders, a coder having the best output characteristic. Then, the evaluation part outputs a selecting signal indicating the combination of the selected coder and an entropy coder from the extracted entropy coders. The transmitting part further provides a selecting part for selecting, in response to the selecting signal, the codeword passed through the combination of the coder and the entropy coder to be transmitted.
摘要:
A code excited linear prediction (CELP) type speech signal coding system is provided, a code vector obtained by applying linear prediction to a vector of a residual speech signal of white noise is stored in a code book. A pitch prediction vector obtained by applying linear prediction to a residual signal of a preceding frame is given a delay corresponding to a pitch frequency and added to the code vector. Use is made of an impulse vector obtained by applying linear prediction to a residual signal vector of impulses having a predetermined relationship with the vectors of the white noise code book. Variable gains are given to at least the above code vector and impulse vector, a reproduced signal is produced, and this reproduced signal is used for identification of the input speech signal. Thus, a pulse series corresponding to the sound source of voiced speech sounds is created.
摘要:
A gain-shape vector quantization apparatus is provided for encoding and decoding, to transmit and receive compressed speech signals. A selected plurality of vectors are read from a code book based upon an index signal. The vectors are added in an adder and synthesis filtered by a synthesis filter, in either order, to produce an output. This output is subtracted from an input speech signal to produce an error signal. An evaluation unit produces an index to select the plurality of vectors read from the code book memory based on the error signal in order to minimize this error signal. The evaluation unit produces gain adjusting signals which can be used to adjust gains of the vectors read from the code book. In an encoder, signals indicative of the gain adjusting signal and the index signal are transmitted by a transmitter of the encoder to send a quantized speech signal to a receiver of a decoder. In the decoder, after the signals indicative of the gain adjusting signal and the index are received by the receiver of the decoder, an index and gain adjusting signal is derived for use to control reading of vectors from a code book and gains thereon to reproduce the speech signal.
摘要:
Pitch periods for a long term predictor included in a speech codec are searched in two searching stages. In the first searching stage, probable pitch periods are searched skipping a constant number of pitch periods, and in the second searching stage, pitch periods including the pitch period determined in the first searching stage and pitch periods neighboring the pitch period on both sides are searched.
摘要:
A speech coding apparatus coupled to a transmission channel includes m (m is an integer greater than 1) coders, m decoders and m or (m-1) error correcting coders. The apparatus also includes an evaluation unit which evaluates a quality of each of reproduced speech signals from the input speech signal and the reproduced speech signals and which outputs an evaluated quality of each of the reproduced speech signals. The quality of each of the reproduced speech signals is evaluated in a state having no transmission error. A decision unit identifies one of the m coders which provides the reproduced speech signal having a smallest distortion on the basis of the evaluated quality of each of the reproduced speech signals, a current error rate of the transmission channel and error correcting abilities of the error correcting coders, and generates a coder identification number representative of a selected one of the m coders. An output part outputs a multiplexed transmission signal including the coded speech signal generated by the one of the m coders identified by the decision unit and the error correcting code generated by a corresponding one of the m error correcting coders.
摘要:
A speech coding apparatus includes multipliers and prediction filters which successively process a plurality of signal vectors obtained from an index 2.sup.M and dimension N code book to obtain a reproduced speech signal. Error detectors are provided which find the error between the input speech signal and reproduced speech signal. Evaluators are also provided which calculate the optimum signal vectors giving the smallest errors. The multipliers are connected to a reduced code book, which is constituted of n number of code book blocks of index 2.sup.M/n and dimension N/n (where n is an integer of two or more). There are n number of multipliers, n number of prediction filters, n number of error detectors, and n number of evaluators corresponding to the code book blocks.
摘要:
A speech coding apparatus which selects an optimum code from a code book, the optimum code giving the minimum magnitude of error signal between the input signal and the reproduced signal obtained by a filter calculation using a linear prediction parameter from a linear predictive analysis unit with respect to the codes of the code book, wherein the code book is formed by thinning to 1/M (M being an integer of two or more) the plurality of sampling values constituting the codes. To compensate for the deterioration of the quality of the reproduced signal caused by thinning the sampling values in this way, an additional linear predictive analysis unit is further introduced and use made of an amended linear prediction parameter instead of the linear prediction parameter from the originally provided linear predictive analysis unit.