摘要:
The present invention advantageously provides a manner by which to further suppress noise superimposed upon an information signal without increasing distortion to the signal, e.g., speech. By distributing the noise suppression, the quality of the information signal provided to a listener is improved. In one embodiment, a first noise suppressor is employed at the transmitter to suppress noise superimposed upon an information signal prior to its transmission by the transmitter, and a second noise suppressor is employed at the receiver to suppress the noise component of a communication signal received at the receiver.
摘要:
Adaptive speech coding includes receiving an original speech signal, performing on the original speech signal a current coding operation, and adapting the current coding operation in response to information used in the current coding operation. Adaptive speech decoding includes receiving coded information, performing a current decoding operation on the coded information, and adapting the current decoding operation in response to information used in the current decoding operation.
摘要:
An apparatus and method for reducing sparseness in a coded speech signal. Sparse codebook values are generated from a codebook. An anti-sparseness operation is performed on the sparse codebook values to produce output codebook values having a greater density of non-zero values than the sparse codebook values. The output codebook values are processed by a speech processor to generate an encoded speech signal during an encoding operation or a decoded speech signal during a decoding operation.
摘要:
A source/channel encoding mode control method in a TDMA radio communication system determines the current type of source signal to be encoded and transmitted, restricts encoding to a class of source/channel encoding modes compatible with the determined type of source signal, determines a quality measure for previously transmitted signals that have been received and decoded, and selects based on the quality measure, the most suitable source/channel encoding mode for the determined class.
摘要:
Sparseness is reduced in an input digital signal which includes a first sequence of sample values. An output digital signal is produced in response to the input digital signal. The output digital signal includes a second sequence of sample values, which second sequence of sample values has a greater density of non-zero sample values than the first sequence of sample values.
摘要:
A decoder improves delayed packet concealment in a packet network by using two decoder sections. A first decoder section bases its decoding during the concealment phase on erroneous filter states and a set of speech parameters, whereas a second decoder section bases its decoding on saved and updated filter states and the same speech parameters. The outputs of the two decoder sections are thereafter combined to form the final speech signal. This decoding strategy produces a speech signal with smooth transitions from delayed to non-delayed packets and uses information from the most recent packets for speech generation.
摘要:
In producing from an original speech signal a plurality of parameters from which an approximation of the original speech signal can be reconstructed, a coded signal of the original speech signal is generated. At least one of the parameters is determined using first and second differences between the original speech signal and the coded signal. The first difference is a difference between a waveform associated with the original speech signal and a waveform associated with the coded signal, and the second difference is a difference between an energy parameter derived from the original speech signal and a corresponding energy parameter associated with the coded signal.
摘要:
A post-processing method for a speech decoder which outputs a decoded speech signal in the time domain provides high frequency resolution based on a frequency spectrum having non-harmonic and noise deficiencies. This is obtained by transforming the decoded time domain signal to a frequency domain signal by using a high frequency resolution transform (FFT). Then an analysis of the energy distribution of the frequency domain signal is made throughout its frequency area (4 kHz) to find the disturbing frequency components and to prioritize such frequency components which are situated in the higher part of the frequency spectrum. Next, the suppression degree for the disturbing frequency components is found based on prioritizing. Finally the steps of controlling a post-filtering of the transform in dependence of the finding, and inverse transforming the post-filtered transform in order to obtain a post-filtered decoded speech signal in the time domain are performed.
摘要:
A speech encoding/decoding apparatus. A speech encoding apparatus has a coding portion for receiving input information related to an uncoded signal representative of an original speech signal, the coding portion including a fixed coding portion for receiving the input information and producing a first coded signal estimate, and an adaptive coding portion for receiving the input information and producing a second coded signal estimate. A controller is connected to the fixed coding portion and the adaptive coding portion for receiving information indicative of speech characteristics of the uncoded signal and generates a control signal; and a code modifier receives the first coded signal estimate from the fixed coding portion and the control signal from the controller and produces a modified signal estimate.
摘要:
Perceptually relevant non-speech information can be preserved during encoding of an audio signal by determining whether the audio signal includes such information. If so, a speech/noise classification of the audio signal is overriden to prevent misclassification of the audio signal as noise.