摘要:
In a speech recognizer, for recognizing unknown utterances in isolated-word speech or continuous speech, improved recognition accuracy is obtained by augmenting the usual spectral representation of the unknown utterance with a dynamic component. A corresponding dynamic component is provided in the templates with which the spectral representation of the utterance is compared. In preferred embodiments, the representation is mel-based cepstral and the dynamic components comprise vector differences between pairs of primary cepstra. Preferably the time interval between each pair is about 50 milliseconds. It is also preferable to compute a dynamic perceptual loudness component along with the dynamic parameters.
摘要:
A receiver of the present invention addresses the need for improved interference suppression without the number of transmissions by the power control system being increased, and, to this end, provides a receiver for a CDMA communications system which employs interference subspace rejection to tune a substantially null response to interference components from selected signals of other user stations. Preferably, the receiver also tunes a substantially unity response for a propagation channel via which a corresponding user's signal was received. The receiver may be used in a base station or in a user/mobile station.
摘要:
In recent years, the telecommunications industry has witnessed the proliferation of a variety of digital vocoders in order to meet bandwidth demands of different wireline and wireless communication systems. The rapid growth in the diversity of networks and the number of users of such networks is increasing the number of instances where two vocoders are placed in tandem to serve a single connection. Such arrangements of low bit-rate codecs can degrade the quality of the transmitted speech. To overcome this problem the invention provides a novel method and an apparatus for transmitting digitized voice signals in the wireless communications environment. The apparatus is capable of converting a compressed speech signal from one format to another format via an intermediate common format, thus avoiding the necessity to successively de-compress voice data to a PCM type digitization and then recompress the voice data.
摘要:
An audio signal encoding device is provided including an input for receiving a sub-frame of an audio signal to be encoded, an adaptive codebook and a processing unit. The adaptive codebook stores at least one prior knowledge entry which includes a data element representative of characteristics of at least a portion of a previously generated audio signal sub-frame. The processing unit generates a set of parameters allowing for synthesization of the audio signal sub-frame received at the input on the basis of at least the sub-frame of the audio signal received at the input and the data element stored in the adaptive codebook. A corresponding decoding device for synthesizing an audio signal on the basis of a set of parameters is also provided.
摘要:
The invention relates to a linear prediction audio signal processing apparatus, such as a vocoder, including a nonlinear filter to attenuate the residual signal used to excite a linear prediction synthesis filter. The nonlinear filter is capable of reducing the noise component in the signal while keeping only the periodic component of the speech signal. This feature enhances speech quality. The invention also extends to a novel method for processing a residual signal used to excite a linear prediction synthesis filter in order to attenuate wide band additive noise in the speech signal as constructed by the synthesis filter.
摘要:
A code-excited linear prediction (CELP) coding method and code divide the residual signal into frequency bands. Codebooks provided for each band decrease in size with increasing band frequency. Reduction in codebook size with increasing frequency together with reduction in sampling rate with decreasing frequency provide reductions in codebook search complexity that allow real time implementation on digital signal processor chips.
摘要:
A dual tone multi-frequency (DTMF) receiver for use in detecting digitally transmitted signals in the telephone industry. The receiver derives linear predictive coefficients for the digital signals in the data frame. This information is used to compute frequency response magnitudes at the DTMF frequencies. A plurality of magnitude comparisons are then performed to verify the presence of true DTMF signals and concurrently the frequency of these signals is obtained.
摘要:
An audio signal encoding device is provided comprising an input for receiving a sub-frame of an audio signal, a voiced audio signal synthesis stage, an unvoiced audio signal synthesis stage, and a processing unit. The voiced audio signal synthesis stage is operative for producing a first synthetic audio signal approximating the sub-frame of an audio signal received at the input on the basis of a first set of parameters. The unvoiced audio signal synthesis stage is operative for producing a second synthetic audio signal approximating the sub-frame of an audio signal received at the input on the basis of a second set of parameters. The processing unit is operative for releasing a set of parameters allowing to generate a selected one of the first synthetic audio signal and the second synthetic audio signal.
摘要:
The invention relates to a linear prediction audio signal processing apparatus, such as a vocoder, including a nonlinear filter to attenuate the residual signal used to excite a linear prediction synthesis filter. The nonlinear filter is capable of reducing the noise component in the signal while keeping only the periodic component of the speech signal. This feature enhances speech quality. The invention also extends to a novel method for processing a residual signal used to excite a linear prediction synthesis filter in order to attenuate wide band additive noise in the speech signal as constructed by the synthesis filter.
摘要:
A synchronous discontinuous transmission medium access control (SDTX-MAC) method and apparatus for more efficiently using existing uplink channels by sharing these uplink channels between multiple terminals engaged in bursty data transmission. This is accomplished by assigning each mobile terminal an individual time slot and by not requiring each mobile terminal to broadcast its identity. This results in a reduction in the number of receivers on each base station and a reduction in the length of the synchronization message.