摘要:
An encoder comprising an input for inputting frames of an audio signal in a frequency band, at least a first excitation block for performing a first excitation for a speech like audio signal, and a second excitation block for performing a second excitation for a non-speech like audio signal. The encoder further comprises a filter for dividing the frequency band into a plurality of sub bands each having a narrower bandwidth than the frequency band. The encoder also comprises an excitation selection block for selecting one excitation block among the at least first excitation block and the second excitation block for performing the excitation for a frame of the audio signal on the basis of the properties of the audio signal at least at one of the sub bands. The invention also relates to a device, a system, a method and a storage medium for a computer program.
摘要:
A method of encoding speech in a communications system includes the steps of receiving a speech signal including voice signals and background signals, and detecting voice activity and providing an indicator when no voice activity is detected. The speech signal is encoded to generate a plurality of parameters representing the signal. When the indicator is not present, a first parametric representation of the speech signal is output, including the plurality of parameters. When the indicator is present, at least one of the plurality of parameters is modified and a second parametric representation of the speech signal, including the modified parameter is output.
摘要:
A method for use by a speech decoder in handling bad frames received over a communications channel a method in which the effects of bad frames are concealed by replacing the values of the spectral parameters of the bad frames (a bad frame being either a corrupted frame or a lost frame) with values based on an at least partly adaptive mean of recently received good frames, but in case of a corrupted frame (as opposed to a lost frame), using the bad frame itself if the bad frame meets a predetermined criterion. The aim of concealment is to find the most suitable parameters for the bad frame so that subjective quality of the synthesized speech is as high as possible.
摘要:
A system and method for locating a preferable playback start location after a winding or rewinding action in an audio playing device. In response to an adjustment of the playing location for audio content to a desired playing position, the system determines whether at least one non-speech or silent period of at least a predetermined duration exists within the vicinity of the desired playing position. If at least one such non-speech or silent period exists within the vicinity of the desired playing position, the system adjusts the playing position to fall within one of the at least one non-speech period or silent period.
摘要:
A method of encoding speech in a communications system includes the steps of receiving a speech signal including voice signals and background signals, and detecting voice activity and providing an indicator when no voice activity is detected. The speech signal is encoded to generate a plurality of parameters representing the signal. When the indicator is not present, a first parametric representation of the speech signal is output, including the plurality of parameters. When the indicator is present, at least one of the plurality of parameters is modified and a second parametric representation of the speech signal, including the modified parameter is output.
摘要:
A method and corresponding codec for (channel) encoding speech or other data bits for transmission via a wireless communication channel, the method providing unequal error protection (UEP) using only a single encoder, and including: a step of determining how many bits to puncture in each of typically two protection classes (CA CB) so as to achieve either a predetermined or iterated desired level of error protection; and a step of identifying which bits to puncture for each class so as to provide relatively strong and uniform protection for all bits in the first class (CA), but protection that decreases in the same manner as the subjective importance decreases from the beginning to the end of the other classes. The method also accounts for so-called soft puncturing by modulators transmitting multiple bits per symbol with weaker protection for some of the bits of each symbol.
摘要:
A method of transmitting a codeword over a transmission channel using a plurality of radio bursts. The codeword comprises a first sequence of time ordered protected bits and a second sequence of time ordered unprotected bits, and the radio bursts together provide a set of time ordered bit positions. Successive bits of said first sequence are allocated to the radio bursts in a cyclical manner so that adjacent protected bits are allocated to different radio bursts, while successive bits of said second sequence are allocated to remaining bit positions of the radio bursts in the time order of those remaining bit positions. The radio bursts are then transmitted sequentially on different frequency bands.
摘要:
A speech encoding or decoding arrangement (711, 721, 811, 821) comprises a speech signal input and a multiple mode speech encoder (402) or decoder (411) for encoding or decoding speech signals coupled to the speech signal input selectabily with a first encoding or decoding mode associated with a first bandwidth or a second encoding or decoding mode associated with a second bandwidth. It comprises a soft bandwidth switching block (401, 412, 500) with an input (IN) and an output (OUT). In an encoding arrangement the input (IN) is coupled to the speech signal input and the output (OUT) is coupled to the multiple mode speech encoder (402). In a decoding arrangement the input (IN) is coupled to the multiple mode speech decoder (411) and the output (OUT) is the output of the decoding arrangement. The soft bandwidth switching block (401, 412, 500) is arranged to gradually change the bandwidth of a speech signal coupled to the multiple mode speech encoder or decoder as a response to an instruction for changing speech signal bandwidth (421).
摘要:
An encoder comprising an input for inputting frames of an audio signal in a frequency band, at least a first excitation block for performing a first excitation for a speech like audio signal, and a second excitation block for performing a second excitation for a non-speech like audio signal. The encoder further comprises a filter for dividing the frequency band into a plurality of sub bands each having a narrower bandwidth than the frequency band. The encoder also comprises an excitation selection block for selecting one excitation block among the at least first excitation block and the second excitation block for performing the excitation for a frame of the audio signal on the basis of the properties of the audio signal at least at one of the sub bands. The invention also relates to a device, a system, a method and a storage medium for a computer program.
摘要:
A wireless telecommunications system comprises a mobile station (MS) and a network. The mobile station has a multi-rate speech encoder which produces an encoded speech signal which is transmitted to the network. The network has a multi-rate speech decoder which decodes the encoded speech signal to produce a decoded speech signal. The network also comprises a signal analyser which measures speech characteristics of the decoded speech signal to produce speech characteristics information and an up-link mode control unit which receives the speech characteristics information and produces a mode command. The mode command is transmitted by the network to the mobile station where it is used to control the speech encoding bit rate of the multi-rate speech encoder.