摘要:
A comfort noise block, that include a hangover period and comfort noise parameters, is transmitted in such a manner that it is not interrupted by other messages, such as FACCH messages. This is accomplished in a mobile station by a determination of whether any FACCH messages are required to be transmitted. If such FACCH messages exist, a further determination may be made as to which transmission can be made in the shortest time (i.e., the FACCH message or messages or the comfort noise parameters message), and this transmission is made first. In any event the comfort noise parameters block is transmitted without interruption. In a further embodiment of this invention the comfort noise parameters message is transmitted by being concatenated with another message, such as a neighbor channel measurement results message, so as to reduce overhead, conserve bandwidth, and reduce power consumption. An element of the comfort noise parameters message is a Random Excitation Spectral Control (RESC) information element, which is used in the decoder for improving the spectral content of the generated comfort noise so as to better match the background noise at the transmitter.
摘要:
Disclosed herein are methods and apparatus for improving the quality of synthesized speech that is transmitted through a channel that is susceptible to transmission errors. In a presently preferred embodiment of the invention a speech signal is assumed to be first encoded using a Linear Predictive Coding (LPC) technique prior to transmission. The parameters that describe the short-term spectral behavior of the speech signal are received and then applied to and processed by a non-linear median processing block only on an occurrence of a predetermined number of transmission errors in the received LPC speech signal. The median-processed short term speech parameters are subsequently employed, together with a received excitation signal, in a synthesis filter to synthesize a speech signal of improved quality over what would be obtained if the short term speech parameters were not median processed to compensate for the transmission errors.
摘要:
An improved method for generating comfort noise (CN) in a mobile terminal operating in a discontinuous transmission (DTX) mode. In one embodiment the invention provides an improved method for comfort noise generation, in which a random excitation is modified by a spectral control filter so that the frequency content of comfort noise and background noise become similar. In another embodiment the transmitter identifies speech coding parameters that are not representative of the actual background noise, and replaces the identified parameters with parameters having a median value. In this manner the non-representative parameters do not skew the result of an averaging operation.
摘要:
A communication architecture for delivery of grammar and speech related information such as text-to-speech (TTS) data to a speech recognition server operating with a wireless telecommunication system for use with automatic speech recognition and interactive voice-based applications. In the invention, a mobile client retrieves a Web page containing multi-modal content hosted on a origin server via WAP gateway. The content may include a grammar file and/or TTS strings embedded in the content or reference URL(s) pointing to their storage locations. The client then sends the grammar and/or TTS strings to a speech recognition server via a wireless packet streaming protocol channel. When URL(s) are received by the client and sent to the SRS, the grammar file and/or TTS strings are obtained via a high speed HTTP connection. The speech processing results and the synthesized speech are returned to the client over the established wireless UDP connection.
摘要:
This invention is related to tandem free operation (TFO) in mobile cellular systems. The present invention implements a tandem free operation by using a special feedback loop which makes the decoded parameters available, performs the comfort noise insertion and bad frame handling operations, produces the parameter quantisation indices corresponding to the output of these operations, and synchronises the speech encoders and the speech decoders in the transmission path from the uplink mobile station to the downlink mobile station. This functionality is realized by partly decoding and re-encoding the parameters and synchronising and resetting the quantiser prediction memories in a specific manner. A basic idea of the invention is, that during BFH and CNI processes, a re-encoding block produces models of encoded speech parameters from the BFH/CNI processed speech parameters. These models of encoded speech parameters are then transmitted to the receiving end. The present invention provides a solution to the problem created by predictive, more generally non-stateless encoders in TFO operation.
摘要:
The invention relates to a method for transmitting background noise information including a silence descriptor identifier and background noise parameters in a communication system in which the information to be transmitted is formed into data frames. The data frames are subjected to channel coding to form channel-coded frames. The channel-coded frames are interleaved to be transmitted in two or more data transmission frames, and information of two channel-coded frames is transmitted in each data transmission frame. A first silence descriptor frame is formed provided with the silence descriptor identifier. The first silence descriptor frame is subjected to channel coding to form a channel-coded silence descriptor frame. The channel-coded silence descriptor frame is transmitted in two or more data transmission frames, and at least one data transmission frame transmitting part of the channel-coded silence descriptor frame is also used to transmit at least the background noise parameters.
摘要:
An improved method for generating comfort noise (CN) in a mobile terminal operating in a discontinuous transmission (DTX) mode. In one embodiment the invention provides an improved method for comfort noise generation, in which a random excitation is modified by a spectral control filter so that the frequency content of comfort noise and background noise become similar. In another embodiment the transmitter identifies speech coding parameters that are not representative of the actual background noise, and replaces the identified parameters with parameters having a median value. In this manner the non-representative parameters do not skew the result of an averaging operation.
摘要:
A comfort noise block, that include a hangover period and comfort noise parameters, is transmitted in such a manner that it is not interrupted by other messages, such as FACCH messages. This is accomplished in a mobile station by a determination of whether any FACCH messages are required to be transmitted. If such FACCH messages exist, a further determination may be made as to which transmission can be made in the shortest time (i.e., the FACCH message or messages or the comfort noise parameters message), and this transmission is made first. In any event the comfort noise parameters block is transmitted without interruption. In a further embodiment of this invention the comfort noise parameters message is transmitted by being concatenated with another message, such as a neighbor channel measurement results message, so as to reduce overhead, conserve bandwidth, and reduce power consumption. An element of the comfort noise parameters message is a Random Excitation Spectral Control (RESC) information element, which is used in the decoder for improving the spectral content of the generated comfort noise so as to better match the background noise at the transmitter.
摘要:
A method to detect a presence of a hangover period in a speech decoder in a communication system that utilizes discontinuous transmission between a transmitter and a receiver. The discontinuous transmission is defined by temporally sequential frame periods of which some periods will contain a transmitted frame and some periods will contain no transmission. The discontinuous transmission is constituted by information transmission periods of at least one frame that contain information given by a user to the transmitter, and silence periods having a length of at least one frame period and containing other information than that given by the user. An irregularly occurring period (T) of at least one frame exists between an information transmission period and the following silence period, the irregularly occurring period forming a hangover period for determining the information relating to the silence period. In the receiver the number of frame periods is counted until a certain moment; the beginning of the silence period is detected; and, based on said counted number of frame periods and the beginning of the silence period, a determination is made whether or not there is a hangover period (T) between the information transmission period and the silence period.