摘要:
A method for speech signal processing is provided. Energy attenuation gain values are set for background noise signals corresponding to obtained background noise frames subsequent to an erasure concealment frame, so that differences between the energy attenuation gain values of the background noise signals corresponding to the background noise frames and the energy attenuation gain values of signals corresponding to their respective previous frames are within a threshold range. Energy attenuation of the background noise signals corresponding to the background noise frames is controlled by using the energy attenuation gain values. An apparatus for speech signal processing is also provided in embodiments of the present invention. By using the embodiments of the present invention, the energy transition between the area of erasure concealment signal and the area of background noise signal may be made natural and smooth, so as to improve the audio comfortable sensation of the listener.
摘要:
An encoding method includes extracting background noise characteristic parameters within a hangover period; for a first superframe after the hangover period, performing background noise encoding based on the extracted background noise characteristic parameters; for superframes after the first superframe, performing background noise characteristic parameter extraction and DTX decision for each frame in the superframes after the first superframe; and for the superframes after the first superframe, performing background noise encoding based on extracted background noise characteristic parameters of the current superframe, background noise characteristic parameters of a plurality of superframes previous to the current superframe, and a final DTX decision. Also, a decoding method and apparatus and an encoding apparatus are disclosed.
摘要:
An encoding method includes extracting background noise characteristic parameters within a hangover period, for a first superframe after the hangover period, performing background noise encoding based on the extracted background noise characteristic parameters, for superframes after the first superframe, performing background noise characteristic parameter extraction and DTX decision for each frame in the superframes after the first superframe, and for the superframes after the first superframe, performing background noise encoding based on extracted background noise characteristic parameters of the current superframe, background noise characteristic parameters of a plurality of superframes previous to the current superframe, and a final DTX decision. Also, a decoding method and apparatus and an encoding apparatus are disclosed. Bandwidth occupancy may be reduced substantially while the signal quality may be guaranteed.
摘要:
An encoding method includes extracting background noise characteristic parameters within a hangover period; for a first superframe after the hangover period, performing background noise encoding based on the extracted background noise characteristic parameters; for superframes after the first superframe, performing background noise characteristic parameter extraction and DTX decision for each frame in the superframes after the first superframe; and for the superframes after the first superframe, performing background noise encoding based on extracted background noise characteristic parameters of the current superframe, background noise characteristic parameters of a plurality of superframes previous to the current superframe, and a final DTX decision. Also, a decoding method and apparatus and an encoding apparatus are disclosed.
摘要:
A method and apparatus for generating an excitation signal for background noise are provided. The method includes: generating a quasi excitation signal by utilizing coding parameters in a speech coding/decoding stage and a transition length of an excitation signal; and obtaining the excitation signal for background noise in a transition stage by generating a weighted sum of the quasi excitation signal and a random excitation signal of a background noise frame. Moreover, the apparatus includes: a quasi excitation signal generation unit and a transition stage excitation signal acquisition unit. Through the synthesizing scheme of comfortable background noise according to the present invention, the transition of a synthesized signal from speech to background noise could be more natural, smooth and continuous, which makes the listeners feel more comfortable.
摘要:
An encoding method includes extracting background noise characteristic parameters within a hangover period, for a first superframe after the hangover period, performing background noise encoding based on the extracted background noise characteristic parameters, for superframes after the first superframe, performing background noise characteristic parameter extraction and DTX decision for each frame in the superframes after the first superframe, and for the superframes after the first superframe, performing background noise encoding based on extracted background noise characteristic parameters of the current superframe, background noise characteristic parameters of a plurality of superframes previous to the current superframe, and a final DTX decision. Also, a decoding method and apparatus and an encoding apparatus are disclosed. Bandwidth occupancy may be reduced substantially while the signal quality may be guaranteed.
摘要:
The invention discloses a multi-stage quantization method, which includes the following steps: obtaining a reference codebook according to a previous stage codebook; obtaining a current stage codebook according to the reference codebook and a scaling factor; and quantizing an input vector by using the current stage codebook. The invention also discloses a multi-stage quantization device. With the invention, the current stage codebook may be obtained according to the previous stage codebook, by using the correlation between the current stage codebook and the previous stage codebook. As a result, it does not require an independent codebook space for the current stage codebook, which saves the storage space and improves the resource usage efficiency.
摘要:
A DTX decision method includes: obtaining sub-band signal(s) according to an input signal; obtaining a variation of characteristic information of each of the sub-band signals; and performing DTX decision according to the variation of the characteristic information of each of the sub-band signals. With the invention, a complete and appreciate DTX decision result is obtained by making full use of the noise characteristic in the speech encoding/decoding bandwidth and using band-splitting and layered processing. As a result, the SID encoding/CNG decoding may closely follow the characteristic variation of the actual noise.
摘要:
A DTX decision method includes: obtaining sub-band signal(s) according to an input signal; obtaining a variation of characteristic information of each of the sub-band signals; and performing DTX decision according to the variation of the characteristic information of each of the sub-band signals. With the invention, a complete and appreciate DTX decision result is obtained by making full use of the noise characteristic in the speech encoding/decoding bandwidth and using band-splitting and layered processing. As a result, the SID encoding/CNG decoding may closely follow the characteristic variation of the actual noise.
摘要:
A method and an apparatus for generating comfortable noises so as to improve user experience are disclosed. The method includes: if a received data frame is a noise frame, calculating a corresponding energy attenuation parameter based on the noise frame and a data frame received earlier than the noise frame; and attenuating noise energy based on the energy attenuation parameter to obtain a comfortable noise signal. An apparatus for generating comfortable noise is also provided.