摘要:
An audio signal processing system includes parallel speech and generic audio signal processing paths. One path includes a linear predictive coder and a resampling filter having a non-linear phase characteristic. A phase compensation filter is disposed along the one of the processing paths to compensate for the non-linearity of the resampling filter thereby enabling relatively seamless switching between the coders resulting in a reduction of audio artifacts that would otherwise result from the non-linear phase characteristic of the resampling filter during playback.
摘要:
An audio encoding method and apparatus and an audio decoding method and apparatus are provided. The audio signal decoding method includes extracting a downmix signal and object-based side information from an audio signal; generating a modified downmix signal based on the downmix signal and extracted information which is extracted from the object-based side information; generating channel-based side information based on the object-based side information and control data for rendering the downmix signal; and generating a multi-channel audio signal based on the modified downmix signal and the channel-based side information.
摘要:
An apparatus for processing an audio signal and method thereof are disclosed. The apparatus comprises an information receiving unit receiving a downmix signal including and a plurality of preset informations; an external preset information receiving unit receiving a plurality of external preset informations being inputted from external; an external preset applying-determining unit determining whether the plurality of the external preset informations applies to the downmix signal; an external preset information selecting unit selecting one external preset information, if the plurality of external preset informations is selected; and a rendering unit controlling the object by applying the external preset information to the all data regions.Accordingly, an audio signal can be efficiently reconstructed by individually selecting and applying external preset information by a data region unit or by selecting and applying the same external preset information to a whole downmix signal. And, feedback information can be received from a user by displaying an object adjusted by having external preset rendering parameter applied thereto on a screen, according to a characteristic of an audio source.
摘要:
A codec supporting switching between time-domain aliasing cancellation transform coding mode and time-domain coding mode is made less liable to frame loss by adding a further syntax portion to the frames, depending on which the parser of the decoder may select between a first action of expecting the current frame to have, and thus reading forward aliasing cancellation data from the current frame and a second action of not-expecting the current frame to have, and thus not reading forward aliasing cancellation data from the current frame. In other words, while a bit of coding efficiency is lost due to the provision of the new syntax portion, it is merely the new syntax portion which provides for the ability to use the codec in case of a communication channel with frame loss. Without the new syntax portion, the decoder would not be capable of decoding any data stream portion after a loss and will crash in trying to resume parsing. Thus, in an error prone environment, the coding efficiency is prevented from vanishing by the introduction of the new syntax portion.
摘要:
An encoder comprising an input for inputting frames of an audio signal in a frequency band, at least a first excitation block for performing a first excitation for a speech like audio signal, and a second excitation block for performing a second excitation for a non-speech like audio signal. The encoder further comprises a filter for dividing the frequency band into a plurality of sub bands each having a narrower bandwidth than the frequency band. The encoder also comprises an excitation selection block for selecting one excitation block among the at least first excitation block and the second excitation block for performing the excitation for a frame of the audio signal on the basis of the properties of the audio signal at least at one of the sub bands. The invention also relates to a device, a system, a method and a storage medium for a computer program.
摘要:
Provided are an apparatus and method for coding and decoding a multi-object audio signal. The apparatus includes a down-mixer for down-mixing the audio signals into one down-mixed audio signal and extracting supplementary information including header information and spatial cue information for each of the audio signals, a coder for coding the down-mixed audio signal, and a supplementary information coder for generating the supplementary information as a bit stream. The header information includes identification information for each of the audio signals and channel information for the audio signals.
摘要:
A method for encoding audio frames by producing a first frame of coded audio samples by coding a first audio frame in a sequence of frames, producing at least a portion of a second frame of coded audio samples by coding at least a portion of a second audio frame in the sequence of frames, and producing parameters for generating audio gap filler samples, wherein the parameters are representative of either a weighted segment of the first frame of coded audio samples or a weighted segment of the portion of the second frame of coded audio samples.
摘要:
An encoder and decoder for processing an audio signal including generic audio and speech frames are provided herein. During operation, two encoders are utilized by the speech coder, and two decoders are utilized by the speech decoder. The two encoders and decoders are utilized to process speech and non-speech (generic audio) respectively. During a transition between generic audio and speech, parameters that are needed by the speech decoder for decoding frame of speech are generated by processing the preceding generic audio (non-speech) frame for the necessary parameters. Because necessary parameters are obtained by the speech coder/decoder, the discontinuities associated with prior-art techniques are reduced when transitioning between generic audio frames and speech frames.
摘要:
In an embodiment, bitstream elements of sub-frames are encoded differentially to a global gain value so that a change of the global gain value results in an adjustment of an output level of the decoded representation of the audio content. Concurrently, the differential coding saves bits. Even further, the differential coding enables the lowering of the burden of globally adjusting the gain of an encoded bitstream. In another embodiment, a global gain control across CELP coded frames and transform coded frames is achieved by co-controlling the gain of the codebook excitation of the CELP codec, along with a level of the transform or inverse transform of the transform coded frames. In another embodiment, the gain value determination in CELP coding is performed in the weighted domain of the excitation signal.
摘要:
A method and apparatus for signal processing which enable data compression and recovery with high transmission efficiency are disclosed. Data coding and entropy coding are performed with correlation and grouping is used to increase coding efficiency. A method for signal processing according to this invention, the method includes decapsulating the signal received over an Internet protocol network, obtaining a pilot reference value corresponding to a plurality of data and a pilot difference value corresponding the pilot reference value from the decapsulated signal and obtaining the data using the pilot reference value and the pilot difference value.