Abstract:
A transient detector (100) analyzes (110) a given frame n of the input audio signal to determine, based on audio signal characteristics of the given frame n, a transient hangover indicator for a following frame n+1, and signals (120) the determined transient hangover indicator to an associated audio encoder (10) to enable proper encoding of the following frame n+1.
Abstract:
A network processing node (e.g., MGW, MRFP) and method are described herein that can: (1) receive packets on a first heterogeneous link (e.g., wireless link); (2) manipulate the received packets based on known characteristics about a second heterogeneous link (e.g., “Internet” link); and (3) send the manipulated packets on the second heterogeneous link (e.g., “Internet” link). For example, the network processing node can manipulate the received packets by adding redundancy, removing redundancy, frame aggregating (re-packetizing), recovering lost packets and/or re-transmitting packets.
Abstract:
The invention provides an efficient technique for encoding a multi-channel audio signal. The invention relies on the principle of encoding (S1) a signal representation of one or more of the multiple channels in a first encoding process, and encoding another signal representation of one or more channels in a second, filter-based encoding process. A basic idea according to the invention is to select (S2), for the second encoding process, a combination of i) frame division configuration of an overall encoding frame into a set of sub-frames, and ii) filter length for each sub-frame, according to a predetermined criterion. The second signal representation is then encoded (S3) in each sub-frame of the overall encoding frame according to the selected combination. The possibility to select frame division configuration and at the same time adjust the filter length for each sub-frame provides added degrees of freedom, and generally results in improved performance.
Abstract:
A method for audio coding and decoding comprises primary encoding of a present audio signal sample into an encoded representation (T(n)), and non-causal encoding of a first previous audio signal sample into an encoded enhancement representation (ET(n−N+)). The method further comprises providing of the encoded representations to an end user. At the end user, the method comprises primary decoding of the encoded representation (T*(n)) into a present received audio signal sample, and non-causal decoding of the encoded enhancement representation (ET*(n−N+)) into an enhancement first previous received audio signal sample. The method further comprises improving of a first previous received audio signal sample, corresponding to the first previous audio signal sample, based on the enhancement first previous received audio signal sample. Devices and systems for audio coding and decoding are also presented.
Abstract:
The invention provides a highly efficient technique for encoding a multi-channel audio signal. The invention relies on the basic principle of encoding a first signal representation of one or more of the multiple channels in a first encoder (130) and encoding a second signal representation of one or more of the multiple channels in a second, multi-stage, encoder (140). This procedure is significantly enhanced by providing a controller (150) for adaptively allocating a number of encoding bits among the different encoding stages of the second, multi-stage, encoder (140) in dependence on multi-channel audio signal characteristics.
Abstract:
A vector quantizer includes a lattice quantizer (10) approximating a vector x by a lattice vector belonging to a lattice Λ0. A lattice vector decomposer (14) connected to the lattice quantizer successively decomposes the lattice vector into a sequence of quotient vectors y, and a sequence of remainder vectors ri on successive lattices ΛI−1 by lattice division with a corresponding predetermined sequence of integers pi≧2, where i=1 . . . k and k is a positive integer representing the number of elements in each sequence.
Abstract:
In signal processing using digital filtering the representation of a filter is adapted depending on the filter characteristics If e.g. a digital filter is represented by filter coefficients for transform bands Nos. 0, 1, . . . , K in the frequency domain, a reduced digital filter having coefficients for combined transform bands, i.e. subsets of the transformed bands, Nos. 0, 1, . . . , L, is formed and only these coefficients are stored. When the actual filtering in the digital filter is to be performed, an actual digital filter is obtained by expanding the coefficients of the reduced digital filter according to a mapping table and then used instead of the original digital filter.
Abstract:
An overall encoding procedure and associated decoding procedure are presented. The encoding procedure involves at least two signal encoding processes operating on signal representations of a set of audio input channels. Local synthesis is used in connection with a first encoding process to generate a locally decoded signal, including a representation of the encoding error of the first encoding process. This locally decoded signal is applied as input to a second encoding process. The overall encoding procedure generates at least two residual encoding error signals from at least one of said encoding processes, including at least said second encoding process. The residual error signals are then subjected to compound residual encoding in a further encoding process, preferably based on correlation between the residual error signals.
Abstract:
Information about excitation signals of a first signal encoded by CELP is used to derive a limited set of candidate excitation signals for a second correlated second signal. Preferably, pulse locations of the excitation signals of the first encoded signal are used for determining the set of candidate excitation signals. More preferably, the pulse locations of the set of candidate excitation signals are positioned in the vicinity of the pulse locations of the excitation signals of the first encoded signal. The first and second signals may be multi-channel signals of a common speech or audio signal. However, the first and second signals may also be identical, whereby the coding of the second signal can be utilized for re-encoding at a lower bit rate.
Abstract:
The invention concerns a frequency-domain error concealment technique for information that is represented, on a frame-by-frame basis, by coding coefficients. The basic idea is to conceal an erroneous coding coefficient by exploiting coding coefficient correlation in both time and frequency. The technique is applicable to any information, such as audio, video and image data, that is compressed into coding coefficients and transmitted under adverse channel conditions. The error concealment technique proposed by the invention has the clear advantage of exploiting the redundancy of the original information signal in time as well as frequency. For example, this offers the possibility to exploit redundancy between frames (inter-frame) as well as within frames (intra-frame). The use of coding coefficients from the same frame as the erroneous coding coefficient is sometimes referred to as intra-frame coefficient correlation and it is a special case of the more general frequency correlation.