Abstract:
Methods and apparatus for quickly selecting an optimal excitation waveform from a codebook are presented herein. In encoding schemes that use forward and backward pitch enhancement, storage requirements and processor load are reduced by approximating a two-dimensional autocorrelation matrix with a one-dimensional autocorrelation vector. The approximation is possible when a cross-correlation element is configured to determine the autocorrelation matrix of an impulse response and a pulse energy determination element is configured to determine the energy of a pulse code vector that incorporates secondary pulse positions.
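The abstract does not spell out the approximation, so the following is only a minimal sketch of one common way a two-dimensional autocorrelation matrix can be replaced by a one-dimensional vector: the matrix entry for positions i and j is approximated by a value that depends only on the lag |i - j|. The function names `autocorr_vector` and `pulse_energy`, and the representation of a pulse code vector as lists of positions and signs, are assumptions made for illustration and are not taken from the source.

```python
import numpy as np

def autocorr_vector(h):
    """Return r[k] = sum_n h[n] * h[n - k] for k = 0 .. len(h) - 1."""
    h = np.asarray(h, dtype=float)
    L = len(h)
    return np.array([np.dot(h[k:], h[:L - k]) for k in range(L)])

def pulse_energy(positions, signs, r):
    """Approximate the energy of a filtered pulse vector.

    Assumption: the matrix entry phi[i][j] is approximated by r[|i - j|],
    so only the one-dimensional vector r has to be stored.
    """
    energy = 0.0
    for p_i, s_i in zip(positions, signs):
        for p_j, s_j in zip(positions, signs):
            energy += s_i * s_j * r[abs(p_i - p_j)]
    return energy

# Hypothetical impulse response and a two-pulse code vector.
h = [1.0, 0.5, 0.25]
r = autocorr_vector(h)
energy = pulse_energy([0, 2], [1, -1], r)
```

Storing `r` takes L values instead of the L x L entries of the full matrix, which is where the storage saving in this sketch comes from.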
Abstract:
A method and apparatus for coding a quasi-periodic speech signal. The speech signal is represented by a residual signal generated by filtering the speech signal with a Linear Predictive Coding (LPC) analysis filter. The residual signal is encoded by extracting a prototype period from a current frame of the residual signal. A first set of parameters is calculated which describes how to modify a previous prototype period to approximate the current prototype period. One or more codevectors are selected which, when summed, approximate the error between the current prototype period and the modified previous prototype. A multi-stage codebook is used to encode this error signal. A second set of parameters describes these selected codevectors. The decoder synthesizes an output speech signal by reconstructing a current prototype period based on the first and second sets of parameters and the previous reconstructed prototype period. The residual signal is then interpolated over the region between the current and previous reconstructed prototype periods. The decoder synthesizes output speech based on the interpolated residual signal.
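The multi-stage codebook step can be illustrated with a short sketch. It assumes a greedy search in which each stage picks the codevector closest (in squared error) to whatever remains of the error signal after the earlier stages. The abstract does not state the search strategy, so the greedy rule, the function name `multistage_search`, and the toy codebooks are all hypothetical.

```python
import numpy as np

def multistage_search(error, codebooks):
    """Greedy multi-stage search.

    Returns the selected index per stage and the sum of the selected
    codevectors, which approximates `error`.
    """
    residual = np.asarray(error, dtype=float).copy()
    approx = np.zeros_like(residual)
    indices = []
    for codebook in codebooks:
        cb = np.asarray(codebook, dtype=float)
        # Squared distance from every codevector to the remaining residual.
        dists = np.sum((cb - residual) ** 2, axis=1)
        best = int(np.argmin(dists))
        indices.append(best)
        approx += cb[best]
        residual -= cb[best]
    return indices, approx

# Toy example: a 2-sample error signal and two hypothetical stage codebooks.
error = [1.0, -0.5]
stage1 = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
stage2 = [[0.0, 0.5], [0.0, -0.5], [0.5, 0.0]]
indices, approx = multistage_search(error, [stage1, stage2])
```

In this sketch the list `indices` plays the role of the second set of parameters: a decoder holding the same codebooks can rebuild `approx` by summing the indexed codevectors.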
Abstract:
A method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder includes partitioning the frequency spectrum of a prototype of a frame by dividing the frequency spectrum into segments, assigning one or more bands to each segment, and establishing, for each segment, a set of bandwidths for the bands. The bandwidths may be fixed and uniformly distributed in any given segment. The bandwidths may be fixed and non-uniformly distributed in any segment. The bandwidths may be variable and non-uniformly distributed in any given segment.
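As a rough illustration of the partitioning described above, the sketch below covers only the first case, fixed and uniformly distributed bandwidths within each segment. The function name `partition_bands`, the use of hertz for the segment edges, and the example values are assumptions for illustration.

```python
def partition_bands(segment_edges, bands_per_segment):
    """Split each segment into equal-width bands.

    segment_edges:     ascending frequencies, e.g. [0, 1000, 4000]
    bands_per_segment: number of bands assigned to each segment
    Returns a list of (low, high) band edges covering the spectrum.
    """
    bands = []
    segments = zip(segment_edges[:-1], segment_edges[1:])
    for (low, high), count in zip(segments, bands_per_segment):
        width = (high - low) / count  # uniform bandwidth in this segment
        for k in range(count):
            bands.append((low + k * width, low + (k + 1) * width))
    return bands

# Two segments: 0-1000 Hz with 2 bands, 1000-4000 Hz with 3 bands.
bands = partition_bands([0, 1000, 4000], [2, 3])
```

The non-uniform cases could be handled the same way by passing an explicit list of bandwidths per segment instead of a band count.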
Abstract:
Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied by an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.
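The estimate described above (count per bin times average distortion per bin, accumulated and scaled by a correction factor) can be sketched as follows. The bin edges, the per-bin average distortions, and the correction factor are made-up illustrative values. The abstract does not say how they are obtained, and histogramming the coefficient magnitudes is likewise an assumption of this sketch.

```python
import numpy as np

def estimate_distortion(coeffs, bin_edges, avg_bin_distortion, correction=1.0):
    """Histogram-based estimate of total quantization distortion.

    coeffs:             coefficients to be quantized
    bin_edges:          histogram bin edges over coefficient magnitude (assumed)
    avg_bin_distortion: average distortion per coefficient for each bin
    correction:         scalar correction factor applied to the accumulated sum
    """
    counts, _ = np.histogram(np.abs(coeffs), bins=bin_edges)
    per_bin = counts * np.asarray(avg_bin_distortion, dtype=float)
    return correction * per_bin.sum()

# Hypothetical inputs: five coefficients, three bins.
coeffs = [0.1, -0.4, 1.2, 2.5, -2.7]
estimate = estimate_distortion(
    coeffs,
    bin_edges=[0.0, 1.0, 2.0, 3.0],
    avg_bin_distortion=[0.05, 0.2, 0.3],
    correction=1.1,
)
```

Repeating this call with a different table of average distortions for each candidate quantization step would give the per-step distortion estimates that the step selection relies on.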
Abstract:
This disclosure describes selective deblock filtering techniques that are particularly useful with coding standards that do not specify in-loop deblock filtering for standard compliance. In accordance with this disclosure, deblock filtering may be selectively performed with respect to block boundaries of a given video block being coded, a motion compensation process can be performed using a filtered version of the prediction video frame used to code the given video block, or both. This disclosure also provides selection rules that can be applied to determine what type of deblocking techniques to apply in various scenarios. The selection rules may improve the video coding and may also ensure that mismatch between video blocks at an encoder and a decoder is not introduced by the deblock filtering.
Abstract:
Systems, methods, and apparatus for the detection of signals having spectral peaks with narrow bandwidth are described herein. The range of described configurations includes implementations that perform such detection using parameters of a linear prediction coding (LPC) analysis scheme.
Abstract:
The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.
Abstract:
In a device configurable to encode speech, performing a closed-loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. In a first closed-loop stage, a first set of compressed components and a first set of uncompressed components for the current frame may be generated. A first set of features may be generated by comparing current and past frame amplitude and/or phase components. In a second closed-loop stage, a second set of compressed components for the current frame may be generated by compressing the first set of compressed components and compressing the first set of uncompressed components. Generation of a second set of features may be based on the second set of compressed components from the current frame and a combination of amplitude and/or phase components from the past frame.