摘要:
Apparatus and methods are provided for encoding an input frame of a video sequence for transmission over a channel. The method and apparatus decompose the input frame into multiple subbands and divide the multiple subbands into multiple blocks corresponding to a region of the input frame. The blocks in the highest frequency subbands of the multiple blocks are selected based upon a luminance component of the input frame and the multiple blocks in the highest frequency subbands are classified into a multiple classes to provide a multiple class labels. The multiple class labels are collected to form a subband class map for each of the multiple blocks in the highest frequency subbands and a global class map is constructed from a majority evaluation of the subband map for each of the multiple blocks. The multiple blocks within the multiple subbands are grouped which have one of the class labels to form multiple subband class sequences.
摘要:
A hyperspectral image encoder (10) for compressing hyperspectral imagery includes a differential pulse code modulation (DPCM) loop (26) to perform data decorrelation in the spectral domain and a discrete wavelet transform (DWT) processing means (28) to perform decorrelation in the spatial domain. The DPCM loop (26) determines an error image between a present image at the input of the encoder (10) and a predicted image. The DWT processing means (28) then divides the error image into a plurality of frequency subbands and quantizes information within the subbands in accordance with a plurality of predetermined quantization states to provide a coded output signal. In one embodiment, an interband predictor (24) is provided to predict an image, for use in calculating the error image, using the coded output signal.
摘要:
Methods for generating an image mosaic are provided. In one respect, pixels saliency of a first image and a second image are determined. One salient pixel may be selected from the determined pixels saliency group of the first image and one salient pixel may be selected from the determined pixels saliency group of the second image. A mosaicking technique of the first and second image may be performed if the one salient pixel of the first image and the one salient pixel of the second image are registered successfully.
摘要:
A multi-carrier communication system (400) groups subchannels (802) into different quality-of-signal (QoS) regions (804). An unconstrained optimization process (1200) is performed independently for the subchannels (802) of the different QoS regions (804) to allocate bit rates and power to the individual subchannels (802) so that the indicated QoS will result. Coders (504, 508, 512, 516, 1700) partition and error-correction encode source information using encoding schemes matched to the different QoS regions (804). A set (1100) of only a few directed QoS partition vectors (1102) direct the unconstrained optimization process (1200) to attempt bit-rate and power allocations on only a few promising groupings of subchannels (802) and QoS regions (804). An iterative process may take place between bit-rate and power allocation on one side and source information coding on the other for different directed QoS partition vectors (1102) to identify the best solution. A coder (1700) may use a dual allocation process (1800) to iteratively combine a rate-allocation process (1724) with a QoS-region-allocation process (1726) to specify codebooks (1720) for use by a codebook quantizer (1718).
摘要:
A multi-carrier communication system (400) groups subchannels (802) into different quality-of-signal (QoS) regions (804). An unconstrained optimization process (1200) is performed independently for the subchannels (802) of the different QoS regions (804) to allocate bit rates and power to the individual subchannels (802) so that the indicated QoS will result. Coders (504, 508, 512, 516) partition and error-correction encode source information using encoding schemes matched to the different QoS regions (804). A set (1100) of only a few directed QoS partition vectors (1102) direct the unconstrained optimization process (1200) to attempt bit-rate and power allocations on only a few promising groupings of subchannels (802) and QoS regions (804). An iterative process may take place between bit-rate and power allocation on one side and source information coding on the other for different directed QoS partition vectors (1102) to identify the best solution.
摘要:
A system is presented for compression of hyperspectral imagery. Specifically, DPCM is used for spectral decorrelation, while an adaptive 2-D discrete cosine transform (DCT) (22) coding scheme is used for spatial decorrelation. Trellis coded quantization (23) is used to encode the transform coefficients. Side information and rate allocation strategies (36) are discussed. Entropy-constrained codebooks are designed using a modified version of the generalized Lloyd algorithm
摘要:
A method and apparatus (100) for pitch-epoch-synchronous source-filter speech encoding by means of error component modeling methods (310) which capture fundamental orthogonal (uncorrelated) basis elements of an excitation source waveform. A periodic waveform model (318) along with four orthogonal error waveforms, desirably including phase error (319), ensemble error (321), standard deviation error (323), and mean error (324) waveforms, are incorporated together to form a complete description of the excitation. These error waveforms (319,321, 323, 324) represent those portions of the excitation that are not represented by the purely periodic model. By thus orthogonalizing the error components, the perceptual effect of each element is isolated from the composite set, and can thus be encoded separately. In addition to high-quality, fixed-rate operation, the identity-system capability and low complexity of the speech encoding method and apparatus make them applicable to variable-rate applications without changing underlying modeling methods.
摘要:
Methods and devices for a media processing is provided. In one respect, the methods can provide initiating a bandwidth throttle or a frame rate throttle when resources of a network exceed resources of client device. The methods of the present disclosure may also provide techniques for handling lost packets during transmission using wavelet coefficients.
摘要:
Apparatus and methods are provided for object recognition and compression. The apparatus (114) comprises an object processor (402) configured to receive the image (120) and synthesize a contour (404) of an object within the image (120) and a classification engine (406) configured to receive the contour (404) of the image (120) and recognize the object within the image as a member of a first object class if the object substantially meets first object criteria of the first object class that is at least partially related to the target-specific utility of the image. The apparatus (114) also comprises a multi-rate encoder (116) configured to compress a first region of the image (120) having said object recognized as said member of said first object class at a first coding rate, said first coding rate providing a first coding resolution of said first region that is greater than a second coding resolution provided by a second coding rate for the image.
摘要:
A communication system (20) employs fixed rate channel-optimized, trellis-coded quantization (COTCQ) at a plurality of diverse encoding bit rates. COTCQ is performed through a COTCQ encoder (40) and COTCQ decoder (54). The COTCQ encoder and decoder (40,54) each include a codebook table (62) having at least one codebook (64) for each encoding bit rate. Each codebook (64) is configured in response to the bit error probability of the channel (26) through which the communication system (20) communicates. The bit error probability influences codebooks through the calculation of channel transition probabilities for all combinations of codewords (90) receivable from the channel (26) given all combinations of codewords (90) transmittable through the channel (26). Channel transition probabilities are responsive to base channel transition probabilities and the hamming distances between indices for codewords within subsets of the transmittable and receivable codewords.