Abstract:
A dyadic spatial down sampling filter having tap values configured according to a Kaiser window function a beta factor of approximately 2.5, having approximately 1.5 side lobes, and having a down sampling ratio of approximately 1.9. The dyadic spatial down sampling filter may have tap values [−1, 17, 32, 17, −1]/64. A dyadic spatial up sampling filter having tap values configured according to a Kaiser window function having a beta factor of approximately 1.5, having approximately 2 side lobes, and having an up sampling ratio of approximately 2. The dyadic spatial up sampling filter may have tap values [−5.44, 0, 20.71, 33.46, 20.71, 0, −5.44]/64.0, or tap values [−5, 0, 21, 32, 21, 0, −5]/64, or tap values [−5, 21, 21, −5]/32.
Abstract:
A method and system for sampling video data uses re-sampling filters having lengths optimized relative to a quantization parameter of video processing. The method uses modeling of an optimal length of the re-sampling filter as a function of the quantization parameter to derive empirical formulas and a look up table for optimal lengths of re-sampling filters. The resulting re-sampling filters are selectively adapted for sampling video data having different bit rates.
Abstract:
A localized content adaptive filter system including a tile buffer having an output providing first image information, a frequency analyzer providing a frequency information signal based on frequency content of the first image information, and an adaptive filter which is adjusted based on the frequency information signal. The frequency analyzer may include a wavelet transform filter and a frequency content analyzer. The adaptive filter may include filter select logic which receives the frequency information signal and second image information associated with the first image information, and which provides filtered image information. The filter select logic determines a filter based on the frequency information signal and the determined filter filters the second image information to provide the filtered image information. The filter select logic may include preconfigured filters in which a filter is selected by the frequency information, or a programmable filter which is programmed according to the frequency information.
Abstract:
A method for transmitting encoded video data includes determining an available space for transmitting an encoded video data in a last remaining information block of a video packet. The method further includes if the available space is sufficient, based on a comparison of the an estimated size of the encoded video data with the available space, then transmitting the encoded video data using the last remaining information block of the video packet. The method further includes if the available space is insufficient, changing a coding scheme associated with the encoded video data to transform the encoded video data into reduced size video data and transmitting the reduced size video data using the last remaining information block of the video packet.
Abstract:
A method of scaling complexity of a video processing system including determining a power saving factor based on an operating parameter and adjusting processing of video information based on the power saving factor to reduce computation complexity. The operating parameter may include available power and/or available processing capacity. A method of complexity scalability for a video processing system using prioritized layered coding including determining a power saving factor based on one or more metrics, such as power capacity and/or available processing capacity, and reducing processing complexity of multiple prioritized coding functions in a predetermined order of priority based on the level of the power saving factor. A video processing system including a power management circuit which determines the power saving factor and a video encoder system which correspondingly adjusts computation complexity.
Abstract:
A localized content adaptive filter system including a tile buffer having an output providing first image information, a frequency analyzer providing a frequency information signal based on frequency content of the first image information, and an adaptive filter which is adjusted based on the frequency information signal. The frequency analyzer may include a wavelet transform filter and a frequency content analyzer. The adaptive filter may include filter select logic which receives the frequency information signal and second image information associated with the first image information, and which provides filtered image information. The filter select logic determines a filter based on the frequency information signal and the determined filter filters the second image information to provide the filtered image information. The filter select logic may include preconfigured filters in which a filter is selected by the frequency information, or a programmable filter which is programmed according to the frequency information.
Abstract:
An error detection and correction circuit for a video decoder that reconstructs a lost macroblock including a clustering circuit, a classification circuit and an error concealment circuit. The clustering circuit clusters macroblocks adjacent to the lost macroblock into one or more defined clusters. The classification circuit assigns the lost macroblock or each sub-block of the lost macroblock to a defined cluster. The error concealment circuit reconstructs attributes of the lost macroblock or its sub-block based on selected attributes of a defined cluster to which the lost block is assigned. Clustering is based on entire adjacent macroblocks or sub-blocks thereof. The clustering circuit may perform clustering using any one or more of the attributes of the macroblocks including coding parameters, texture statistics, color components, frequency analysis, and image processing operators. The lost macroblock may be assigned as a whole or subdivided into lost sub-blocks that are individually assigned to clusters.
Abstract:
A sound imaging system and method for generating multi-channel audio data from an audio/video signal having an audio component and a video component. The system comprises: a system for associating sound sources within the audio component to video objects within the video component of the audio/video signal; a system for determining position information of each sound source based on a position of the associated video object in the video component; and a system for assigning sound sources to audio channels based on the position information of each sound source.
Abstract:
A method and image processing system are disclosed that extract feature information from an image using a model. An object in the image is first identified. The model is used at the front-end to quickly determine/define the approximate location/position of specific features of the object. Based in this modeling information, the location/position of the specific features are then determined. A feedback process may also be used to update/customize the model and improve performance.
Abstract:
The system disclosed herein builds a database of image-related data by inputting a plurality of images, and for each image: dividing the image into N1 (N1≧1) regions which are substantially homogenous with respect to at least one attribute of the image, generating a graph based on M1 (M1≧1) of the N1 regions, and storing data for the graph in the database. The database may then be used to determine whether a query image is similar to one or more of the plurality of images. To this end, the system inputs the query image, and divides the query image into N2 (N2≧1) regions which are substantially homogenous with respect to at least one attribute of the query image. The system then generates a graph based on M2 (M2≧1) of the N2 regions, and compares the generated graph to other graphs in the database that correspond to the plurality of images. Thereafter, an image is retrieved from memory that has a graph that is substantially similar to the generated graph.