Abstract:
The color tone compensation method provides a simple and efficient way to compensate for color tone differences between two different image sources. A first image sample, such as a still image, from a first image capturing source and a second image sample, such as a video frame, from a second image capturing source are aligned, and a tone-mapping estimation routine is applied to the two aligned images. The tone-mapping estimation routine uses the pixel intensity value histograms associated with the two aligned images and generates a tone mapping table. The tone mapping table includes a conversion intensity value for each intensity value in the second image. The conversion intensity value is a statistical measure, such as the mean, calculated from the data in the corresponding pixel intensity value histogram. The tone mapping table is applied to any image generated by the second image capturing source, thereby generating a new image with a color tone similar to that of the first image generated by the first image capturing source.
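As a rough illustration of the tone mapping table described above, the following sketch builds a table from two aligned single-channel images and applies it to a new image. It assumes the images are flat lists of integer intensities, that the mean is the chosen statistical measure, and that intensities never seen in the second image map to themselves. The function names `build_tone_map` and `apply_tone_map` and the identity fallback are illustrative assumptions, not details taken from the abstract.

```python
def build_tone_map(first, second, levels=256):
    """Estimate a tone mapping table from two aligned images.

    first, second: equal-length sequences of integer intensities in
    [0, levels), where first[i] and second[i] are the same scene point.
    Returns a list `table` such that table[v] is the mean intensity in
    `first` over all pixels whose intensity in `second` equals v.
    """
    if len(first) != len(second):
        raise ValueError("images must be aligned and the same size")
    sums = [0] * levels
    counts = [0] * levels
    for a, b in zip(first, second):
        sums[b] += a      # accumulate first-source values per second-source level
        counts[b] += 1
    table = []
    for v in range(levels):
        if counts[v]:
            table.append(round(sums[v] / counts[v]))
        else:
            table.append(v)  # assumption: identity for levels never observed
    return table


def apply_tone_map(image, table):
    """Convert an image from the second source using the table."""
    return [table[v] for v in image]
```

A real implementation would likely process each color channel separately and interpolate across unobserved levels rather than fall back to identity.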
Abstract:
Motion Compensated Prediction (MCP) has been a key factor in most advanced video compression schemes. To further reduce the residual signal energy in B-frames, bidirectional prediction, in which two motion-compensated signals are superimposed, has also been used in most prior video coding standards such as MPEG-2 or MPEG-4/AVC. Syntax changes and appropriate motion vector prediction that allow efficient use of multi-parameter MCP are described. The prediction signal is constructed by linearly combining the motion-compensated signals from each parameter (or motion vector).
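The linear combination in the last sentence can be sketched as a weighted sum of already motion-compensated blocks. This is a minimal sketch that assumes each signal is a flat list of sample values of equal length and that equal weights are used when none are given. The name `combine_predictions` and the default weighting are illustrative assumptions, not part of the described syntax.

```python
def combine_predictions(signals, weights=None):
    """Linearly combine motion-compensated prediction signals.

    signals: list of equal-length sample lists, one per motion vector.
    weights: one coefficient per signal; defaults to a plain average.
    """
    if not signals:
        raise ValueError("at least one prediction signal is required")
    n = len(signals)
    if weights is None:
        weights = [1.0 / n] * n
    if len(weights) != n:
        raise ValueError("one weight per signal is required")
    length = len(signals[0])
    if any(len(s) != length for s in signals):
        raise ValueError("all signals must have the same length")
    # Weighted sum, sample by sample.
    return [
        sum(w * s[i] for w, s in zip(weights, signals))
        for i in range(length)
    ]
```

With two signals and the default weights this reduces to conventional bidirectional averaging; more signals or unequal weights give the multi-parameter case.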
Abstract:
Entropy encoding is performed in the inventive apparatus and method in response to the scanning of transform coefficients following an initial scanning pattern selected on the basis of probability statistics of non-zero coefficients for each block position. These non-zero probability statistics are ranked for a given combination of coding characteristics within the current block to arrive at an initial scanning pattern. The same initial scanning pattern selection is performed in the decoder to allow the transform coefficients to be extracted in their proper order from encoded video data. The pattern selection is applicable to both intra prediction and inter prediction. Transform coefficients are more accurately ordered in response to the invention because in adapting pattern initialization to quantization step size, high-frequency basis functions are properly taken into account.
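One way to picture the ranking step is to sort block positions by their probability of holding a non-zero coefficient, then scan in that order. The sketch below assumes the probabilities for one combination of coding characteristics are already available as a 2-D list, and that ties are broken by raster order. The helper names and the tie-breaking rule are illustrative assumptions. Because encoder and decoder derive the pattern from the same statistics, the decoder can invert the scan.

```python
def initial_scan_pattern(nonzero_prob):
    """Rank block positions by descending non-zero probability.

    nonzero_prob: 2-D list, nonzero_prob[r][c] is the probability that the
    coefficient at row r, column c is non-zero.
    Returns a list of (row, col) positions, most probable first. Python's
    sort is stable, so equal probabilities keep raster order.
    """
    positions = [
        (r, c)
        for r in range(len(nonzero_prob))
        for c in range(len(nonzero_prob[r]))
    ]
    return sorted(positions, key=lambda p: -nonzero_prob[p[0]][p[1]])


def scan_block(block, pattern):
    """Encoder side: read coefficients in scan order."""
    return [block[r][c] for r, c in pattern]


def unscan_block(coeffs, pattern, rows, cols):
    """Decoder side: place scanned coefficients back into block positions."""
    block = [[0] * cols for _ in range(rows)]
    for value, (r, c) in zip(coeffs, pattern):
        block[r][c] = value
    return block
```

In practice one such probability table would be kept per combination of coding characteristics, for example per prediction mode and quantization step size range.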
Abstract:
The currently existing ISO/AVC file format is modified by providing extensions to store and access video content currently being defined by the SVC standard. Specifically, extensions to the AVC file format are made to provide a new SVC file format that enables the storage and access of scalable video data. The scalable video data is stored as a single track within a media data section of the SVC file format. New extensions are defined for description entries and boxes within a metadata section of the SVC file format. These extensions provide means for extracting sub-streams or layers from the single track of scalable video data stored in the media data section.
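The idea of extracting a sub-stream from a single stored track can be sketched as filtering samples by a layer tag. The `Sample` type, its `layer` field, and `extract_substream` are hypothetical stand-ins for whatever the metadata extensions actually describe. The sketch also assumes that a layer depends only on lower-numbered layers, which the abstract does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    layer: int       # hypothetical layer tag (0 = base layer)
    payload: bytes   # coded data for this sample


def extract_substream(track, max_layer):
    """Keep samples at or below max_layer, preserving track order."""
    return [s for s in track if s.layer <= max_layer]


# Example: one track that interleaves a base layer and two enhancement layers.
track = [
    Sample(0, b"b0"),
    Sample(1, b"e0"),
    Sample(0, b"b1"),
    Sample(2, b"e1"),
]
base_only = extract_substream(track, 0)
```

Here `base_only` holds just the two base-layer samples, while a larger `max_layer` returns progressively more of the track.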
Abstract:
An apparatus or method of efficiently coding high resolution video using texture analysis and synthesis techniques in a scalable video coding framework. A high-resolution video signal is spatially downsampled and encoded into a base-layer. Texture and structure information are extracted from the downsampled signal and base-layer for use by a texture synthesizer. The texture synthesizer is structurally and texturally aware utilizing edge information from a base-layer of the synthesizer to improve synthesis. After synthesis, a video quality assessor directs enhancement-layer coding of unacceptably synthesized areas by alternate (non-textural) coding means, such as conventional AVC or MPEG-2 coding. In one mode, the quality assessor iteratively improves synthesis of certain blocks to make them acceptable for enhancement-layer coding. The apparatus or method outputs a bit stream, or bit streams, containing both the coded base-layer and the enhancement-layer.
Abstract:
A codec configured to operate in a sequential mode extends the current AVC standard in order to provide support for coding high resolution still image pictures within the AVC coding of a lower resolution video. The sequential mode codec is configured according to the modified AVC standard. The codec is capable of capturing an AVC video stream while concurrently capturing high resolution still images at random intervals relative to the video stream. Each captured high resolution still image is embedded within the video stream. The video stream including the embedded high resolution still images is encoded as a single layer bit stream. Each embedded high resolution still image signals or forces the start of a new GOP in the coded video sequence. The AVC standard is extended to include new syntax for the sequence parameter sets. In one embodiment, a modified sequence parameter set defines a new profile that signals the presence of a high resolution still image in sequence with AVC video.
Abstract:
An image processing apparatus and an image processing method, applied to, e.g., a TV conference system or a TV telephone system, in which an image to be transmitted or received is captured and reconstructed into a virtual viewpoint image that appears as if it were captured by a virtual camera. Correspondence between pixels can be established with higher robustness and accuracy, while mismatches in that correspondence caused by disparity are kept as small as possible. To this end, the disparity between the images obtained by imaging an object from different viewpoints with at least two cameras 11a, 12a is detected and, based on the detected disparity, each image is separated into plural image regions. For each separated image region, the correspondence is found, in association with the object, pixel position by pixel position along the same horizontal line. From the pixel positions and associated luminance components for which correspondence has been found, a virtual viewpoint image, as would be generated by an imaginary virtual camera, is constructed.
Abstract:
A method permitting compact ways to update relationships between entities in an audiovideo sequence, or serial set of sequences, the method comprising: writing a description between the entities, the description containing relations; determining the relations that may be represented by parameters, each parameter having a numerical value; and obtaining from the user, one or more of the following: (a) the numerical value for the parameter; (b) a description of the parameter containing the numerical value; and (c) a description capable of setting the parameter dynamically.
Abstract:
A system and method of coding (encoding and/or decoding) video content to extend file formats for storage. The system and method use the concept of defining additional sample group description entries. By way of example, the method can comprise the steps of: (1) receiving a file with encoded media data as a scalable video codec stream; (2) extracting information identifying the various spatial resolutions, temporal resolutions, quality resolutions, or combinations of spatio-temporal-quality resolutions from the media data; (3) generating new description entries and a dependency grouping box; (4) populating boxes with the extracted metadata; and (5) incorporating the metadata into a file associated with the media data using a specific media file format.