摘要:
Method for encoding a digital video stream, comprising the steps of encoding a video sequence into a full frame sequence, forming a decimated frame sequence by removing a predetermined number of frames from the full frame sequence by means of temporal decimation, locally decoding the full frame sequence, locally decoding the decimated frame sequence, temporally interpolating the decoded decimated frame sequence by means of an interpolator, comparing the locally decoded frames of the full frame sequence with the corresponding frames of the locally interpolated frame sequence, determining residual information for a frame based on at least the comparison for that frame, and providing an output stream comprising the decimated frame sequence and the determined residual information.
摘要:
In macroblock-based coding systems such as MPEG-1 video, MPEG-2 video and MPEG-4 visual, an encoder in the invention decides on macroblock level whether the macroblock is to be encoded or whether local motion-compensated interpolation processing at a compatible decoder can be used to reconstruct the macroblock. In the latter case, the macroblock is skipped. If the decoder detects a skipped macroblock, the decoder reconstructs the macroblock and overwrites the data conventionally generated in MPEG under the skipped macro-block condition.
摘要:
There is provided an image encoding system (300, 400) including an encoder (300) for receiving input image data and generating corresponding encoded image output data. The encoder includes image processing features (310, 320, 330, 360) for processing said input image data to generate for each input image therein a plurality of corresponding image layers including at least one basic layer BLOP and at least one enhancement layer ELOP. Moreover, the encoder (300) further includes encoding features (350) for receiving said image layers and generating therefrom the encoded image output data. The encoding features further comprising block selecting features (340) for selecting one or more sub-regions of said at least one enhancement layer and modelling said one or more sub-regions for representation thereof in the image output data by way of descriptive model parameters.
摘要:
The invention relates to a video encoder (201) for encoding a video signal. The video encoder comprises a segmentation processor (207) which divides the picture into picture regions. Preferably, picture regions having a high degree of flatness or uniformity are determined in this way. A characteristics processor (209) determine a spatial frequency characteristic for each picture region, and a coding controller (211) selects an encoding block size, such as a prediction block size for motion estimation, in response to the spatial frequency characteristic. An encode processor (213) encodes the picture using the selected encoding block size. Specifically, increasing block sizes are selected for increasing degrees of uniformity or flatness indicated by the spatial frequency characteristic. Thereby, an increasing proportion of high frequency components and a consistent choice of encoding block sizes are maintained, and thus the coding artefacts from many encoders having variable prediction block sizes is reduced. The invention is particularly suitable for H.264 and similar encoders.
摘要:
Coding of a video signal is provided according to a predefined standard, wherein in a given operation mode some of the tools provided by the predefined standard are disabled, and wherein an identification of the disabled tools is included in the bit-stream, the disabled tools being one or more out of the group of: bidirectional predictive coding of pictures or picture parts, use of a de-blocking filter, use of more than one reference picture.
摘要:
A content augmentation process for personal recordings involves a service center (SC). The service center (SC) collects personal recordings from various different users via a network so as to constitute a database (DB) of personal recordings. The service center (SC) identifies personal recordings within the database (DB) that concern a particular scene and that are mutually complementary so as to form a selection of personal recordings (FSRR) for content augmentation purposes. The service center (SC) applies a content augmentation process (AUGP) to the selection of personal recordings (FSRR) so as to obtain an enhanced representation (CA).
摘要:
Techniques utilising Time Scale Modification (TSM) of signals are described. The signal is analysed and divided into frames of similar signal types. Techniques specific to the signal type are then applied to the frames thereby optimising the modification process. The method of the present invention enables TSM of different audio signal parts to be realized using different methods, and a system for effecting said method is also described.
摘要:
A method and system for encrypting a video data stream, the video data stream partitioned into units based upon a type of data contained within the units. The method comprising: determining for each unit the type of data contained within the unit; and encrypting a particular unit or a portion of the particular unit based upon the type of data contained within the unit.
摘要:
There is provided an image encoding system (300, 400) including an encoder (300) for receiving input image data and generating corresponding encoded image output data. The encoder includes image processing features (310, 320, 330, 360) for processing said input image data to generate for each input image therein a plurality of corresponding image layers including at least one basic layer BLOP and at least one enhancement layer ELOP. Moreover, the encoder (300) further includes encoding features (350) for receiving said image layers and generating therefrom the encoded image output data. The encoding features further comprising block selecting features (340) for selecting one or more sub-regions of said at least one enhancement layer and modelling said one or more sub-regions for representation thereof in the image output data by way of descriptive model parameters.
摘要:
2D/3D video conversion using a method for providing an estimation of visual depth for a video sequence, the method comprises an audio scene classification (34) in which a visual depth categorization index of visual depth (37) of a scene is made on basis of an analysis of audio information (32) for the scene, wherein the visual depth categorization index (37) is used in a 5 following visual depth estimation (38) based on video information (33) for the same scene, thereby reducing the calculation load and speeding up the processing.