摘要:
A new predictive coding is used to increase the temporal frame rate and coding efficiency without introducing excessive delay. Currently the motion vector for the blocks in the bi-directionally predicted frame is derived from the motion vector of the corresponding block in the forward predicted frame using a linear motion model. This however is not effective when the motion in the image sequence is not linear. The efficiency of this method can be further improved if a non-linear motion model is used. In this model a delta motion vector is added to or subtracted from the derived forward and backward motion vector, respectively. The encoder performs an additional search to determine if there is a need for the delta motion vector. The presence of this delta motion vector in the transmitted bitstream is signalled to the decoder which then takes the appropriate action to make use of the delta motion vector to derive the effective forward and backward motion vectors for the bi-directionally predicted block.
摘要:
A transmission method for transmitting coefficients representing image data is provided. The transmission method includes transmitting coded quantized coefficients obtained by coding quantized coefficients. The quantized coefficients are obtained by quantizing the coefficients using a complete quantization matrix having multiple quantization elements. The transmission method also includes transmitting a coded quantization matrix obtained by coding a truncated quantization matrix. The truncated quantization matrix is generated by truncating the complete quantization matrix. The coded quantization matrix has bits aligned in the order of bits obtained by coding the quantization elements included in the truncated quantization matrix and bits obtained by coding an end code.
摘要:
A method of encoding a local time base embedded in the compressed data is disclosed. The local time base is encoded in two parts. The first part has a modulo time base that indicates the specific interval in the reference time base and the second part has a time base increment relative to the reference time. Two forms of time base increment is used to allow for the possibility of different encoding order and displaying order. A mechanism for the synchronization of multiple compressed streams with local time base is also described. A time base offset mechanism is also used to allow finer granularity synchronization of the multiple compressed streams.
摘要:
A method of encoding a visual object. The method includes encoding the visual object to obtain compressed coded data using at least one of a plurality of different operations which can be performed on a block by block basis. The method also includes encoding information associated with a computational complexity of decoding the compressed coded data. The encoded information relates to the number of times each of the plurality of different operations is performed.
摘要:
A method for encoding a visual object, includes encoding a visual object to obtain compressed coded data, wherein the compressed coded data is obtained through at least one of a plurality of different operations which can be performed on a block by block basis. A descriptor capable of including information of a respective parameter related to the number of times that each of the operations was performed in a unit of blocks is generated, the descriptor associated with a complexity of decoding the compressed coded data. The descriptor is multiplexed with the compressed coded data, wherein the descriptor comprises an identification number identifying a descriptor type, a length field indicating a size of the descriptor, and the parameter.
摘要:
When dividing inputted image data to be coded into image data of a plurality of small regions which are adjacent to each other and coding the image data of an objective small region to be processed among the image data of the plurality of divided small regions which are adjacent to each other, reconstructed image data of a reproduction small region adjacent to the image data of the objective small region to be processed is used as image data of an intra-frame prediction small region of the objective small region to be processed, the image data of the intra-frame prediction small region is used as image data of an optimum prediction small region and image data of a difference small region which are differences between the image data of the objective small region to be processed and the image data of the optimum prediction small region is generated. Then, the generated image data of the difference small region is coded and outputted, and then the coded image data of the difference small region is decoded, so that the reconstructed image data of the reproduction small region is generated by adding the decoded image data of the difference small region to the image data of the optimum prediction small region.
摘要:
The present invention provides a unique intra prediction process which improves the efficiency of video coding. H.264/AVC uses reference pixels in a horizontal boundary located immediately above a target block to be predicted and reference pixels in a vertical boundary located immediately left of the target block. In the present invention, at least some of one of an array of horizontal boundary pixels and an array of vertical boundary pixels are retrieved. Then, the retrieved pixels are added to the other boundary pixels to extend the array thereof. Intra prediction is performed, based solely on the extended array of boundary pixels.
摘要:
A block divider divides an input image into a plurality of regions. A prediction signal generator generates a prediction signal for the pixel signal contained in an object region, which is the object of processing, among the plurality of regions, and generates a prediction signal by using a texture synthesis method for forming a texture signal for the object region. A subtracter determines a residual signal between the pixel signal of the object region and the prediction signal, and a transformer and quantizer encode the residual signal to generate a compressed signal.
摘要:
In a moving picture encoding apparatus 10, as an embodiment of the present invention, a predicted image generator 16 generates a predicted image with respect to a target image, using a reference image stored in a frame memory 20. A difference image generator 18 performs a difference operation between the target image and the predicted image to generate a difference image. An encoder 22 encodes the difference image to generate an encoded difference signal. A decoder 28 decodes the encoded difference signal to generate a decoded difference image. A reproduced image generator 30 performs a summation of the decoded difference image and the predicted image to generate a reproduced image. An image updater 32 performs a weighted summation of a first image which is one of the reproduced image and the reference image, and a second image which is the other of the reproduced image and the reference image, to generate an updated image. The updated image is stored into the frame memory 20 to be used as a reference image in encoding of another target image.
摘要:
In order to improve the encoding efficiency while avoiding an increase in the size or memory band of a frame memory and having adaptability in the encoding/decoding processing of a moving picture, a bit length extension converter converts a target picture having a bit length N into an extended target picture having a bit length M, a compressor encodes the converted picture, and an expander restores the encoded picture. Then, a bit length reduction converter converts the picture into a reproduction picture having a bit length L smaller than the bit length M, and this reproduction picture is stored in a frame memory as a reference picture.