Abstract:
A video signal is encoded to provide a coded video signal, wherein a frame of the video signal is divided into a multiplicity of nonoverlapping blocks and each of the blocks contains K×L pixels, K and L being positive integers, by the steps of: dividing the K×L pixels included in a block into a bright group and a dark group based on intensity values of the pixels; deciding a bit plane which contains K×L binary pixels, each of the binary pixels denoting the group, bright or dark, to which a corresponding pixel in the block belongs; dividing the bit plane into N subblocks, each of which contains M binary pixels; deciding N majority values, each of which represents the more frequently occurring binary pixel value in one of the N subblocks, and providing a modified bit plane which contains the N majority values as its N binary pixel values; providing a restored bit plane which consists of N subblocks, each of which contains M binary pixels having a value corresponding to one of the N majority values; determining a sample mean and a sample variance of the block; determining two reconstruction values, each of which denotes a representative intensity value of the pixels included in the bright or the dark group; and combining the two reconstruction values and the modified bit plane, to thereby provide a block of the coded video signal.
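The steps of this abstract can be illustrated with a minimal Python sketch. Several details are assumptions, since the abstract does not fix them: pixels are split into bright and dark groups by thresholding at the block mean, ties in a subblock go to the bright group, subblocks are rectangular tiles of `sub_h` by `sub_w` pixels, and the two reconstruction values use the classical moment-preserving formulas of block truncation coding, evaluated on the restored bit plane. The function names `encode_block` and `decode_block` are hypothetical.

```python
from math import sqrt

def encode_block(block, sub_h, sub_w):
    """Encode one K x L block (list of rows); each subblock holds M = sub_h * sub_w pixels."""
    K, L = len(block), len(block[0])
    pixels = [p for row in block for p in row]
    n = len(pixels)
    mean = sum(pixels) / n
    var = sum((p - mean) ** 2 for p in pixels) / n  # sample variance of the block

    # Bit plane: 1 = bright group, 0 = dark group (assumption: threshold at the mean).
    plane = [[1 if p >= mean else 0 for p in row] for row in block]

    # Modified bit plane: one majority value per subblock (assumption: ties -> bright).
    modified = []
    for r in range(0, K, sub_h):
        out_row = []
        for c in range(0, L, sub_w):
            ones = sum(plane[r + i][c + j] for i in range(sub_h) for j in range(sub_w))
            out_row.append(1 if 2 * ones >= sub_h * sub_w else 0)
        modified.append(out_row)

    # Restored bit plane: every pixel of a subblock takes that subblock's majority value.
    restored = [[modified[r // sub_h][c // sub_w] for c in range(L)] for r in range(K)]
    q = sum(map(sum, restored))  # number of pixels marked bright after restoration

    # Reconstruction values (assumption: classical moment-preserving BTC levels).
    if q == 0 or q == n:
        low = high = mean
    else:
        sd = sqrt(var)
        low = mean - sd * sqrt(q / (n - q))
        high = mean + sd * sqrt((n - q) / q)
    return low, high, modified

def decode_block(low, high, modified, sub_h, sub_w):
    """Rebuild the block by expanding the modified bit plane and mapping bits to levels."""
    rows, cols = len(modified) * sub_h, len(modified[0]) * sub_w
    return [[high if modified[r // sub_h][c // sub_w] else low for c in range(cols)]
            for r in range(rows)]

block = [[10, 200, 200, 200],
         [10, 10, 200, 200],
         [10, 10, 200, 200],
         [10, 10, 200, 200]]
low, high, modified = encode_block(block, 2, 2)
decoded = decode_block(low, high, modified, 2, 2)
```

In this sketch the coded block carries only the two levels and N bits instead of K×L bits, which is where the saving over a full bit plane comes from; the stray bright pixel in the top-left subblock is absorbed by the majority vote.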
Abstract:
An image information processing apparatus includes a detector for detecting the motion vector of an image of the present field relative to that of the preceding field; a circuit for processing, on the basis of the motion vector, the image of the present field stored in a field memory; and a residual memory for storing the residual information calculated during the period in which the image data of one horizontal scanning line is input. The residual information stored in the residual memory is transferred into a blank area of the field memory during a predetermined interval after the period in which the image data is input. The processing circuit performs a wobble correction to correct any shaking of the image between fields, or compresses the data quantity of the moving image for image transmission. The information of one entire picture is processed after being divided into a plurality of blocks, whereby the storage capacity of the residual memory can be greatly reduced.
Abstract:
A moving picture coding method includes: making a determination as to whether or not to code all blocks in a current picture in the skip mode; setting, based on a result of the determination, a first flag indicating whether or not a temporally neighboring block is to be referenced, a value of a parameter for determining a total number of merging candidates, and a second flag for each block included in the current picture, the second flag indicating whether or not the block is to be coded in the skip mode; calculating, as a merging candidate, a neighboring block usable for merging; and coding an index which indicates a merging candidate to be used for coding of the current block and attaching the coded index to a bitstream.
Abstract:
A video coding method includes the following. A bitstream is decoded to obtain a feature map of a target object in a current picture. The feature map of the target object in the current picture is input to a visual task network and a prediction result output by the visual task network is obtained.
Abstract:
The present invention relates to a method and apparatus for encoding and decoding a video image based on a transform. The method for decoding a video includes: determining a transform mode of a current block; inverse-transforming residual data of the current block according to the transform mode of the current block; and rearranging the inverse-transformed residual data of the current block according to the transform mode of the current block, wherein the transform mode includes at least one of SDST (Shuffling Discrete Sine Transform), SDCT (Shuffling Discrete Cosine Transform), DST (Discrete Sine Transform) or DCT (Discrete Cosine Transform).
Abstract:
A method for determining a video coding test sequence, an electronic device, and a computer readable storage medium are provided. The method includes: determining a candidate video set including multiple candidate videos corresponding to a target service requirement; classifying the candidate videos by content categories to obtain a target distribution of content categories; clustering the candidate videos by values of a preset coding complexity to obtain multiple video classes; selecting from each of the video classes respectively a target class-representative video such that an actual distribution of content categories is consistent with the target distribution of content categories; and constructing a target video coding test sequence based on the target class-representative videos.
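The selection procedure described above might be sketched as follows. This is only an illustration under stated assumptions: each candidate is a hypothetical `(name, category, complexity)` tuple, the clustering step is replaced by a simple split of the complexity-sorted candidates into equal-size bins (a stand-in, not a real clustering algorithm), and "consistent with the target distribution" is approximated greedily by picking, in each class, the video whose category is furthest below its target quota. The function name `select_test_sequence` is hypothetical, and `num_classes` is assumed not to exceed the number of candidates.

```python
from collections import Counter

def select_test_sequence(videos, num_classes):
    """Pick one representative per complexity class while tracking the category mix."""
    total = len(videos)

    # Target distribution: share of each content category among the candidates.
    target = {cat: cnt / total for cat, cnt in Counter(v[1] for v in videos).items()}

    # Stand-in for clustering (assumption): equal-size bins over sorted complexity.
    ordered = sorted(videos, key=lambda v: v[2])
    classes = [ordered[i * total // num_classes:(i + 1) * total // num_classes]
               for i in range(num_classes)]

    picked = Counter()  # how many representatives each category has so far
    chosen = []
    for cls in classes:
        # Deficit: quota the category should reach minus what it already has.
        best = max(cls, key=lambda v: target[v[1]] * num_classes - picked[v[1]])
        chosen.append(best)
        picked[best[1]] += 1
    return chosen, target

# Hypothetical candidates: (name, content category, coding complexity value).
candidates = [
    ("v1", "game", 1.0), ("v2", "sports", 2.0),
    ("v3", "game", 3.0), ("v4", "sports", 4.0),
    ("v5", "game", 5.0), ("v6", "sports", 6.0),
    ("v7", "game", 7.0), ("v8", "sports", 8.0),
]
chosen, target = select_test_sequence(candidates, 4)
actual = Counter(v[1] for v in chosen)
```

A greedy pass like this does not guarantee the closest possible match to the target distribution; it only shows how the per-class choice can be steered by the category quotas.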
Abstract:
Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.
Abstract:
An encoder includes circuitry configured to receive an input video, select a current frame, identify a first sub-picture of the current frame to be encoded using a lossless encoding protocol, and encode the current frame, wherein encoding the current frame includes encoding the first sub-picture using the lossless encoding protocol.
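As a rough illustration of mixing a lossless sub-picture with an otherwise lossy frame, the sketch below makes several assumptions that the abstract does not state: the frame is a list of rows of 8-bit samples, the sub-picture is a rectangle given as `(top, left, height, width)`, the "lossless encoding protocol" is stood in for by `zlib`, and the lossy path is plain uniform quantization with a hypothetical step size. The names `encode_frame` and `decode_frame` are hypothetical, and this is not a real video codec.

```python
import zlib

def encode_frame(frame, region, step=16):
    """Code the whole frame coarsely and the sub-picture in `region` exactly."""
    top, left, h, w = region
    # Lossless path: the sub-picture samples are kept bit-exact (zlib is lossless).
    sub = bytes(frame[r][c] for r in range(top, top + h) for c in range(left, left + w))
    # Lossy path (assumption): uniform quantization drops low-order detail.
    coarse = bytes(p // step for row in frame for p in row)
    return {
        "size": (len(frame), len(frame[0])),
        "region": region,
        "step": step,
        "lossless": zlib.compress(sub),
        "lossy": zlib.compress(coarse),
    }

def decode_frame(coded):
    """Rebuild the frame, then overwrite the sub-picture with its exact samples."""
    rows, cols = coded["size"]
    step = coded["step"]
    coarse = zlib.decompress(coded["lossy"])
    out = [[min(255, coarse[r * cols + c] * step + step // 2) for c in range(cols)]
           for r in range(rows)]
    top, left, h, w = coded["region"]
    sub = zlib.decompress(coded["lossless"])
    for i in range(h):
        for j in range(w):
            out[top + i][left + j] = sub[i * w + j]
    return out

# Hypothetical 6 x 8 test frame and a 3 x 4 sub-picture starting at row 1, column 2.
frame = [[(r * 37 + c * 11) % 256 for c in range(8)] for r in range(6)]
region = (1, 2, 3, 4)
decoded = decode_frame(encode_frame(frame, region))
```

For simplicity the sketch also sends the sub-picture through the lossy path and then overwrites it on decode; a practical encoder would avoid coding that region twice.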
Abstract:
The disclosure relates to an image processing apparatus for compressing or decompressing a segment of an image, the segment being non-rectangular and comprising a plurality of pixels, each pixel comprising a pixel value, the pixel values of the plurality of pixels forming a pixel value vector, the apparatus comprising: a processor configured to compress the segment or configured to decompress the segment, wherein compressing the segment comprises computing a plurality of expansion coefficients by expanding the pixel value vector into a plurality of basis vectors, wherein the basis vectors are discrete approximations of solutions of a boundary value problem of the Helmholtz equation on the segment of the image; and wherein decompressing the segment comprises computing the pixel value vector by forming a linear combination of the basis vectors according to the plurality of expansion coefficients.
Abstract:
An encoding method according to the present disclosure includes: inputting three-dimensional data including three-dimensional coordinate data to a deep neural network (DNN); encoding the three-dimensional data by the DNN to generate encoded three-dimensional data; and outputting the encoded three-dimensional data.