摘要:
A system is provided for determining a binary codeword for a symbol representing a transform coefficient within transform units (TUs) that divide up coding units (CUs) in a High Efficiency Video Coding (HEVC) system. The system determines a truncated rice prefix and, when a parameter variable is greater than zero, determines a truncated rice suffix for the symbol. The system determines a main prefix either from the truncated rice prefix alone, or from a combination of the truncated rice prefix and the truncated rice suffix. When the main prefix is the same as a comparison string, the system also determines a main suffix. The system determines the final binary codeword for the symbol either from the main prefix alone, or from a combination of the main prefix and the main suffix.
摘要:
A method for encoding a video stream divided in macroblocks using an encoding scheme, the video stream comprising a transparency level channel, said method comprising: - classifying said macroblocks into inner macroblocks, for which a transparency value provided by said transparency information channel is substantially uniform, and transition macroblocks, for which a transparency value provided by said transparency level channel is not substantially uniform; - determining a statistic of said transparency value for each one of said inner macroblocks; and - configuring a respective parameter of said encoding scheme for each one of said inner macroblocks in function of its respective statistic.
摘要:
A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.
摘要:
An alpha image encoding and decoding scheme is disclosed. In the encoding an alpha image, is decomposed into image blocks (600) comprising multiple image element (610). The blocks (600) are compressed into block representations (700). A block representation (700) comprises at least a color codeword (710), an alpha codeword (720), an alpha modifying codeword (730) and a sequence (740) of alpha modifier indices. The color (710) and alpha (720) codeword (710) are representations of the colors and alpha value of the image elements (610) of the block (600), respectively. The alpha modifying codeword (730) is a representation of a set of multiple alpha modifiers for modifying an alpha value represented by the alpha codeword (720). The index sequence (740) includes an alpha index for each image element (610) in the block (600), where an alpha index identifies one of alpha modifiers in the alpha modifier set.
摘要:
Intended is to obtain a video encoding device, a video encoding method, and a video encoding program which enable to prevent reduction in compression efficiency caused by drastic changes in symbol occurrence probabilities in context adaptive coding, and a video decoding device, a video decoding method and a decoding program corresponding thereto. The video data 101 is input into video encoding device on a macroblock basis, and after quantization, the PCM determination unit 139 determines whether the coded data 123 is PCM mode or not. Even when the coded data 123 is PCM mode, the context updating unit 301 executes context updating processing so as to improve efficiency of binary arithmetic coding. According to the determination of PCM mode or non PCM mode, the third switch 148 supplies either binary arithmetic coding output or PCM coding output as its output data.
摘要:
A coded bit stream 30 generated on a coding side consists of a VO header 30a, a VOL header 30b, a GOV header 30c, a VOP header 30d and VOP data 30e, and the VOL header 30b multiplexes an object intra-coded indicator signal 7' indicating whether all the VOP data 30e contained in a VOL or GOV are intra coded or not. This enables a decoding side to recognize whether all the VOP data 30e contained in the VOL or GOV in the coded bit stream 30 are infra coded or not by only analyzing the object intra-coded indicator signal 7'. This can facilitate such processings as frame skip control or random access of the VOPs.
摘要:
A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.
摘要:
A generic spatially-scalable shape encoding apparatus and method for handling different mask decomposition methods, while maximizing coding efficiency of the encoder, is disclosed. The present generic spatially-scalable shape encoding applies three encoding steps to maximize the coding efficiency of the encoder, i.e., mask mode encoding, base mask layer coding and enhancement mask layer coding.
摘要:
The present invention concerns an encoding apparatus comprising: an image analysis module that generates shape, texture and motion information from image data representing at least a video object; a parameter coding module that adjusts texture information generated by the image analysis module based on original shape information from the image analysis module and encodes the adjusted texture information and shape and motion information; a decoding module that decodes encoded texture, shape and motion information from the parameter coding module; and a memory that stores decoded data from the decoding module. Further, the invention relates to a method of encoding texture information for a video object, the method comprising: performing image analysis to create a bounding shape for a current video object plane S k (VOP S k ); estimating texture and shape motion parameters for the VOP S k ; predictively encoding the texture and shape motion parameters with respect to a reference VOP S' k-1 ; transmitting and decoding the encoded texture and shape motion parameters; and storing a new reference VOP in memory.