摘要:
A method (200) of constructing at least one list of reference pictures for inter-layer prediction of a current picture is provided. The method comprises inserting (210) reference pictures into a first set of reference pictures or a second set of reference pictures, based on respective values of a scalability identifier associated with the reference pictures and a value of the scalability identifier associated with the current picture, and inserting (220) the first set of reference pictures and the second set of reference pictures into the at least one list of reference pictures. By taking indications for similarities between reference layers and the current layer into account, a more efficient multi-layer video compression is achieved. Further, a corresponding computer program, a corresponding computer program product, and a corresponding device are provided.
摘要:
A block-specific filter decision value is calculated for a pixel block (10) in a video frame. If the block-specific filter decision value is below a block-specific threshold, each row or column (12) in the block (10) is individually processed in order to select between a strong and a weak de-blocking filter. A respective line-specific filter decision value is thereby calculated for each row or column (12) in the block (10) and compared to a line-specific threshold. If the line-specific filter decision value calculated for a row or column (12) is below the line-specific threshold a strong de-blocking filter is selected for the row or column (12), otherwise a weak de-blocking filter is instead selected to combat any blocking artifacts.
摘要:
Visible artifacts in a video stream of pictures with slices are reduced by having a separate maximum transform size for intra coding units in inter coded slices as compared to intra coding units in intra coded slices and/or inter coding units or by penalizing the usage of large transform size for such intra coding units in inter coded slices as compared to intra coding units in intra coded slices and/or inter coding units.
摘要:
Current deblocking filters are using the same filters with the same filtering strength irrespective of the block size and the size of the transform used. However, in the new video coding standards such as emerging HEVC the PU sizes can vary from 4 to 64 and the TU sizes can vary from 4 to 32. Therefore, filtering the same amount of pixels (e.g. two or three) from the block boundary for the block of size 4 can be excessive, while for the block size 32 it may not be enough, with the result that the boundary between two blocks is still visible. Hence, there is a need for an efficient deblocking filter control that can be used to reduce blocking artifacts at block boundaries and that does not have the above mentioned drawbacks. It is a general objective to provide an efficient deblocking filter control. Thus, the objective is solved by applying different filters for different block sizes such as CU, PU or/and TU sizes. Accordingly, the deblocking filtering strength is adjusted based on the block size, which implies that the amount of modification applied to pixels by the deblocking filter is varied depending on the block size. The amount of modification that is being varied is in one embodiment the number of pixels to be modified.
摘要:
A media container file (1) is generated by organizing media data (2; 3) defined by a media track (12) in the file (1). Sub-track information (72, 74) identifying media data portions (4, 5; 6, 7, 8) of the media data (2; 3) is organized for each sub-track of multiple sub-tracks defined in the media track (12). At least one of the sub-tracks is assigned selection information (62, 64) defining a selective processing of the media data portion (4, 5; 6, 7, 8) defined by the sub-track in relation to other media data organized in the media container file (1). The media data (2, 3) advantageously relate to layered media or media defining multiple camera views which are organized into sub-tracks (12). The selection information (62, 64) allows selection among tracks (12) and sub-tracks when setting up a media session and switching between tracks (12) and sub-tracks during such a media session.
摘要:
Blocking artifacts at a block boundary (1) between a block (10) and a neighboring block (20) in a video frame are reduced by calculating an offset based on pixel values of pixels (11, 13) in a line (12) of pixels (11, 13, 15, 17) in the block (10) and based on pixel values of pixels (21, 23) in a corresponding line (22) of pixels (21, 23, 25, 27) in the neighboring block (20). The offset is added to the pixel value of the pixel (11) closest to the block boundary (1) in the line (12) of pixels (11, 13, 15, 17) and is subtracted from the pixel value of the pixel (21) closest to the block boundary (1) in the corresponding line (22) of pixels (21, 23, 25, 27). The resulting deblocking filter has good low-pass characteristics and is efficient for reducing blocking artifact.
摘要:
Pixel values of pixels (12, 14, 16, 22, 24, 26) in a line (15) of pixels (12, 14, 16, 18, 22, 24, 26, 28) are filtered with a strong deblocking filter to obtain filtered pixel values. Each filtered pixel value is clipped off to a respective clipping parameter value defined based on a position of the pixel (12, 14, 16, 22, 24, 26) relative to a block boundary (2) between two adjacent blocks (10, 20) of pixels (12, 14, 16, 18, 22, 24, 26, 28). The clipping parameter values change at least linearly depending in the pixel position relative to the block boundary (2) so that pixels (12, 16) in the line (15) of pixels (12, 14, 16, 18, 22, 24, 26, 28) having different positions from the block boundary (2) will have different clipping parameter values.
摘要:
There is provided a video apparatus having a stereoscopic display associated therewith, the video apparatus arranged to: receive at least one image and at least one reference parameter associated with said image; calculate a baseline distance for synthesizing a view, the calculation based upon the received at least one reference parameter and at least one parameter of the stereoscopic display; synthesize at least one view using the baseline distance and the received at least one image; and send the received at least one image and the synthesized at least one image to the stereoscopic display for display.