Reference Picture Management in Video Coding

    公开(公告)号:US20210258568A1

    公开(公告)日:2021-08-19

    申请号:US17176595

    申请日:2021-02-16

    Abstract: A method of decoding a coded video bitstream includes obtaining a reference picture list structure for a current slice represented in the coded video bitstream, wherein the reference picture list structure contains a number of entries; obtaining a default number of active entries in a reference picture list for the current slice; constructing a reference picture list for the current slice, the reference picture list containing a number of active and inactive entries; setting the number of active entries in the reference picture list equal to the number of entries in the reference picture list structure when the default number of active entries in the reference picture list is greater than the number of entries in the reference picture list structure; and obtaining, based on at least one active entry of the reference picture list, at least one reconstructed block of the current slice.

    MEDIA DATA PROCESSING METHOD AND APPARATUS

    公开(公告)号:US20210243473A1

    公开(公告)日:2021-08-05

    申请号:US17232584

    申请日:2021-04-16

    Abstract: This application provides a media data processing method and apparatus. A media processing device receives a media stream. The media stream includes media data recorded at a plurality of viewpoints. The device obtains metadata information of the media stream. The metadata information includes viewpoint identification information of the viewpoints. The device displays media data of a first viewpoint based on the viewpoint identification information. The device also displays indications of other viewpoints when displaying the media data of the first viewpoint, so that a user of the device can switch to display media data of other viewpoints.

    Index Signaling For Reference Picture List Structures

    公开(公告)号:US20210195178A1

    公开(公告)日:2021-06-24

    申请号:US17196205

    申请日:2021-03-09

    Abstract: A method of decoding a coded video bitstream is provided. The method includes parsing a flag; parsing a first reference picture list structure; determining that an index to a second reference picture list structure is not present in a slice header of the coded video bitstream and inferring that the index to the second reference picture list structure is the same as an index to the first reference picture list structure when the flag has a first value; determining that the index to the second reference picture list structure is present in the slice header when the flag has a second value; generating a reference picture list using the first reference picture list structure or the second reference picture list structure; and performing inter-prediction based on the reference picture list to generate a reconstructed block.

    Prediction Type Signaling and Temporal Order Signaling in Point Cloud Coding (PCC)

    公开(公告)号:US20210134018A1

    公开(公告)日:2021-05-06

    申请号:US17146234

    申请日:2021-01-11

    Abstract: An apparatus comprises an encoder configured to obtain point clouds, generate a first field that implements prediction type signaling of the point clouds, generate a second field that implements temporal order signaling of the point clouds, and encode the first field and the second field into an encoded bitstream; and an output interface coupled to the encoder and configured to transmit the encoded bitstream. An apparatus comprises a receiver configured to receive an encoded bitstream; and a processor coupled to the encoded bitstream and configured to decode the encoded bitstream to obtain a first field and second field, wherein the first field implements prediction type signaling of point clouds, and wherein the second field implements temporal order signaling of the point clouds, and generate the point clouds based on the first field and the second field.

    Avoidance of Redundant Signaling in Multi-Layer Video Bitstreams

    公开(公告)号:US20250119589A1

    公开(公告)日:2025-04-10

    申请号:US18920491

    申请日:2024-10-18

    Inventor: Ye-Kui Wang

    Abstract: A method of decoding is provided. The method includes receiving a video bitstream including a plurality of layers having sublayers and a video parameter set (VPS) including a first flag having a first value, wherein the first flag having the first value specifies that temporal identifiers (IDs) of a highest sublayer representation for level information, decoded picture buffer (DPB) parameters, and hypothetical decoder refresh (HRD) parameters are not present in the VPS and are inferred to be equal to a maximum number of the sublayers that may be present in a layer of the plurality of layers specified by the VPS; obtaining the level information, the HRD parameters, and the DPB parameters corresponding to the temporal ID of the highest sublayer representation from the VPS; and decoding a picture from one of the plurality of layers to obtain a decoded picture. A corresponding method of encoding is also provided.

    Tile Group Signaling In Video Coding

    公开(公告)号:US20250119555A1

    公开(公告)日:2025-04-10

    申请号:US18922546

    申请日:2024-10-22

    Abstract: A method implemented in an encoder for encoding a video bitstream that includes coded data for a plurality of pictures, each of the plurality of pictures comprises at least one slice. The method includes encoding a flag that indicates whether tile information for a picture is present in a picture parameter set or present in a slice header, where the tile information indicates a location of a slice within the picture; encoding the tile information in only the picture parameter set when the flag indicates that the tile information for the picture is encoded in the picture parameter set; encoding the tile information in only the slice header when the flag indicates that the tile information for the picture is encoded in the slice header; and encoding data of the picture in the video bitstream based on the tile information.

    Decoded picture buffer operation for resolution changes

    公开(公告)号:US12273548B2

    公开(公告)日:2025-04-08

    申请号:US17701382

    申请日:2022-03-22

    Inventor: Ye-Kui Wang

    Abstract: A video coding mechanism is disclosed. The mechanism includes receiving a bitstream comprising a plurality of pictures. A no output of prior pictures flag (NoOutputOfPriorPicsFlag) is set when a value of maximum picture width in luma samples (PicWidthMaxInSamplesY) for a current access unit (AU) is different from a value of PicWidthMaxInSamplesY for a preceding AU in decoding order. A decoded picture buffer (DPB) is emptied without output of contained pictures based on a value of the NoOutputOfPriorPicsFlag. A current picture is decoded and stored in the DPB. The current picture is output from the DPB for display as part of a decoded video sequence.

    Slicing and tiling in video coding
    170.
    发明授权

    公开(公告)号:US12267531B2

    公开(公告)日:2025-04-01

    申请号:US17665263

    申请日:2022-02-04

    Abstract: A video coding mechanism is disclosed. The mechanism includes receiving at a decoder, a bitstream including a video coding layer (VCL) network abstraction layer (NAL) unit containing a slice of image data divided into a plurality of tiles. A number of the tiles in the VCL NAL unit are determined. A number of entry point offsets for the tiles is also determined as one less than the number of the tiles in the VCL NAL unit. Each entry point offset indicates a starting location of a corresponding tile in the VCL NAL unit. The number of entry point offsets is not explicitly signaled in the bitstream. The entry point offsets for the tiles are obtained based on the number of entry point offsets. The tiles are decoded at the entry point offsets to generate a reconstructed image.

Patent Agency Ranking