MULTI-TYPE TREE CODING
    1.
    发明公开

    公开(公告)号:US20240323373A1

    公开(公告)日:2024-09-26

    申请号:US18680359

    申请日:2024-05-31

    申请人: VID SCALE, INC.

    摘要: A block may be identified. The block may be partitioned into one or more (e.g., two) sibling nodes (e.g., sibling nodes B0 and B1). A partition direction and a partition type for the block may be determined. If the partition type for the block is binary tree (BT), one or more (e.g., two) partition parameters may be determined for sibling node B0. A partition parameter (e.g., a first partition parameter) may be determined for sibling node B1. A decoder may determine whether to receive an indication of a second partition parameter for B1 based on, for example, the partition direction for the block, the partition type for the block, and the first partition parameter for B1. The decoder may derive the second partition parameter based on, for example, the partition direction and type for the block, and the first partition parameter for B1.

    METHODS AND APPARATUS FOR REDUCING THE CODING LATENCY OF DECODER-SIDE MOTION REFINEMENT

    公开(公告)号:US20240244253A1

    公开(公告)日:2024-07-18

    申请号:US18623846

    申请日:2024-04-01

    申请人: VID SCALE, INC.

    摘要: Embodiments of video coding systems and methods are described for reducing coding latency introduced by decoder-side motion vector refinement (DMVR). In one example, two non-refined motion vectors are identified for coding of a first block of samples (e.g. a first coding unit) using bi-prediction. One or both of the non-refined motion vectors are used to predict motion information for a second block of samples (e.g. a second coding unit). The two non-refined motion vectors are refined using DMVR, and the refined motion vectors are used to generate a prediction signal of the first block of samples. Such embodiments allow the second block of samples to be coded substantially in parallel with the first block without waiting for completion of DMVR on the first block. In additional embodiments, optical-flow-based techniques are described for motion vector refinement.

    Face discontinuity filtering for 360-degree video coding

    公开(公告)号:US12003770B2

    公开(公告)日:2024-06-04

    申请号:US17884399

    申请日:2022-08-09

    申请人: VID SCALE, INC.

    摘要: Systems, methods, and instrumentalities are disclosed for discontinuous face boundary filtering for 360-degree video coding. A face discontinuity may be filtered (e.g., to reduce seam artifacts) in whole or in part, for example, using coded samples or padded samples on either side of the face discontinuity. Filtering may be applied, for example, as an in-loop filter or a post-processing step. 2D positional information related to two sides of the face discontinuity may be signaled in a video bitstream so that filtering may be applied independent of projection formats and/or frame packing techniques.

    Symmetric merge mode motion vector coding

    公开(公告)号:US11979595B2

    公开(公告)日:2024-05-07

    申请号:US17438119

    申请日:2020-03-11

    申请人: VID SCALE, INC.

    发明人: Hua Yang Yuwen He

    摘要: Systems, devices, and methods are described herein for symmetric merge mode motion vector coding. Symmetric bi-prediction (bi-pred) motion vectors (MVs) may be constructed from available candidates in a merge candidate list for regular inter prediction merge mode and/or affine prediction merge mode. Available MV merge candidates may be symmetrically extended or mapped in either direction (e.g., between reference pictures before and after a current picture), for example, when coding a picture that allows bi-directional motion compensation prediction (MCP). A symmetric bi-pred merge candidate may be selected among merge candidates for predicting the motion information of a current prediction unit (PU). The symmetric mapping construction may be repeated by a decoder (e.g., based on a coded index of the MV merge candidate list), for example, to obtain the same merge candidates and coded MV at an encoder.

    Geometric conversion for 360-degree video coding

    公开(公告)号:US11778231B2

    公开(公告)日:2023-10-03

    申请号:US16301998

    申请日:2017-05-25

    申请人: VID SCALE, INC.

    摘要: Processing a 360-degree video content for video coding may include receiving the video content in a first geometry. The video content may include unaligned chroma and luma components associated with a first chroma sampling scheme. The unaligned chroma and luma components may be aligned to a sampling grid associated with a second chroma sampling scheme that has aligned chroma and luma components. A geometric conversion to the video content may be performed. The video content, that may comprise the aligned chroma and luma components, in the first geometry may be converted to a second geometry. The first geometry may be a stitched geometry, and the second geometry may be a coding geometry. The converted video content in the second geometry may include the chroma and luma components aligned to the sampling grid associated with the second chroma sampling scheme.

    ADAPTIVE MOTION VECTOR PRECISION FOR AFFINE MOTION MODEL BASED VIDEO CODING

    公开(公告)号:US20230136599A1

    公开(公告)日:2023-05-04

    申请号:US18089027

    申请日:2022-12-27

    申请人: Vid Scale, Inc.

    摘要: Systems and methods are described for video coding using affine motion models with adaptive precision. In an example, a block of video is encoded in a bitstream using an affine motion model, where the affine motion model is characterized by at least two motion vectors. A precision is selected for each of the motion vectors, and the selected precisions are signaled in the bitstream. In some embodiments, the precisions are signaled by including in the bitstream information that identifies one of a plurality of elements in a selected predetermined precision set. The identified element indicates the precision of each of the motion vectors that characterize the affine motion model. In some embodiments, the precision set to be used is signaled expressly in the bitstream; in other embodiments, the precision set may be inferred, e.g., from the block size, block shape or temporal layer.

    FACE DISCONTINUITY FILTERING FOR 360-DEGREE VIDEO CODING

    公开(公告)号:US20220385942A1

    公开(公告)日:2022-12-01

    申请号:US17884399

    申请日:2022-08-09

    申请人: VID SCALE, INC.

    摘要: Systems, methods, and instrumentalities are disclosed for discontinuous face boundary filtering for 360-degree video coding. A face discontinuity may be filtered (e.g., to reduce seam artifacts) in whole or in part, for example, using coded samples or padded samples on either side of the face discontinuity. Filtering may be applied, for example, as an in-loop filter or a post-processing step. 2D positional information related to two sides of the face discontinuity may be signaled in a video bitstream so that filtering may be applied independent of projection formats and/or frame packing techniques.

    SWITCHING LOGIC FOR BI-DIRECTIONAL OPTICAL FLOW

    公开(公告)号:US20220377316A1

    公开(公告)日:2022-11-24

    申请号:US17763024

    申请日:2020-09-23

    申请人: VID SCALE, INC.

    摘要: An apparatus may be configured to determine a reference picture listed in a first reference picture list and a reference picture listed in a second reference picture list, for a coding block. The apparatus may be configured to determine whether to perform bi-directional optical flow (BDOF) for the coding block based at least in part on whether a distance between a picture associated with the coding block and the reference picture listed in the first reference picture list differs from a distance between the picture associated with the coding block and the reference picture listed in the second reference picture list. The apparatus may be configured to decode the coding block based on the determination of whether to perform BDOF for the coding block.

    METHODS AND APPARATUS FOR KERNEL TENSOR AND TREE PARTITION BASED NEURAL NETWORK COMPRESSION FRAMEWORK

    公开(公告)号:US20220360778A1

    公开(公告)日:2022-11-10

    申请号:US17621476

    申请日:2020-07-02

    申请人: VID SCALE, INC.

    摘要: A method of encoding or decoding a video comprising a current picture, a first reference picture, and a weight tensor associated with a trained neural network (NN) model are provided. The method includes generating any number of kernel tensors, input channels and output channels associated with the weight tensor, each kernel tensor being associated with any of: a layer type, an input signal type, and a tree partition type, and each kernel tensor including weight coefficients, generating, for each of the any number of kernel tensors, tree partitions for any of a coding tree unit (CTU), a coding unit (CU), a prediction unit (PU), and a transform unit (TU) according to respective tree partition types associated with each of the any number of kernel tensors, and generating a compressed representation of the trained NN model by compressing and coding the any number of kernel tensors.