PARTITIONING INFORMATION IN NEURAL NETWORK-BASED VIDEO CODING

    Publication No.: US20250119541A1

    Publication Date: 2025-04-10

    Application No.: US18984150

    Filing Date: 2024-12-17

    Applicant: Bytedance Inc.

    Abstract: A method implemented by a video coding apparatus. The method includes applying a neural network (NN) filter to an unfiltered sample of a video unit to generate a filtered sample. The NN filter includes an NN filter model generated based on partitioning information of the video unit. Usage of the NN filter is indicated by one or more syntax elements in a bitstream. A conversion is performed between a video media file and a bitstream based on the filtered sample.

    METHOD, APPARATUS, AND MEDIUM FOR VIDEO PROCESSING

    Publication No.: US20250039401A1

    Publication Date: 2025-01-30

    Application No.: US18913890

    Filing Date: 2024-10-11

    Abstract: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a distortion value of the current video block based on a set of distortion metrics, the set of distortion metrics comprising at least one of: a first distortion metric determined according to a first machine learning model, a second distortion metric determined according to a second machine learning model, or a third distortion metric determined without using the first and second machine learning models; and performing the conversion based on the distortion value. In this way, a rate-distortion optimization process based on the distortion value can be improved, and thus the coding performance can be enhanced.
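
    Illustrative sketch: The abstract does not say how the individual distortion metrics are combined or how the resulting value enters rate-distortion optimization. The Python below is a minimal sketch under two assumptions that are not taken from the filing: the metrics are merged by a weighted sum, and the value is used in the conventional Lagrangian cost J = D + lambda * R. The names `mse`, `learned_metric_stub`, `combined_distortion`, and `rd_cost` are hypothetical, and `learned_metric_stub` is only a stand-in for a metric produced by a machine learning model.

```python
from typing import Callable, Sequence

Block = Sequence[float]
Metric = Callable[[Block, Block], float]


def mse(orig: Block, recon: Block) -> float:
    """Conventional metric computed without any learned model."""
    return sum((o - r) ** 2 for o, r in zip(orig, recon)) / len(orig)


def learned_metric_stub(orig: Block, recon: Block) -> float:
    """Hypothetical placeholder for a model-based metric.

    A real system would run a trained network here; this stub just
    returns the largest absolute sample error.
    """
    return max(abs(o - r) for o, r in zip(orig, recon))


def combined_distortion(orig: Block, recon: Block,
                        metrics: Sequence[Metric],
                        weights: Sequence[float]) -> float:
    """Weighted sum of the selected metrics (an assumption of this sketch)."""
    return sum(w * m(orig, recon) for m, w in zip(metrics, weights))


def rd_cost(distortion: float, rate_bits: float, lam: float) -> float:
    """Standard Lagrangian rate-distortion cost J = D + lambda * R."""
    return distortion + lam * rate_bits
```

    An encoder following this sketch would evaluate `rd_cost` for each candidate coding mode of the current block and keep the mode with the lowest cost.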

    METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    Publication No.: US20240430482A1

    Publication Date: 2024-12-26

    Application No.: US18823504

    Filing Date: 2024-09-03

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, an intermediate representation of the visual data, the intermediate representation being different from a quantized latent representation of the visual data and being generated based on at least one of the following: at least one parameter, at least a part of the quantized latent representation, a prediction of the at least a part of the quantized latent representation, or a difference between the prediction and the at least a part of the quantized latent representation; and performing, for the conversion, a synthesis transform on the intermediate representation, wherein the quantized latent representation is generated based on applying a first neural network to the visual data.

    METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    Publication No.: US20250159258A1

    Publication Date: 2025-05-15

    Application No.: US19022954

    Filing Date: 2025-01-15

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, region information indicating positions and sizes of a plurality of regions in a quantized latent representation of the visual data; selecting, based on the region information, a set of target neighboring samples from a plurality of candidate neighboring samples of a current sample in the quantized latent representation, the set of target neighboring samples being in the same region as the current sample; determining statistical information of the current sample based on the set of target neighboring samples; and performing the conversion based on the statistical information.
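
    Illustrative sketch: The neighbor selection step described above can be pictured with the Python below. It is a sketch under stated assumptions, not the method claimed in the filing: regions are axis-aligned rectangles given as (top, left, height, width), the candidate neighbors are four causal positions (left, top-left, top, top-right), and the "statistical information" is a mean and variance. All function names, the offset list, and the fallback value for a sample with no usable neighbor are hypothetical.

```python
from typing import List, Sequence, Tuple

Region = Tuple[int, int, int, int]  # (top, left, height, width), assumed layout

# Assumed candidate neighbors: already-decoded positions in raster order.
CAUSAL_OFFSETS = [(0, -1), (-1, -1), (-1, 0), (-1, 1)]


def region_of(y: int, x: int, regions: Sequence[Region]) -> int:
    """Return the index of the region containing (y, x), or -1 if none."""
    for i, (top, left, h, w) in enumerate(regions):
        if top <= y < top + h and left <= x < left + w:
            return i
    return -1


def target_neighbors(y: int, x: int, height: int, width: int,
                     regions: Sequence[Region]) -> List[Tuple[int, int]]:
    """Keep only candidates inside the latent and in the current sample's region."""
    cur = region_of(y, x, regions)
    kept = []
    for dy, dx in CAUSAL_OFFSETS:
        ny, nx = y + dy, x + dx
        inside = 0 <= ny < height and 0 <= nx < width
        if inside and region_of(ny, nx, regions) == cur:
            kept.append((ny, nx))
    return kept


def local_stats(latent: Sequence[Sequence[float]], y: int, x: int,
                regions: Sequence[Region]) -> Tuple[float, float]:
    """Mean and variance of the selected neighbors of sample (y, x)."""
    nbrs = target_neighbors(y, x, len(latent), len(latent[0]), regions)
    if not nbrs:
        return 0.0, 1.0  # assumed default when no same-region neighbor exists
    vals = [latent[ny][nx] for ny, nx in nbrs]
    mean = sum(vals) / len(vals)
    var = sum((v - mean) ** 2 for v in vals) / len(vals)
    return mean, var
```

    Because neighbors outside the current region are never read, each region in this sketch can be entropy coded without depending on samples from any other region.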
