-
公开(公告)号:US20240137517A1
公开(公告)日:2024-04-25
申请号:US18397302
申请日:2023-12-27
Inventor: Chaoyi Lin , Yue Li , Kai Zhang , Zhaobin Zhang , Li Zhang
IPC: H04N19/132 , G06T3/4046 , G06T3/4053 , H04N19/33 , H04N19/59 , H04N19/88
CPC classification number: H04N19/132 , G06T3/4046 , G06T3/4053 , H04N19/33 , H04N19/59 , H04N19/88
Abstract: A method of processing video data. The method includes applying a super resolution (SR) process to a video unit at a specific position relative to one or more in-loop filters when the one or more in-loop filters are applied to the video unit, and performing a conversion between a video comprising the video unit and a bitstream of the video based on the SR process and the one or more in-loop filters as applied. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.
-
公开(公告)号:US20250142130A1
公开(公告)日:2025-05-01
申请号:US19010466
申请日:2025-01-06
Applicant: Bytedance Inc.
IPC: H04N19/85 , H04N19/176 , H04N19/186 , H04N19/70
Abstract: A mechanism for processing video data is disclosed. The mechanism determines to modify a video unit attendant to applying a video compression function. The modification may include applying a geometric conversion to the video unit. A conversion is performed between a visual media data and a bitstream based on the modified video unit.
-
公开(公告)号:US20250119541A1
公开(公告)日:2025-04-10
申请号:US18984150
申请日:2024-12-17
Applicant: Bytedance Inc.
IPC: H04N19/117 , H04N19/186 , H04N19/70 , H04N19/82
Abstract: A method implemented by a video coding apparatus. The method includes applying a neural network (NN) filter to an unfiltered sample of a video unit to generate a filtered sample. The NN filter includes an NN filter model generated based on partitioning information of the video unit. Usage of the NN filter is indicated by one or more syntax elements in a bitstream. A conversion is performed between a video media file and a bitstream based on the filtered sample.
-
公开(公告)号:US20250039401A1
公开(公告)日:2025-01-30
申请号:US18913890
申请日:2024-10-11
IPC: H04N19/154 , H04N19/176
Abstract: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: determining, for a conversion between a current video block of a video and a bitstream of the video, a distortion value of the current video block based on a set of distortion metrics, the set of distortion metrics comprising at least one of: a first distortion metric determined according to a first machine learning model, a second distortion metric determined according to a second machine learning model, or a third distortion metric determined without using the first and second machine learning models; and performing the conversion based on the distortion value. In this way, a rate-distortion optimization process based on the distortion value can be improved, and thus the coding performance can be enhanced.
-
公开(公告)号:US20240430482A1
公开(公告)日:2024-12-26
申请号:US18823504
申请日:2024-09-03
Inventor: Semih ESENLIK , Yaojun Wu , Zhaobin Zhang , Yue Li , Kai Zhang , Li Zhang
IPC: H04N19/61 , H04N19/124 , H04N19/132 , H04N19/42 , H04N19/463 , H04N19/91
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, an intermediate representation of the visual data, the intermediate representation being different from a quantized latent representation of the visual data and being generated based on at least one of the following: at least one parameter, at least a part of the quantized latent representation, a prediction of the at least a part of the quantized latent representation, or a difference between the prediction and the at least a part of the quantized latent representation; and performing, for the conversion, a synthesis transform on the intermediate representation, wherein the quantized latent representation is generated based on applying a first neural network to the visual data.
-
公开(公告)号:US20240364878A1
公开(公告)日:2024-10-31
申请号:US18759646
申请日:2024-06-28
Inventor: Wenbin YIN , Kai Zhang , Yue Li , Hongbin Liu , Li Zhang
IPC: H04N19/117 , H04N19/139 , H04N19/176 , H04N19/186 , H04N19/70 , H04N19/82
CPC classification number: H04N19/117 , H04N19/139 , H04N19/176 , H04N19/186 , H04N19/70 , H04N19/82
Abstract: Embodiments of the disclosure provide a solution for video processing. A method for video processing is proposed. The method includes: determining, during a conversion between a video unit of a video and a bitstream of the video unit, information of a previously coded picture associated with the video unit; during a filtering process, filtering at least one sample in a current picture associated with the video unit based on the information; and performing the conversion based on the filtered at least one sample.
-
公开(公告)号:US20240276020A1
公开(公告)日:2024-08-15
申请号:US18624889
申请日:2024-04-02
Applicant: Lemon Inc. , Beijing Bytedance Network Technology Co., Ltd. , Bytedance Inc. , Bytedance (HK) Limited
IPC: H04N19/82 , H04N19/117 , H04N19/70
CPC classification number: H04N19/82 , H04N19/117 , H04N19/70
Abstract: A method implemented by a video coding apparatus includes applying a neural network (NN) filter to an unfiltered sample of a video unit to generate a filtered sample. The NN filter is applied based on a syntax element of the video unit. The method also includes converting between a video media file and a bitstream based on the filtered sample that was generated.
-
公开(公告)号:US20240236327A1
公开(公告)日:2024-07-11
申请号:US18414665
申请日:2024-01-17
Inventor: Chaoyi Lin , Yue Li , Kai Zhang , Li Zhang
IPC: H04N19/132 , H04N19/117 , H04N19/186 , H04N19/42 , H04N19/70 , H04N19/80
CPC classification number: H04N19/132 , H04N19/117 , H04N19/186 , H04N19/42 , H04N19/70 , H04N19/80
Abstract: A video processing method includes determining information included in a bitstream that indicates whether down-sampling is performed on a video unit, and performing a conversion between the video unit and the bitstream based on the bitstream.
-
公开(公告)号:US20240236325A9
公开(公告)日:2024-07-11
申请号:US18399926
申请日:2023-12-29
Inventor: Chaoyi Lin , Yue Li , Kai Zhang , Zhaobin Zhang , Li Zhang
IPC: H04N19/132 , H04N19/154 , H04N19/186
CPC classification number: H04N19/132 , H04N19/154 , H04N19/186 , H04N19/625
Abstract: A method of processing video data. The method includes down-sampling a video unit of a video prior to application of a super resolution (SR) process and performing a conversion between the video including the video unit and a bitstream of the video based on the video unit as down-sampled. A corresponding video coding apparatus and non-transitory computer-readable recording medium are also disclosed.
-
公开(公告)号:US20250159258A1
公开(公告)日:2025-05-15
申请号:US19022954
申请日:2025-01-15
Applicant: Douyin Vision (Beijing) Co., Ltd. , Bytedance Inc.
Inventor: Semih Esenlik , Zhaobin Zhang , Yaojun Wu , Yue Li , Kai Zhang , Li Zhang
IPC: H04N19/625 , H04N19/124 , H04N19/63
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, region information indicating positions and sizes of a plurality of regions in a quantized latent representation of the visual data; selecting, based on the region information, a set of target neighboring samples from a plurality of candidate neighboring samples of a current sample in the quantized latent representation, the set of target neighboring samples being in the same region as the current sample; determining statistical information of the current sample based on the set of target neighboring samples; and performing the conversion based on the statistical information.
-
-
-
-
-
-
-
-
-