-
公开(公告)号:US20240373048A1
公开(公告)日:2024-11-07
申请号:US18778818
申请日:2024-07-19
Inventor: Semih ESENLIK , Yaojun WU , Zhaobin ZHANG , Yue LI , Kai ZHANG , Li ZHANG
IPC: H04N19/42 , H04N19/124 , H04N19/132 , H04N19/136 , H04N19/184 , H04N19/192
Abstract: Embodiments of the present disclosure provide a solution for data processing. A method for data processing is proposed. The method comprises: determining, during a conversion between data and a bitstream of the data, a first part of a first sample of a reconstructed latent representation of the data, the first part indicating a prediction of the first sample; determining a second part of the first sample, the second part indicating a difference between the first sample and the first part; and performing the conversion based on the second part.
-
公开(公告)号:US20240430430A1
公开(公告)日:2024-12-26
申请号:US18823515
申请日:2024-09-03
Inventor: Semih ESENLIK , Zhaobin ZHANG , Yaojun WU , Kai ZHANG , Yue LI , Li ZHANG
IPC: H04N19/132 , H04N19/124
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, statistical information associated with a first representation of the visual data, the first representation being generated based on applying a first neural network to the visual data; determining at least one sample from a second representation of the visual data based on the statistical information, a value of the at least one sample being absent from the bitstream, the second representation being obtained by quantizing the first representation; and performing the conversion based on the determining.
-
公开(公告)号:US20240380904A1
公开(公告)日:2024-11-14
申请号:US18778813
申请日:2024-07-19
Inventor: Semih ESENLIK , Yaojun WU , Zhaobin ZHANG , Yue LI , Kai ZHANG , Li ZHANG
IPC: H04N19/189 , H04N19/119 , H04N19/124 , H04N19/132 , H04N19/436
Abstract: Embodiments of the present disclosure provide a solution for data processing. A method for data processing is proposed. The method comprises: processing, during a conversion between data and a bitstream of the data, a first set of samples of a reconstructed latent representation of the data and a second set of samples of the reconstructed latent representation by using a model, the first set of samples being associated with a first sample of the reconstructed latent representation and the second set of samples being associated with a second sample of the reconstructed latent representation; and performing the conversion based on a result of the processing.
-
公开(公告)号:US20240430428A1
公开(公告)日:2024-12-26
申请号:US18823527
申请日:2024-09-03
Inventor: Semih ESENLIK , Zhaobin ZHANG , Yaojun WU , Kai ZHANG , Yue LI , Li ZHANG
IPC: H04N19/124 , H04N19/119 , H04N19/132 , H04N19/42 , H04N19/463 , H04N19/60 , H04N19/91
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, a first representation of the visual data, the first representation being obtained by quantizing a second representation of the visual data, the second representation being generated based on applying a first neural network to the visual data; adjusting a plurality of sets of first samples of the first representation with different parameters; and performing the conversion based on the plurality of sets of adjusted first samples.
-
公开(公告)号:US20250168412A1
公开(公告)日:2025-05-22
申请号:US19034280
申请日:2025-01-22
Applicant: Bytedance Inc.
Inventor: Zhaobin ZHANG , Semih ESENLIK , Kai ZHANG , Li ZHANG
IPC: H04N19/85 , H04N19/186 , H04N19/80
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: determining, for a conversion between at least one bitstream of visual data and the visual data, a residual representation of the visual data at least based on a first probability distribution parameter of the visual data and a gain parameter, the residual representation representing a residual value compared to a second probability distribution representation of the visual data, the gain parameter adjusting a value range of the residual representation; and performing the conversion based on the residual representation.
-
公开(公告)号:US20240430482A1
公开(公告)日:2024-12-26
申请号:US18823504
申请日:2024-09-03
Inventor: Semih ESENLIK , Yaojun Wu , Zhaobin Zhang , Yue Li , Kai Zhang , Li Zhang
IPC: H04N19/61 , H04N19/124 , H04N19/132 , H04N19/42 , H04N19/463 , H04N19/91
Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, an intermediate representation of the visual data, the intermediate representation being different from a quantized latent representation of the visual data and being generated based on at least one of the following: at least one parameter, at least a part of the quantized latent representation, a prediction of the at least a part of the quantized latent representation, or a difference between the prediction and the at least a part of the quantized latent representation; and performing, for the conversion, a synthesis transform on the intermediate representation, wherein the quantized latent representation is generated based on applying a first neural network to the visual data.
-
-
-
-
-