METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    Publication No.: US20240430430A1

    Publication date: 2024-12-26

    Application No.: US18823515

    Filing date: 2024-09-03

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, statistical information associated with a first representation of the visual data, the first representation being generated based on applying a first neural network to the visual data; determining at least one sample from a second representation of the visual data based on the statistical information, a value of the at least one sample being absent from the bitstream, the second representation being obtained by quantizing the first representation; and performing the conversion based on the determining.
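To make the idea of samples whose values are absent from the bitstream concrete, here is a minimal sketch. It assumes, purely for illustration, that the statistical information is a per-sample mean and scale, and that a sample is left out of the bitstream when its scale falls below a threshold; the abstract does not specify either. The function names and the threshold value are hypothetical.

```python
def split_coded_and_inferred(means, scales, threshold=0.1):
    """Decide which quantized samples are coded and which are inferred.

    Assumption (not from the abstract): a sample whose predicted scale is
    below `threshold` is skipped and inferred as the rounded mean.
    """
    coded_indices = []
    inferred_values = {}
    for i, (mu, sigma) in enumerate(zip(means, scales)):
        if sigma < threshold:
            inferred_values[i] = round(mu)  # value never written to the bitstream
        else:
            coded_indices.append(i)         # value must be entropy coded
    return coded_indices, inferred_values


def reconstruct(means, scales, coded_values, threshold=0.1):
    """Rebuild the full quantized representation on the decoder side."""
    coded_indices, inferred_values = split_coded_and_inferred(means, scales, threshold)
    if len(coded_indices) != len(coded_values):
        raise ValueError("number of decoded values does not match coded positions")
    samples = dict(inferred_values)
    samples.update(zip(coded_indices, coded_values))
    return [samples[i] for i in range(len(means))]
```

Because the encoder and decoder both derive the skip decision from the same statistics, no side information about skipped positions is needed in this sketch.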

METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    Publication No.: US20250168412A1

    Publication date: 2025-05-22

    Application No.: US19034280

    Filing date: 2025-01-22

    Applicant: Bytedance Inc.

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: determining, for a conversion between at least one bitstream of visual data and the visual data, a residual representation of the visual data at least based on a first probability distribution parameter of the visual data and a gain parameter, the residual representation representing a residual value compared to a second probability distribution representation of the visual data, the gain parameter adjusting a value range of the residual representation; and performing the conversion based on the residual representation.
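One way to picture a gain parameter that adjusts the value range of a residual is sketched below. It assumes the first probability distribution parameter is a per-sample mean and that the gain is a single multiplicative scalar applied before rounding; both are illustrative assumptions, and the function names are hypothetical.

```python
def encode_residual(latent, means, gain):
    """Quantized residual relative to the predicted mean.

    Assumption: a larger `gain` widens the residual's value range
    (finer quantization); a smaller one narrows it.
    """
    return [round((y - mu) * gain) for y, mu in zip(latent, means)]


def decode_latent(residual, means, gain):
    """Invert the gain and add the mean back to recover the latent."""
    return [r / gain + mu for r, mu in zip(residual, means)]
```

With `latent = [1.3, -0.6]`, `means = [1.0, 0.0]`, and `gain = 4.0`, the residual is `[1, -2]` and the decoded latent is `[1.25, -0.5]`.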

METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    Publication No.: US20240430482A1

    Publication date: 2024-12-26

    Application No.: US18823504

    Filing date: 2024-09-03

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, an intermediate representation of the visual data, the intermediate representation being different from a quantized latent representation of the visual data and being generated based on at least one of the following: at least one parameter, at least a part of the quantized latent representation, a prediction of the at least a part of the quantized latent representation, or a difference between the prediction and the at least a part of the quantized latent representation; and performing, for the conversion, a synthesis transform on the intermediate representation, wherein the quantized latent representation is generated based on applying a first neural network to the visual data.
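The abstract lists several possible inputs for the intermediate representation. The sketch below picks one hypothetical combination: the prediction plus a scaled difference between the quantized latent and the prediction. The scaling rule, the function names, and the stand-in synthesis transform are all assumptions made for illustration, not details taken from the application.

```python
def build_intermediate(quantized_latent, prediction, scale):
    """Form an intermediate representation that differs from the quantized latent.

    Assumption: the difference between the latent and its prediction is
    scaled by `scale` (the "at least one parameter") and added back.
    """
    return [p + scale * (q - p) for q, p in zip(quantized_latent, prediction)]


def toy_synthesis(intermediate):
    """Placeholder for a learned synthesis transform (a real one is a neural network)."""
    return [2.0 * v for v in intermediate]


def decode(quantized_latent, prediction, scale):
    """Run the synthesis transform on the intermediate, not on the latent itself."""
    return toy_synthesis(build_intermediate(quantized_latent, prediction, scale))
```

With `scale = 1.0` this sketch reduces to feeding the quantized latent directly to the synthesis transform, which is the case the abstract distinguishes itself from.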
