METHOD, APPARATUS, AND MEDIUM FOR VISUAL DATA PROCESSING

    公开(公告)号:US20250159258A1

    公开(公告)日:2025-05-15

    申请号:US19022954

    申请日:2025-01-15

    Abstract: Embodiments of the present disclosure provide a solution for visual data processing. A method for visual data processing is proposed. The method comprises: obtaining, for a conversion between visual data and a bitstream of the visual data, region information indicating positions and sizes of a plurality of regions in a quantized latent representation of the visual data; selecting, based on the region information, a set of target neighboring samples from a plurality of candidate neighboring samples of a current sample in the quantized latent representation, the set of target neighboring samples being in the same region as the current sample; determining statistical information of the current sample based on the set of target neighboring samples; and performing the conversion based on the statistical information.

    NEURAL NETWORK-BASED ADAPTIVE IMAGE AND VIDEO COMPRESSION METHOD

    公开(公告)号:US20250168370A1

    公开(公告)日:2025-05-22

    申请号:US19033178

    申请日:2025-01-21

    Applicant: Bytedance Inc.

    Abstract: An image decoding method including transforming an input image into latent samples using an analysis transform; quantizing the latent samples using a hyper encoder to generate quantized hyper latent samples; encoding the quantized hyper latent samples into a bitstream using entropy encoding; applying a latent sample prediction process to obtain quantized latent samples and quantized residual latent samples based on the latent samples using the quantized hyper latent samples; obtaining prediction samples following the latent sample prediction process; and entropy encoding the quantized hyper latent samples and the quantized residual latent samples into the bitstream.

    NEURAL NETWORK-BASED IMAGE AND VIDEO COMPRESSION METHOD WITH PARALLEL PROCESSING

    公开(公告)号:US20250159214A1

    公开(公告)日:2025-05-15

    申请号:US19021539

    申请日:2025-01-15

    Applicant: Bytedance Inc.

    Abstract: An image decoding method including obtaining reconstructed latents ŷ[:,:,:] using an arithmetic decoder; feeding the reconstructed latents into a synthesis neural network; tile partitioning output feature maps into multiple parts based on decoded parameters at one or multiple locations; separately feeding each of the multiple parts into a next stage of a plurality of convolutional layers to obtain spatially partitioned feature maps at an output; and cropping and stitching the spatially partitioned feature maps back to a whole feature map spatially until an image is reconstructed.

Patent Agency Ranking