-
1.
Publication No.: WO2023049655A1
Publication date: 2023-03-30
Application No.: PCT/US2022/076496
Filing date: 2022-09-15
Inventors: ZHU, Yinhao; YANG, Yang; COHEN, Taco Sebastiaan
Abstract: Systems and techniques are described herein for processing media data using a neural network system. For instance, a process can include obtaining a latent representation of a frame of encoded image data and generating, by a plurality of decoder transformer layers of a decoder sub-network using the latent representation of the frame of encoded image data as input, a frame of decoded image data. At least one decoder transformer layer of the plurality of decoder transformer layers includes: one or more transformer blocks for generating one or more patches of features and determining self-attention locally within one or more window partitions and shifted window partitions applied over the one or more patches; and a patch un-merging engine for decreasing a respective size of each patch of the one or more patches.
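The window partitions and shifted window partitions mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not say how the shift is realised; the cyclic roll used here is one common choice (as in Swin-style transformers) and is an assumption, as are the function names, the 4x4 grid, and the window size of 2.

```python
from typing import List

Grid = List[List[int]]


def cyclic_shift(grid: Grid, shift: int) -> Grid:
    """Roll the grid up and to the left by `shift` positions, wrapping around."""
    h, w = len(grid), len(grid[0])
    return [[grid[(r + shift) % h][(c + shift) % w] for c in range(w)]
            for r in range(h)]


def window_partition(grid: Grid, window: int) -> List[Grid]:
    """Split an H x W grid into non-overlapping window x window tiles."""
    h, w = len(grid), len(grid[0])
    if h % window or w % window:
        raise ValueError("grid size must be a multiple of the window size")
    return [[row[c:c + window] for row in grid[r:r + window]]
            for r in range(0, h, window)
            for c in range(0, w, window)]


# Hypothetical 4x4 grid of patch indices 0..15 with a window size of 2.
patches = [[r * 4 + c for c in range(4)] for r in range(4)]
regular = window_partition(patches, 2)                   # plain windows
shifted = window_partition(cyclic_shift(patches, 1), 2)  # shifted windows
```

Self-attention would then be computed independently inside each tile of `regular` and `shifted`, so patches that sit in different regular windows can still interact through a shifted window.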
-
2.
Publication No.: WO2023048070A1
Publication date: 2023-03-30
Application No.: PCT/JP2022/034656
Filing date: 2022-09-16
IPC classification: H04N19/59
Abstract: This disclosure describes a method of compressing feature data corresponding to video data. The method comprises: generating feature data including a number of channels corresponding to a scale for each of N pictures included in video data; concatenating the generated feature data along the channel dimension; reducing the number of channels in the concatenated feature data to generate reduced concatenated feature data; and encoding the reduced concatenated feature data into a bitstream.
-
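3.
Publication No.: WO2023026645A1
Publication date: 2023-03-02
Application No.: PCT/JP2022/023843
Filing date: 2022-06-14
IPC classification: H04N19/59
Abstract: An encoding device (100) includes circuitry and memory. The circuitry registers one or more of a plurality of first reference picture candidates in a first reference picture list, selects a first reference picture from the first reference picture list, and encodes a current block using a first reference block in the first reference picture and RPR (reference picture resampling). In RPR, the first reference block is resampled when the picture size of the first reference picture differs from the picture size of the current picture. For each of the plurality of first reference picture candidates, the circuitry registers that candidate in the first reference picture list if the candidate has a first picture size.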
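The registration rule in this abstract can be sketched as a simple size filter. This is only an illustration: the `Picture` type, its `poc` field, and the choice to leave out candidates of any other size are assumptions, since the abstract states only the condition under which a candidate is registered.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Picture:
    poc: int                 # hypothetical picture identifier
    size: Tuple[int, int]    # (width, height) in luma samples


def build_reference_list(candidates: List[Picture],
                         first_size: Tuple[int, int]) -> List[Picture]:
    """Register only the candidates whose size equals `first_size`."""
    return [pic for pic in candidates if pic.size == first_size]


def needs_resampling(reference: Picture, current: Picture) -> bool:
    """Under RPR the reference block is resampled only when the sizes differ."""
    return reference.size != current.size


candidates = [
    Picture(0, (1920, 1080)),
    Picture(1, (960, 540)),
    Picture(2, (1920, 1080)),
]
ref_list = build_reference_list(candidates, first_size=(1920, 1080))
```

With these example values, only the two 1920x1080 candidates are registered, and a 960x540 current picture that references either of them would trigger resampling.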
-
4.
Publication No.: WO2022268907A1
Publication date: 2022-12-29
Application No.: PCT/EP2022/067064
Filing date: 2022-06-22
Applicant: VALEO VISION
Inventor: ALMEHIO, Yasser
IPC classification: H04N19/93, H04N19/59, F21S41/153, H03M7/30, H03M7/46, G06T9/00, G06T9/005, H03M7/48, H04N19/132, H04N19/182, H04N19/98
Abstract: The invention relates to a method for managing an image in an automotive lighting device (10). The method comprises the steps of providing a first image pattern (1) comprising a plurality of pixels (11), selecting a relevant portion of the value of each pixel, and preparing compressed data related to the relevant values, together with data related to the position of each pixel with a value equal to zero.
-
5.
Publication No.: WO2022265627A1
Publication date: 2022-12-22
Application No.: PCT/US2021/037593
Filing date: 2021-06-16
Applicant: GOOGLE LLC
Inventors: GULERYUZ, Onur G.; DU, Ruofei; HOPPE, Hugues H.; FANELLO, Sean Ryan Francesco; CHOU, Philip Andrew; TANG, Danhang; DAVIDSON, Philip
IPC classification: H04N19/117, H04N19/147, G06N3/02, G06N3/08, H04N19/85, G06N20/00, H04N19/186, H04N19/59, G06N3/045, G06N3/084
Abstract: Nonlinear peri-codec optimization for image and video coding includes obtaining a source image including pixel values expressed in a first defined image sample space, generating a neuralized image representing the source image, the neuralized image including pixel values that are expressed as neural latent space values, encoding the input image wherein the neural latent space values are used as pixel values in a second defined image sample space and the input image is in an operative image format of the encoder, such that a decoder decodes the encoded image to obtain a reconstructed image in the second defined image sample space, wherein the reconstructed image is a reconstructed neuralized image including reconstructed neural latent space values, such that a deneuralized reconstructed image corresponding to the source image is obtained by a nonlinear post-codec image processor in the first defined image sample space.
-
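6.
Publication No.: WO2022264622A1
Publication date: 2022-12-22
Application No.: PCT/JP2022/015302
Filing date: 2022-03-29
Applicant: Sharp Corporation (シャープ株式会社)
IPC classification: H04N19/503, H04N19/59, H04N19/85
Abstract: When inverse resolution conversion is performed by selecting, from a predetermined set of model parameters, the model parameters suited to an input video and applying them, a video that does not suit any of the model parameters may end up with low quality. A prediction unit is provided that includes a neural network that performs scaling by a rational factor and an interpolation unit that performs interpolation by a rational factor. From the actual width and height of a reference image and the actual width and height of a target image, a first scaling factor for the neural network and a second scaling factor for the interpolation unit are derived, and an interpolated image is derived using the first scaling by the neural network and the second scaling by the interpolation unit.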
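One way to picture the derivation of the two scaling factors is to treat the overall ratio between target and reference size as an exact rational number and split it into a network part and an interpolation part. The sketch below is a guess at such a split, not the method claimed: the function name, the set of factors the network supports, and the rule "use the largest supported factor that does not exceed the total" are all assumptions.

```python
from fractions import Fraction
from typing import Sequence, Tuple


def split_scaling(ref_size: int, target_size: int,
                  nn_factors: Sequence[Fraction]) -> Tuple[Fraction, Fraction]:
    """Return (first, second) such that first * second == target_size / ref_size."""
    total = Fraction(target_size, ref_size)
    # Assumed rule: take the largest supported network factor that does not
    # exceed the total; fall back to 1 (no network scaling) if none qualifies.
    usable = [f for f in nn_factors if f <= total]
    first = max(usable, default=Fraction(1))
    second = total / first   # remainder handled by the interpolation unit
    return first, second


# Hypothetical network that supports 2x and 3/2x scaling.
supported = [Fraction(2), Fraction(3, 2)]
width_factors = split_scaling(1280, 3840, supported)   # overall 3x
height_factors = split_scaling(720, 2160, supported)   # overall 3x
```

The function would be applied once to the widths and once to the heights. Keeping the factors as `Fraction` values avoids floating-point rounding when the ratio is not an integer.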
-
7.
Publication No.: WO2022200130A1
Publication date: 2022-09-29
Application No.: PCT/EP2022/056732
Filing date: 2022-03-15
IPC classification: H04N19/59, H04N19/593, H04N19/80, H04N19/176
Abstract: Several methods are described to jointly use the ABT (Asymmetric Binary Tree) partitioning mode and Matrix-based Intra Prediction (MIP). In a first embodiment, we propose to forbid the use of the MIP intra prediction mode for block sizes that result from ABT partitioning. In a second embodiment, we propose to allow MIP intra prediction for block sizes not equal to a power of two in width or height, by extending the block before MIP and cropping the predicted block to the original size after MIP. In a third embodiment, we propose to adapt the down-sampling of the boundary reference samples and the up-sampling of the reduced predicted blocks to the block sizes introduced by ABT partitioning. In a further embodiment, we set the reduced predicted block to size 8x8 in every case where the initial block size is 8 or larger in a direction.
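The second embodiment (extend the block, predict, then crop) can be sketched as follows. The abstract does not say how the block is extended or which region is kept after prediction; rounding each dimension up to the next power of two and keeping the top-left region are assumptions, and `dummy_predictor` is a hypothetical stand-in that does not implement MIP.

```python
from typing import Callable, List

Block = List[List[int]]


def next_power_of_two(n: int) -> int:
    """Smallest power of two that is >= n (n must be positive)."""
    return 1 << (n - 1).bit_length()


def predict_with_extension(height: int, width: int,
                           predictor: Callable[[int, int], Block]) -> Block:
    """Predict at power-of-two size, then crop back to height x width."""
    ext_h, ext_w = next_power_of_two(height), next_power_of_two(width)
    predicted = predictor(ext_h, ext_w)
    # Assumed crop: keep the top-left region matching the original size.
    return [row[:width] for row in predicted[:height]]


def dummy_predictor(h: int, w: int) -> Block:
    """Hypothetical placeholder for MIP: a simple gradient, not real MIP."""
    return [[r + c for c in range(w)] for r in range(h)]


# A 4x12 block, a size that asymmetric partitioning can produce.
block = predict_with_extension(4, 12, dummy_predictor)
```

Here the 4x12 block is predicted as a 4x16 block and the last four columns are discarded, so the predictor only ever sees power-of-two dimensions.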
-
8.
Publication No.: WO2022180031A1
Publication date: 2022-09-01
Application No.: PCT/EP2022/054392
Filing date: 2022-02-22
Inventors: BORDES, Philippe; GALPIN, Franck; NASER, Karam; CHEN, Ya; DUMAS, Thierry; ROBERT, Antoine
IPC classification: H04N19/117, H04N19/172, H04N19/59, H04N19/82
Abstract: A method for reconstructing at least one part of a first picture from at least one part of a second picture is provided, said first picture and said second picture having different sizes. The reconstructing comprises decoding said second picture from a bitstream and determining at least one first sample of said at least one part of the first picture using at least one resampling filter applied to at least one second sample of said at least one part of the decoded second picture. A corresponding apparatus for reconstructing at least one part of a first picture is provided. A method for encoding/decoding a video, and corresponding apparatuses, are also provided, which comprise the reconstructing of at least one part of a first picture from at least one part of a second picture, said first picture and said second picture having different sizes.
-
9.
Publication No.: WO2022084702A1
Publication date: 2022-04-28
Application No.: PCT/GB2021/052770
Filing date: 2021-10-25
Applicant: DEEP RENDER LTD
Inventors: BESENBRUCH, Chri; CHERGANSKI, Aleksandar; FINLAY, Christopher; LYTCHIER, Alexander; RAYNER, Jonathan; RYDER, Tom; XU, Jan; ZAFAR, Arsalan
Abstract: There is disclosed a computer-implemented method for lossy or lossless image or video compression and transmission, the method including the steps of: (i) receiving an input image; (ii) encoding the input image using an encoder trained neural network, to produce a y latent representation; (iii) encoding the y latent representation using a hyperencoder trained neural network, to produce a z hyperlatent representation; (iv) quantizing the z hyperlatent representation using a predetermined entropy parameter to produce a quantized z hyperlatent representation; (v) entropy encoding the quantized z hyperlatent representation into a first bitstream, using predetermined entropy parameters; (vi) processing the quantized z hyperlatent representation using a hyperdecoder trained neural network to obtain a location entropy parameter µ_y, an entropy scale parameter σ_y, and a context matrix A_y of the y latent representation; (vii) processing the y latent representation, the location entropy parameter µ_y and the context matrix A_y, using an implicit encoding solver, to obtain quantized latent residuals; (viii) entropy encoding the quantized latent residuals into a second bitstream, using the entropy scale parameter σ_y; and (ix) transmitting the first bitstream and the second bitstream. Related computer-implemented methods, systems, computer-implemented training methods and computer program products are disclosed.
-
10.
Publication No.: WO2022073811A1
Publication date: 2022-04-14
Application No.: PCT/EP2021/076733
Filing date: 2021-09-29
IPC classification: H04N19/117, H04N19/176, H04N19/136, H04N19/59, H04N19/70, H04N19/82, H04N19/86
Abstract: A method for decoding video pictures, comprising: reconstructing (701) a picture at a first spatial resolution; and obtaining (702) metadata associated with said picture, representative of information specifying that applying at least one in-loop filtering and/or at least one post-filtering on at least a portion of said reconstructed picture at a second spatial resolution different from the first resolution is allowed.
-