SIMPLIFICATION OF BI-DIRECTIONAL OPTICAL FLOW CALCULATION IN VIDEO CODING

    公开(公告)号:WO2020236400A1

    公开(公告)日:2020-11-26

    申请号:PCT/US2020/030064

    申请日:2020-04-27

    IPC分类号: H04N19/42 H04N19/577

    摘要: A video coder is configured to code a block of video data using bi-prediction with bi-directional optical flow. The video coder may determine an offset using bi-directional optical flow and may add the offset to prediction samples determined from the bi-prediction. In one example, the video coder code a current block of video data using bi-prediction and bi-directional optical flow, wherein the bi-directional flow does not include one or more of a rounding operation or a division by 2 in an offset calculation. Additionally, the video coder may perform a motion vector refinement calculation for the bi-directional flow, wherein the motion vector refinement calculation is compensated to account for the offset calculation not including the division by 2.

    LOW DISPLACEMENT RANK BASED DEEP NEURAL NETWORK COMPRESSION

    公开(公告)号:WO2020190696A1

    公开(公告)日:2020-09-24

    申请号:PCT/US2020/022585

    申请日:2020-03-13

    IPC分类号: H04N19/42 G06N3/08

    摘要: A method and an apparatus for performing deep neural network compression use an approximation training set along with information, such as in matrices representing weights, biases and non-linearities, to iteratively compress a p re-trained deep neural network by low displacement rank based approximation of the network layer weight matrices. The low displacement rank approximation allows for replacement of an original layer weight matrices of the pre-trained deep neural network as the sum of a small number of structured matrices, allowing compression and low inference complexity.

    크로마 포맷에 대한 정보를 시그널링 하는 방법 및 장치

    公开(公告)号:WO2020189960A1

    公开(公告)日:2020-09-24

    申请号:PCT/KR2020/003463

    申请日:2020-03-12

    摘要: 본 개시에 따른 디코딩 장치에 의하여 수행되는 영상 디코딩 방법은, 현재 블록에 대한 예측 정보를 포함하는 비트스트림을 수신하는 단계, 상기 현재 블록에 대한 상기 예측 정보에 포함된, 상기 현재 블록에 대한 크로마 포맷 샘플링 구조를 나타내는 크로마 포맷 인덱스 및 세 컬러 컴포넌츠가 분리되어 코딩되는지 여부를 나타내는 세퍼레이트 컬러 플래인 플래그를 기반으로, 상기 현재 블록에 대한 크로마 어레이 타입을 도출하는 단계, 상기 도출된 크로마 어레이 타입을 기반으로 상기 현재 블록에 대한 예측 샘플들을 도출하는 단계 및 상기 예측 샘플들을 기반으로 상기 현재 블록에 대한 복원 샘플들을 도출하는 단계를 포함하는 것을 특징으로 한다.

    PARALLEL CODING OF SYNTAX ELEMENTS FOR JPEG ACCELERATOR

    公开(公告)号:WO2020092795A3

    公开(公告)日:2020-05-07

    申请号:PCT/US2019/059203

    申请日:2019-10-31

    申请人: FUNGIBLE, INC.

    摘要: A device includes a memory configured to store image data and an image coding unit implemented in circuitry. The image coding unit is configured to code a first value of a first instance of a first syntax element of a first block of image data and determine a first context for coding a second value of a second instance of the first syntax element of a second block of the image data. The image coding unit is configured to context-based code the second value of the second instance of the first syntax element of the second block of the image data after coding the first value of the first instance of the first syntax element using the first context and code a third value of a first instance of a second syntax element of the first block in parallel with coding the second value or after coding the second value.

    動画像符号化方法及び動画像符号化装置

    公开(公告)号:WO2020054060A1

    公开(公告)日:2020-03-19

    申请号:PCT/JP2018/034232

    申请日:2018-09-14

    摘要: 動画像符号化方法は、動画像の符号化のための所定の第一のモード群(11a)から、少なくとも一つのモードを第一の候補モード(11b)として選択する第一のモード選択ステップ(S10)と、選択された第一の候補モード(11b)に基づいて、所定の第二のモード群(13a)から、一つのモードを符号化モード(14a)として選択する第二のモード選択ステップ(S11)と、選択された符号化モード(14a)で動画像を符号化する符号化ステップ(S12)とを含む。

    FLEXIBLE IMPLEMENTATIONS OF MULTIPLE TRANSFORMS

    公开(公告)号:WO2019212987A1

    公开(公告)日:2019-11-07

    申请号:PCT/US2019/029731

    申请日:2019-04-29

    摘要: In video coding, a transform can be selected from multiple transform sets. To efficiently implement multiple transforms, a unified architecture of implementing the Discrete Trigonometric Transforms (DTTs) or flipped DTT can be used. In the proposed unified architecture, the relationships between the transforms are utilized. In particular, all transforms can be implemented based on DCT-II, DCT-IV, a reverse order operation, and a sign changing operation for odd elements. The DCT-II can be implemented at a minimum size, and other sizes for DCT-II can be implemented recursively from the minimum size DCT-II and DCT-IV at various sizes. In one example, the multiple transforms are {DCT-II, DST-II, DCT-III, DST-III, DCT-IV, DST-IV}. The relationships between transforms can also be used to guide the design of additional transforms that can be implemented by the unified architecture.

    一种基于压缩感知的质量可分级快速编码方法

    公开(公告)号:WO2019179096A1

    公开(公告)日:2019-09-26

    申请号:PCT/CN2018/111537

    申请日:2018-10-24

    摘要: 本发明公开了一种基于压缩感知的质量可分级快速编码方法,属于视频编码技术领域。本发明方法利用压缩感知理论的稀疏性,在对质量可分级增强层进行编码时对残差块尺寸为8x8的子块进行稀疏表示,编码时为了满足标准编码结构提出补0操作再进行熵编码。本发明还利用了基本层和增强层之间的层间相关性来快速选择子块编码模式以进一步降低编码算法的计算复杂度。相比现有技术,本发明方法能够在保持编码后图像质量的前提下,有效地降低编码端的码率,提高编码器的编码效率。

    NEW SAMPLE SETS AND NEW DOWN-SAMPLING SCHEMES FOR LINEAR COMPONENT SAMPLE PREDICTION

    公开(公告)号:WO2019162414A1

    公开(公告)日:2019-08-29

    申请号:PCT/EP2019/054377

    申请日:2019-02-21

    摘要: The disclosure regards cross-component prediction and methods for deriving of a linear model for obtaining a first-component sample for a first- component block from an associated reconstructed second-component sample of a second-component block in the same frame, the method comprising determining the parameters of a linear equation representing a straight line passing through two points, each point being defined by two variables, the first variable corresponding to a second-component sample value, the second variable corresponding to a first-component sample value, based on reconstructed samples of both the first-component and the second-component; and deriving the linear model defined by the straight line parameters; wherein said determining the parameters uses integer arithmetic.

    CONTEXT DERIVATION FOR COEFFICIENT CODING
    69.
    发明申请

    公开(公告)号:WO2019112669A1

    公开(公告)日:2019-06-13

    申请号:PCT/US2018/051041

    申请日:2018-09-14

    申请人: GOOGLE LLC

    发明人: KUUSELA, Aki HE, Dake

    摘要: Coding a transform block having transform coefficients is described. A plurality of register arrays is defined to each hold one or more stored values regarding the coding context based on at least one spatial template for a coding context. The register arrays are initialized by setting the stored values to default values, and values for the transform coefficients from the transform block are coded in a reverse scan order. The values for the transform coefficients are indicative of magnitudes of the transform coefficients. For each of one or more transform coefficients, the coding includes determining the coding context using at least some of the stored values from the register arrays, entropy coding a value for the transform coefficient using the coding context, and updating the register arrays subsequent to entropy coding the value for the transform coefficient.