Method and apparatus for candidate list pruning

    公开(公告)号:US12238277B2

    公开(公告)日:2025-02-25

    申请号:US18074769

    申请日:2022-12-05

    Abstract: Video signal coding and decoding functions can generate lists of potential candidates to use in coding and decoding, for example, predictors. Video signal coding component candidate undergo operations before potential inclusion in candidate lists. The candidates are checked after being modified by the operations to see if other equal candidates are already in the candidate list. If equal candidates are not in the list, the modified candidates are added to the candidate list. If equal candidates are already in the list, the modified candidates are not added to the list. Operations that can be performed comprise rounding and clipping.

    TEMPORAL ATTENTION-BASED NEURAL NETWORKS FOR VIDEO COMPRESSION

    公开(公告)号:US20250056036A1

    公开(公告)日:2025-02-13

    申请号:US18721964

    申请日:2022-12-20

    Abstract: Systems, methods, and instrumentalities are disclosed for video encoding and/or video decoding using artificial neural networks (e.g., convolutional neural networks or recurrent neural networks), attention, and/or attention with spatial attributes. For example, an apparatus may be configured to perform one or more of the following: obtaining a context block, a current block, and a latent vector associated with the context block; performing at least one convolution on the context block, the reference block, and the latent vector; generating motion flow data associated with the current block based on the at least one convolution; or generating a bitstream the comprises an indication of the motion flow data. The motion flow data may be quantized. The generated bitstream may comprise an indication of the quantized motion flow data.

    Method and apparatus for combined intra prediction modes

    公开(公告)号:US11477436B2

    公开(公告)日:2022-10-18

    申请号:US17049724

    申请日:2019-04-03

    Abstract: Encoders and decoders of digital video signals use combined intra prediction modes for some images. In at least one embodiment, combined intra prediction modes can result from the intra prediction modes of neighboring blocks. The combined intra prediction mode can be added to a most probable modes list. Various embodiments describe techniques for combining the intra prediction modes, comprising a combination of reference samples and a combination of directions to form a prediction. Another embodiment adds a combination mode to the most probable modes list if the two modes that are combined to form it are directional modes with angle difference less than 90 degrees. Another embodiment uses a linear combination of intra prediction modes and another embodiment performs a linear combination which depends on the distance of a prediction from the left and above blocs.

    Texture-based partitioning decisions for video compression

    公开(公告)号:US11412220B2

    公开(公告)日:2022-08-09

    申请号:US16771094

    申请日:2018-12-12

    Abstract: A block of video data is split using one or more of several possible partition operations by using the partitioning choices obtained through use of a texture-based image partitioning. In at least one embodiment, the block is split in one or more splitting operations using a convolutional neural network. In another embodiment, inputs to the convolutional neural network come from pixels along the block's causal borders. In another embodiment, boundary information, such as the location of partitions in spatially neighboring blocks, is used by the texture analysis. Methods, apparatus, and signal embodiments are provided for encoding.

    Method and apparatus for adaptive transform in video encoding and decoding

    公开(公告)号:US11375191B2

    公开(公告)日:2022-06-28

    申请号:US16652395

    申请日:2018-10-12

    Abstract: For a picture with two or more color components, the prediction residuals for the first color component of a block to be encoded may be transformed with a first transform. The transform coefficients for the first color component may go through quantization, de-quantization and inverse transform to obtain reconstructed prediction residuals. Based on the reconstructed prediction residuals for the first color component, the phases of the transform basis function of the first transform can be adjusted to improve the sparsity of the transformed signal. The prediction residuals for the remaining color components may then be transformed with the adjusted transform. In order to determine the phase shift factor, the reconstructed prediction residuals for the first color component may be transformed with the first transform, adjusted by different candidate phase shift factors, and the candidate phase shift factor that provides a smallest sparsity measure can be selected for the block.

    Method and apparatus for video coding with adaptive clipping

    公开(公告)号:US10999603B2

    公开(公告)日:2021-05-04

    申请号:US16333519

    申请日:2017-09-11

    Abstract: In a particular implementation, a clipping bound may be different from the signal bound. For example, to derive the upper clipping bound, a reconstructed sample value corresponding to original sample value Y is estimated to be Y+Δy. Thus, for a candidate upper clipping bound x, the difference between the clipped value and the original value is calculated as min(Y+Δy, x)−Y. The distortions using different candidate clipping values around signal bound M may be tested. The test starts with signal bound M and moves towards smaller values. The distortion may first decrease (or maintain the same) and then increase, and the turning point is chosen as upper clipping bound M′. Similarly, the lower clipping bound m′ can be chosen. For more effective clipping, the color components may be transformed such that the transformed color components may be more tightly enclosed by a box defined by the clipping bounds.

Patent Agency Ranking