Sparse matrix representation using a boundary of non-zero coefficients

    公开(公告)号:US11388439B2

    公开(公告)日:2022-07-12

    申请号:US16726408

    申请日:2019-12-24

    申请人: Google LLC

    摘要: A sparse matrix representation of image or video data for encoding or decoding uses a boundary of non-zero coefficients within the image or video data. A bounding box encloses each non-zero coefficient within an image or video block. The coefficients enclosed within the bounding box are encoded to a bitstream along with dimensional information usable to identify the bounding box within the image or video block during decoding. Coefficients not enclosed within the bounding box are not specifically encoded within the bitstream. The dimensional information represents one or more of a shape, size, or position within the image or video block of the bounding box. The bounding box may be identified according to a scan order used to process the coefficients within the image or video block. The bounding box may be rectangular or non-rectangular.

    Sparse Matrix Representation Using a Boundary of Non-Zero Coefficients

    公开(公告)号:US20210120270A1

    公开(公告)日:2021-04-22

    申请号:US16726408

    申请日:2019-12-24

    申请人: Google LLC

    摘要: A sparse matrix representation of image or video data for encoding or decoding uses a boundary of non-zero coefficients within the image or video data. A bounding box encloses each non-zero coefficient within an image or video block. The coefficients enclosed within the bounding box are encoded to a bitstream along with dimensional information usable to identify the bounding box within the image or video block during decoding. Coefficients not enclosed within the bounding box are not specifically encoded within the bitstream. The dimensional information represents one or more of a shape, size, or position within the image or video block of the bounding box. The bounding box may be identified according to a scan order used to process the coefficients within the image or video block. The bounding box may be rectangular or non-rectangular.

    Dynamic Bitset Coding
    3.
    发明公开

    公开(公告)号:US20230199222A1

    公开(公告)日:2023-06-22

    申请号:US17926831

    申请日:2020-05-28

    申请人: Google LLC

    摘要: Decoding a bitset, each bit of the bitset corresponding to a respective value in a range of a minimum value to a maximum value, includes decoding, from a compressed bitstream, indexes of bits of the bitset, each bit of the bits having a first value. Decoding the bitset also includes setting all other bits of the bitset not decoded from the compressed bitstream to a second value. Decoding the indexes of bits of the bitset includes decoding a number of the indexes of the bits of the bitset, decoding a first index of the indexes in a first range having a first lower bound and a first upper bound, and decoding a last index of the indexes in a second range having a second lower bound and a second upper bound.

    Image compression and decompression using triangulation

    公开(公告)号:US11019366B2

    公开(公告)日:2021-05-25

    申请号:US16413992

    申请日:2019-05-16

    申请人: GOOGLE LLC

    摘要: An encoder system can include a pixel grid generator to receive an image having a first dimension, generate a grid having a second dimension, add a plurality of points to positions on the grid, and map a plurality of pixels of the image to the plurality of points. The encoder system can include a color module to assign a color to each of the plurality of points using a color table, a triangulation module to generate a plurality of vertices based on the plurality of points and triangulate the grid using the vertices, and a compression module to compress the vertices as a set of compressed vertex positions and a set of vertex colors.

    ALPHA CHANNEL POST PROCESSING IN IMAGE CODING

    公开(公告)号:US20220014792A1

    公开(公告)日:2022-01-13

    申请号:US17354626

    申请日:2021-06-22

    申请人: Google LLC

    IPC分类号: H04N19/86

    摘要: Alpha channel post processing in image coding can include decoding, from multiple color channels of a bitstream, color channel values for an encoded image, decoding, from an alpha channel of the bitstream, alpha channel values for the encoded image, determining a bilateral filter based on a level of compression for encoding the alpha channel, post processing the alpha channel values by filtering the alpha channel values using the bilateral filter to obtain filtered alpha channel values, and generating at least a portion of a reconstructed image corresponding to the encoded image using the filtered alpha channel values and the color channel values.

    Image Decoding During Bitstream Interruptions

    公开(公告)号:US20210118186A1

    公开(公告)日:2021-04-22

    申请号:US16732283

    申请日:2019-12-31

    申请人: Google LLC

    IPC分类号: G06T9/00 G06F9/30 H04L29/06

    摘要: A data stream accessor seeks to simulate an uninterrupted bitstream for a decoder during network interruptions. The accessor performs a set of operations including receiving an available amount of image data and storing a current execution context of the call site, passing execution flow to the decoder, and performing a subset of operations. The subset includes receiving an indication of a desired amount of image data from the decoder. If a total amount of image data at the accessor is greater than or equal to the desired amount, the desired amount is transmitted to the decoder for decoding. Otherwise, and unless an error issues, a current execution context of the decoder is stored, execution flow is passed from the decoder to the call site, and the restorable execution context is loaded as the current execution context of the call site. The subset is repeated unless the image is completely decoded.

    Sparse matrix representation using a boundary of non-zero coefficients

    公开(公告)号:US11818397B2

    公开(公告)日:2023-11-14

    申请号:US17860693

    申请日:2022-07-08

    申请人: GOOGLE LLC

    摘要: A sparse matrix representation of image or video data for encoding or decoding uses a boundary of non-zero coefficients within the image or video data. A bounding box encloses each non-zero coefficient within an image or video block. The coefficients enclosed within the bounding box are encoded to a bitstream along with dimensional information usable to identify the bounding box within the image or video block during decoding. Coefficients not enclosed within the bounding box are not specifically encoded within the bitstream. The dimensional information represents one or more of a shape, size, or position within the image or video block of the bounding box. The bounding box may be identified according to a scan order used to process the coefficients within the image or video block. The bounding box may be rectangular or non-rectangular.

    EFFICIENT UPDATE OF CUMULATIVE DISTRIBUTION FUNCTIONS FOR IMAGE COMPRESSION

    公开(公告)号:US20230085142A1

    公开(公告)日:2023-03-16

    申请号:US17904030

    申请日:2020-07-06

    申请人: GOOGLE LLC

    IPC分类号: H03M7/30 H03M7/40

    摘要: Updating cumulative distribution functions (CDFs) during arithmetic encoding can be a challenge because the final element of the CDF should remain fixed during the update calculations. If the probabilities were floating-point numbers, this would not be too much of a challenge; nevertheless, the probabilities and hence the CDFs are represented as integers to take advantage of infinite-precision arithmetic. Some of these difficulties may be alleviated by introducing a “mixing” CDF along with the active CDF being updated; the mixing CDF provides nonlocal context for updating the CDF due to the introduction of a particular symbol in the encoding. Improved techniques of performing arithmetic encoding include updating the CDF using two, one-dimensional mixing CDF arrays: a symbol-dependent array and a symbol-dependent array. The symbol-dependent array is a sub array of a larger, fixed array such that the sub array selected depends on the symbol being used.

    IMAGE COMPRESSION AND DECOMPRESSION USING TRIANGULATION

    公开(公告)号:US20190356931A1

    公开(公告)日:2019-11-21

    申请号:US16413992

    申请日:2019-05-16

    申请人: GOOGLE LLC

    摘要: An encoder system can include a pixel grid generator to receive an image having a first dimension, generate a grid having a second dimension, add a plurality of points to positions on the grid, and map a plurality of pixels of the image to the plurality of points. The encoder system can include a color module to assign a color to each of the plurality of points using a color table, a triangulation module to generate a plurality of vertices based on the plurality of points and triangulate the grid using the vertices, and a compression module to compress the vertices as a set of compressed vertex positions and a set of vertex colors.