Perceptual importance maps for image processing

    公开(公告)号:US11475653B2

    公开(公告)日:2022-10-18

    申请号:US15965640

    申请日:2018-04-27

    Inventor: Boris Ivanovic

    Abstract: The present disclosure is directed to techniques for determining a perceptual importance map. The perceptual importance map indicates the relative importance to the human visual system of different portions of an image. The techniques include obtaining cost values for the blocks of an image, where cost values are values used in determining motion vectors. For each block, a confidence value is derived from the cost values. The confidence value indicates the confidence with which the motion vector is believed to be correct. A perceptual importance value is determined based on the confidence value via one or more modifications to the confidence value to better reflect importance to the human visual system. The generated perceptual importance values can be used for various purposes such as allocating bits for encoding, identifying regions of interest, or selectively rendering portions of an image with greater or lesser detail based on relative perceptual importance.

    Determining quality values for blocks of encoded video

    公开(公告)号:US11310496B2

    公开(公告)日:2022-04-19

    申请号:US16366959

    申请日:2019-03-27

    Abstract: A technique for determining a quality value for a subject block of encoded video is provided. Contributing blocks, of the same frame and/or different frames of the subject block, are determined by identifying blocks likely to be a part of the same moving object or background as the subject block. A spatial and/or temporal filter is then applied to the quality values of the contributing blocks and an initial quality value of the subject block. With a spatial filter, quality values for contributing blocks from the same frame are combined and used to modify the quality value of the subject block. With a spatial filter, a temporal characteristic quality value for contributing blocks of one or more other frames (such as the immediately previous frame) is determined and then combined with a quality value representative of the subject block.

    Block type prediction leveraging block-based pixel activities

    公开(公告)号:US11234004B2

    公开(公告)日:2022-01-25

    申请号:US16207893

    申请日:2018-12-03

    Abstract: Systems, apparatuses, and methods for block type prediction leveraging block-based pixel activities are disclosed. A pre-encoder generates predictions of block types for the blocks of a video frame based on associated pixel activities. For each block, the pre-encoder calculates the difference between the pixel activities of the block of a current frame and the pixel activities of a corresponding block of a previous video frame. If the difference is less than a first threshold, the pre-encoder predicts that the block will be a skip block. If the difference is in between the first threshold and a second threshold, the pre-encoder predicts that the block will be a P-block. Otherwise, if the difference is greater than the second threshold, then the pre-encoder predicts that the block will be an I-block. The pre-encoder uses the predictions to select quantization parameter (QP) ranges for encoding the blocks of the video frame.

    Multi-scale metric-based encoding

    公开(公告)号:US11102488B2

    公开(公告)日:2021-08-24

    申请号:US16427482

    申请日:2019-05-31

    Abstract: A processing system analyzes pixel activity levels of blocks of a picture at a plurality of spatial scales and/or dynamic ranges to generate a multi-scale metric that indicates how bit allocation or assignment of a given quantization parameter (QP) will affect the perceptual quality of the block. Blocks that have similar multi-scale metrics are likely to be visually similar and to benefit from similar bit allocations or QPs. Based on the multi-scale metric, an encoder encodes each block of the picture with a QP and/or a number of bits.

    PERCEPTUAL IMPORTANCE MAPS FOR IMAGE PROCESSING

    公开(公告)号:US20190332883A1

    公开(公告)日:2019-10-31

    申请号:US15965640

    申请日:2018-04-27

    Inventor: Boris Ivanovic

    Abstract: The present disclosure is directed to techniques for determining a perceptual importance map. The perceptual importance map indicates the relative importance to the human visual system of different portions of an image. The techniques include obtaining cost values for the blocks of an image, where cost values are values used in determining motion vectors. For each block, a confidence value is derived from the cost values. The confidence value indicates the confidence with which the motion vector is believed to be correct. A perceptual importance value is determined based on the confidence value via one or more modifications to the confidence value to better reflect importance to the human visual system. The generated perceptual importance values can be used for various purposes such as allocating bits for encoding, identifying regions of interest, or selectively rendering portions of an image with greater or lesser detail based on relative perceptual importance.

    Video frame rate conversion using streamed metadata

    公开(公告)号:US10412462B2

    公开(公告)日:2019-09-10

    申请号:US15346392

    申请日:2016-11-08

    Inventor: Boris Ivanovic

    Abstract: A video server generates metadata representative of interpolation parameters for portions of a first frame representative of a scene in a stream of frames including the first frame. The interpolation parameters are used to generate at least one interpolated frame representative of the scene subsequent to the first frame and prior to a second frame in the stream of frames. The video server incorporates the metadata into the stream and transmits the stream including the multiplexed metadata. A video client receives the first frame representative the stream of frames including the metadata. The video client generates one or more interpolated frames representative of the scene subsequent to the first frame and prior to a second frame in the stream of frames based on the first frame and the metadata. The video client displays the first frame, the one or more interpolated frames, and the second frame.

    Pre-processing for video compression

    公开(公告)号:US10708624B1

    公开(公告)日:2020-07-07

    申请号:US16426487

    申请日:2019-05-30

    Abstract: A processing system filters blocks of a picture to minimize a size and error of the blocks prior to encoding. A pre-processing module of the processing system measures characteristics of a plurality of blocks and evaluates the effects of applying each of a plurality of filters to the blocks prior to encoding in order to predict an increase in compressibility of blocks having similar characteristics that are filtered with each filter before being encoded, with the least impact on quality. The pre-processing module trains models to predict a size and error of blocks filtered with each filter based on block characteristics. The pre-processing module uses the models to calculate a cost in terms of size and error of applying each filter to a given block having certain characteristics. The pre-processing module then applies to the block the filter that is predicted to result in the best cost.

Patent Agency Ranking