Text and line detection in video encode by using co-sited gradient and variance values

    公开(公告)号:US11490090B2

    公开(公告)日:2022-11-01

    申请号:US17138812

    申请日:2020-12-30

    Abstract: Methods and devices are provided for encoding video. By using co-sited gradient and variance values to detect text and line in frames of the video. A processor is configured to receive a plurality of frames of video, determine, for a portion of a frame, a variance of the portion of the frame and a gradient of the portion of the frame and encode, using one of a plurality of different encoding qualities, the portion of the frame based on the gradient and the variance of the portion of the frame. Encoding is performed at both the sub-frame level and frame level. The portion of the frame is classified into one of a plurality of categories based on the gradient and variance and encoded based on the category.

    Efficient processing of translucent objects in video keying

    公开(公告)号:US11039153B2

    公开(公告)日:2021-06-15

    申请号:US16428909

    申请日:2019-05-31

    Abstract: A video keying processing device is provided which comprises memory configured to store data and a processor configured to determine, which pixel portions, in a YUV color space of a first video comprising a foreground object and a background color, represent the foreground object and the background color of the first video. The processor is also configured to, for each pixel portion of the first video determined to represent the foreground object and the background color, convert YUV values of the pixel portion to red-green-blue (RGB) color component value and determine a blended display value for each RGB color component of the pixel portion based on a blending factor. The processor is also configured to generate a composite video for display using the blended display values of each pixel portion determined to represent the foreground object and the background color.

    Method for maximizing video slice size constraint

    公开(公告)号:US10602158B2

    公开(公告)日:2020-03-24

    申请号:US14620036

    申请日:2015-02-11

    Abstract: A method and apparatus to maximize video slice size is described herein. The method packs as many macroblocks as possible within a capped-size slice, while preserving user-defined quality constraints. The probability to conform to the maximum slice size constraint may be adjusted according to a user-defined parameter. The method may be integrated into a rate control process of a video encoder. The method predicts whether encoding a macroblock with a quantization parameter exceeds a current slice size constraint. It further predicts whether encoding a given number of macroblocks with a given configuration of quantization parameters exceeds the current slice size constraint. The method then proceeds to encode the current macroblock either on a condition that encoding the given number of macroblocks with the given configuration of quantization parameters falls below the size constraint of the current slice or after determining that a new slice is needed.

    Bit packing for delta color compression

    公开(公告)号:US10511858B2

    公开(公告)日:2019-12-17

    申请号:US15209194

    申请日:2016-07-13

    Abstract: A compressor is configured to determine delta color compression values for a plurality of pixels in a block and subdivide the plurality of pixels in the block into a plurality of groups and transmit a compressed bitstream representative of the delta values. The compressed bitstream includes bits representative of a block header that indicates a range of numbers of bits that are sufficient to represent the delta values, a plurality of group headers that each indicate a group minimum number of bits that is sufficient to represent the delta values in a corresponding one of the plurality of groups, and the delta values encoded using the group minimum number of bits for the group that includes the delta values. A decompressor configured to decompress the compressed bitstream based on the block header, the plurality of group headers, and the encoded delta values.

    METHOD AND APPARATUS FOR DETERMINING THE SEVERITY OF CORRUPTION IN A PICTURE

    公开(公告)号:US20170188051A1

    公开(公告)日:2017-06-29

    申请号:US14981835

    申请日:2015-12-28

    Abstract: An encoder encodes pixels representative of a picture in a multimedia stream, generates a first approximate signature based on approximate values of pixels in a reconstructed copy of the picture, and transmits the encoded pixels and the first approximate signature. A decoder receives a first packet including the encoded pixels and the first approximate signature, decodes the encoded pixels, and transmits a first signal in response to comparing the first approximate signature and a second approximate signature generated based on approximate values of the decoded pixels. If a corrupted packet is detected, the multimedia application requests an intra-coded picture in response to the first approximate signature differing from the second approximate signature. The second signal instructs the decoder to bypass requesting an intra-coded picture and to continue decoding received packets in response to the first approximate signature being equal to the second approximate signature.

    EFFICIENT LOW-COMPLEXITY VIDEO COMPRESSION
    47.
    发明申请
    EFFICIENT LOW-COMPLEXITY VIDEO COMPRESSION 有权
    有效的低复杂度视频压缩

    公开(公告)号:US20160295234A1

    公开(公告)日:2016-10-06

    申请号:US14679799

    申请日:2015-04-06

    Abstract: Disclosed is a low-complexity and yet efficient lossy method to compress distortion information for motion estimation, resulting in significant reduction in needed storage capacity. A system for implementing the method and a computer-readable medium for storing the method are also disclosed. The method includes determining and storing a distortion value for each trial motion vector in a plurality of trial motion vectors. Each trial motion vector specifies a position of a search region relative to a reference frame. The method further includes compressing each of the distortion values as a fixed number of bits based upon a minimum distortion value amongst the stored distortion values, and re-storing each compressed distortion value in place of its uncompressed value.

    Abstract translation: 公开了一种用于压缩运动估计的失真信息的低复杂度且有效的有损方法,导致所需存储容量的显着降低。 还公开了一种用于实现该方法的系统和用于存储该方法的计算机可读介质。 该方法包括在多个试运动矢量中确定并存储每个试运动矢量的失真值。 每个试运动向量指定搜索区域相对于参考帧的位置。 该方法还包括基于存储的失真值中的最小失真值来将每个失真值压缩为固定位数,并且重新存储每个压缩失真值代替其未压缩值。

    EFFICIENT MOTION ESTIMATION FOR 3D STEREO VIDEO ENCODING
    48.
    发明申请
    EFFICIENT MOTION ESTIMATION FOR 3D STEREO VIDEO ENCODING 审中-公开
    3D立体声视频编码的有效运动估计

    公开(公告)号:US20140354771A1

    公开(公告)日:2014-12-04

    申请号:US13904766

    申请日:2013-05-29

    CPC classification number: H04N19/30 H04N19/503 H04N19/52 H04N19/56 H04N19/597

    Abstract: An efficient motion estimation method and apparatus for 3D stereo video encoding is described herein. In an embodiment of the method, an enhancement layer motion vector for a frame is determined by obtaining a motion vector of a co-located macroblock (MB) from the same frame of a base layer. The motion vectors of a predetermined number of surrounding MBs from the same frame of the base layer are also obtained. A predicted motion vector for the MB of the frame in the enhancement layer is determined using, for example, a median value from the motion vectors associated with the co-located MB and the predetermined number of surrounding MBs. A small or less than full range motion refinement is performed to obtain a final motion vector, where full range refers to the maximum search range supported by an encoder performing the method.

    Abstract translation: 本文描述了一种用于3D立体视频编码的高效运动估计方法和装置。 在该方法的实施例中,通过从基本层的相同帧获得同位置宏块(MB)的运动矢量来确定用于帧的增强层运动矢量。 还获得了来自基层的同一帧的预定数量的周围MB的运动矢量。 使用例如来自与同位置MB和预定数量的周围MB相关联的运动向量的中值来确定增强层中的帧的MB的预测运动矢量。 执行小或小于全范围运动细化以获得最终运动矢量,其中全范围是指由执行该方法的编码器支持的最大搜索范围。

    Graphics rendering with encoder feedback

    公开(公告)号:US11830225B2

    公开(公告)日:2023-11-28

    申请号:US15993074

    申请日:2018-05-30

    CPC classification number: G06T9/00 G06T15/005 G06T2210/08 G06T2210/36

    Abstract: A feedback processing module includes a memory configured to store feedback received from an encoder. The feedback includes parameters associated with encoded graphics content generated by a graphics engine. The feedback processing module also includes a processor configured to generate configuration information for the graphics engine based on the feedback. The graphics engine is configured to execute a workload based on the configuration information. In some cases, the feedback processing module is also configured to receive feedback from a decoder that is used to decode the graphics content that is encoded by the encoder and generate the configuration information based on the feedback received from the decoder.

Patent Agency Ranking