DETECTION OF VIDEO FEATURE BASED ON VARIANCE METRIC
    1.
    Patent application
    DETECTION OF VIDEO FEATURE BASED ON VARIANCE METRIC (status: in force)

    Publication No.: US20130279563A1

    Publication date: 2013-10-24

    Application No.: US13450870

    Filing date: 2012-04-19

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    IPC classification: H04N7/26

    Abstract: A metric representing the sum of variances for pixel blocks of a region of an image is used to identify the presence of a video feature of the image, and transcoding is performed in response to identifying the video feature. The identified video feature can include, but is not limited to, a scene change, the presence of a black border region or a caption region, or the complexity of the image. The transcoding operation can include, but is not limited to, coding the image as an intra-frame; omitting the content corresponding to the black border region or the caption region from the transcoded image; allocating a relatively lower bit budget to the black border region or a relatively higher bit budget to the caption region during transcoding of the image; or setting the bit budget for rate control during transcoding.
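
    Illustration (not part of the patent record): the metric described in this abstract can be sketched as the sum of per-block pixel variances over a region. The function names, the 4x4 block size, and the use of population variance are assumptions made for this sketch; the abstract does not specify them.

```python
from statistics import pvariance


def block_variances(image, block=4):
    """Return the population variance of each block x block tile.

    `image` is a list of equal-length rows of luma values. Edge pixels that
    do not fill a whole tile are ignored in this sketch.
    """
    height, width = len(image), len(image[0])
    variances = []
    for top in range(0, height - block + 1, block):
        for left in range(0, width - block + 1, block):
            pixels = [image[y][x]
                      for y in range(top, top + block)
                      for x in range(left, left + block)]
            variances.append(pvariance(pixels))
    return variances


def sum_of_variances(image, block=4):
    """Sum the per-block variances into a single metric for the region."""
    return sum(block_variances(image, block))


# Two 8x8 test regions: a uniform gray patch and a checkerboard.
flat = [[16] * 8 for _ in range(8)]
checker = [[255 * ((x + y) % 2) for x in range(8)] for y in range(8)]
```

    A flat region yields a metric of zero, while a busy region such as the checkerboard yields a large value, which is what makes the metric usable as a rough complexity indicator.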

    Detection of video feature based on variance metric
    2.
    Granted patent
    Detection of video feature based on variance metric (status: in force)

    Publication No.: US09071842B2

    Publication date: 2015-06-30

    Application No.: US13450870

    Filing date: 2012-04-19

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    Abstract: A metric representing the sum of variances for pixel blocks of a region of an image is used to identify the presence of a video feature of the image, and transcoding is performed in response to identifying the video feature. The identified video feature can include, but is not limited to, a scene change, the presence of a black border region or a caption region, or the complexity of the image. The transcoding operation can include, but is not limited to, coding the image as an intra-frame; omitting the content corresponding to the black border region or the caption region from the transcoded image; allocating a relatively lower bit budget to the black border region or a relatively higher bit budget to the caption region during transcoding of the image; or setting the bit budget for rate control during transcoding.
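
    Illustration (not part of the patent record): one way a variance metric could reveal a black border is to scan bands of blocks from the top of the frame and stop at the first band whose summed block variance is no longer near zero. The helper names, the block size, and the threshold below are hypothetical, and a real detector would likely also check luma level, which this sketch omits.

```python
from statistics import pvariance


def row_band_metric(image, top, block):
    """Sum of block variances for the band of blocks starting at row `top`."""
    width = len(image[0])
    total = 0.0
    for left in range(0, width - block + 1, block):
        pixels = [image[y][x]
                  for y in range(top, top + block)
                  for x in range(left, left + block)]
        total += pvariance(pixels)
    return total


def top_border_height(image, block=4, threshold=1.0):
    """Count pixel rows at the top whose block bands are essentially flat."""
    height = 0
    for top in range(0, len(image) - block + 1, block):
        if row_band_metric(image, top, block) > threshold:
            break  # first textured band ends the border
        height += block
    return height


# Letterboxed test frame: 8 flat black rows above 8 rows of textured content.
frame = [[0] * 16 for _ in range(8)]
frame += [[(37 * x + 11 * y) % 256 for x in range(16)] for y in range(8)]
```

    The detected border height could then drive the responses the abstract lists, such as cropping the border or giving it a lower bit budget.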

    ADAPTIVE SINGLE-FIELD/DUAL-FIELD VIDEO ENCODING
    3.
    Patent application
    ADAPTIVE SINGLE-FIELD/DUAL-FIELD VIDEO ENCODING (status: in force)

    Publication No.: US20140153640A1

    Publication date: 2014-06-05

    Application No.: US13705422

    Filing date: 2012-12-05

    Applicant: Xu Gang Zhao; Ying Li

    Inventor: Xu Gang Zhao; Ying Li

    IPC classification: H04N7/26

    Abstract: A video processing device includes an interface to receive an input video stream and an interface to provide an encoded video stream. The input video stream includes a sequence of frames, and each frame comprises two fields. The video processing device further includes an encoder to encode the input video stream to generate the encoded video stream. The encoder is to dynamically switch between a first encoding mode and a second encoding mode responsive to a variable quantization parameter. In the first encoding mode, the encoder is to encode both fields, or the complete frame, of a corresponding frame of the sequence. In the second encoding mode, the encoder is to encode only one of the two fields of a corresponding frame of the sequence. This approach can achieve a specified low bit rate with reduced quantization effects while keeping the horizontal resolution unchanged.
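
    Illustration (not part of the patent record): the mode switch described in this abstract can be sketched as a decision driven by the current quantization parameter (QP). The direction of the switch (high QP selects single-field mode), the two thresholds, and the hysteresis gap are all assumptions of this sketch; the abstract states only that the switch is responsive to a variable QP.

```python
def split_fields(frame):
    """Split an interlaced frame (list of rows) into top and bottom fields."""
    return frame[0::2], frame[1::2]


def choose_mode(qp, current_mode, high_qp=40, low_qp=32):
    """Pick the encoding mode from the current quantization parameter.

    The two thresholds are hypothetical; the gap between them adds
    hysteresis so the encoder does not flip modes on every picture.
    """
    if current_mode == "dual" and qp > high_qp:
        return "single"
    if current_mode == "single" and qp < low_qp:
        return "dual"
    return current_mode


def pictures_to_encode(frame, mode):
    """Return the rows handed to the encoder for this frame."""
    top, _bottom = split_fields(frame)
    return top if mode == "single" else frame
```

    Dropping one field halves the number of rows while leaving each row at full width, which matches the abstract's remark that horizontal resolution is unchanged.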

    METHOD AND DEVICE TO IDENTIFY MOTION VECTOR CANDIDATES USING A SCALED MOTION SEARCH
    4.
    Patent application
    METHOD AND DEVICE TO IDENTIFY MOTION VECTOR CANDIDATES USING A SCALED MOTION SEARCH (status: in force)

    Publication No.: US20130251024A1

    Publication date: 2013-09-26

    Application No.: US13425522

    Filing date: 2012-03-21

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    IPC classification: H04N7/26

    CPC classification: H04N19/53; H04N19/56

    Abstract: A scaled motion search section can be used in a video processing device that processes a video input signal that includes a plurality of pictures. The scaled motion search section includes a downscaling module that downscales the plurality of pictures to generate a plurality of downscaled pictures, wherein the downscaling module includes a horizontal downscaling filter and a vertical downscaling filter, and wherein the vertical downscaling filter generates downscaled pixels for a macroblock pair using only pixels from the macroblock pair. A transfer function that models the scaled motion vectors is determined and used to identify a final set of motion vector candidates used in a larger-scale motion search.
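
    Illustration (not part of the patent record): the constraint that the vertical filter reads only pixels from the current macroblock pair can be sketched by clamping filter taps to the pair's own rows. The [1, 2, 1]/4 taps, the 2:1 factor, the 32-row pair height, and the function names are assumptions of this sketch; the transfer function mentioned in the abstract is not modeled here.

```python
MB_PAIR_ROWS = 32  # two vertically stacked 16-row macroblocks (assumed size)


def downscale_column_pairwise(column, factor=2):
    """Vertically downscale one pixel column, one macroblock pair at a time.

    A [1, 2, 1]/4 filter is centered on every `factor`-th row; tap positions
    are clamped to the current pair so no pixel outside the pair is read.
    """
    out = []
    for start in range(0, len(column), MB_PAIR_ROWS):
        end = min(start + MB_PAIR_ROWS, len(column)) - 1  # last row of pair
        for center in range(start, end + 1, factor):
            above = max(center - 1, start)
            below = min(center + 1, end)
            out.append((column[above] + 2 * column[center] + column[below]) // 4)
    return out


def upscale_vector(mv, factor=2):
    """Map a motion vector found on downscaled pictures back to full scale."""
    return (mv[0] * factor, mv[1] * factor)
```

    With a column that is 0 in one pair and 100 in the next, the clamped filter produces no blended value at the boundary, so each pair can be downscaled without waiting for its neighbors.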

    QUANTIZATION PARAMETER ADJUSTMENT BASED ON SUM OF VARIANCE AND ESTIMATED PICTURE ENCODING COST
    5.
    Patent application
    QUANTIZATION PARAMETER ADJUSTMENT BASED ON SUM OF VARIANCE AND ESTIMATED PICTURE ENCODING COST (status: in force)

    Publication No.: US20140376616A1

    Publication date: 2014-12-25

    Application No.: US13926179

    Filing date: 2013-06-25

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    Abstract: A video processing device includes a rate control module to determine more accurate initial quantization parameters (QPs) at each scene switching point and to adjust the QPs in response to scene changes, using a sum of variances metric and an estimated picture encoding cost metric from a coding complexity estimation block. To determine a first quantization parameter set, a sum of variances metric and an estimated picture encoding cost metric for an initial set of pictures of a video stream are used. A bit allocation module is to set a target bit allocation for intra-encoded pictures as substantially proportional to the sum of variances metric and substantially inversely proportional to the estimated picture encoding cost metric, and to set a target bit allocation for forward predictive and bi-predictive pictures as substantially proportional to the estimated picture encoding cost metric and substantially inversely proportional to the sum of variances metric.
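
    Illustration (not part of the patent record): the two proportionality rules stated in this abstract can be written as a small function. The names `svar` and `pcost`, the picture-type labels, and the `gain` constant are hypothetical; the abstract gives only the proportional relationships, not a formula.

```python
def target_bits(picture_type, svar, pcost, gain=1000.0):
    """Return a target bit allocation following the stated proportionalities.

    Intra pictures:    bits ~ svar / pcost
    P and B pictures:  bits ~ pcost / svar
    `gain` is a hypothetical scaling constant; both metrics must be positive.
    """
    if svar <= 0 or pcost <= 0:
        raise ValueError("metrics must be positive")
    if picture_type == "I":
        return gain * svar / pcost
    if picture_type in ("P", "B"):
        return gain * pcost / svar
    raise ValueError(f"unknown picture type: {picture_type!r}")
```

    Under this sketch, doubling the sum of variances doubles the intra-picture target and halves the predictive-picture target, with the encoding cost metric acting in the opposite direction.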

    Quantization parameter adjustment based on sum of variance and estimated picture encoding cost
    6.
    Granted patent
    Quantization parameter adjustment based on sum of variance and estimated picture encoding cost (status: in force)

    Publication No.: US09565440B2

    Publication date: 2017-02-07

    Application No.: US13926179

    Filing date: 2013-06-25

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    Abstract: A video processing device includes a rate control module to determine more accurate initial quantization parameters (QPs) at each scene switching point and to adjust the QPs in response to scene changes, using a sum of variances metric and an estimated picture encoding cost metric from a coding complexity estimation block. To determine a first quantization parameter set, a sum of variances metric and an estimated picture encoding cost metric for an initial set of pictures of a video stream are used. A bit allocation module is to set a target bit allocation for intra-encoded pictures as substantially proportional to the sum of variances metric and substantially inversely proportional to the estimated picture encoding cost metric, and to set a target bit allocation for forward predictive and bi-predictive pictures as substantially proportional to the estimated picture encoding cost metric and substantially inversely proportional to the sum of variances metric.

    Scene change detection using sum of variance and estimated picture encoding cost
    7.
    Granted patent
    Scene change detection using sum of variance and estimated picture encoding cost (status: in force)

    Publication No.: US09426475B2

    Publication date: 2016-08-23

    Application No.: US13926185

    Filing date: 2013-06-25

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    Abstract: A video processing device includes a complexity estimation module to determine a first sum of variances metric and a first estimated picture encoding cost metric for a first picture of a video stream. The video processing device further includes a scene analysis module to determine a first threshold based on a first statistical feature of the sum of variances metrics of a set of one or more pictures preceding the first picture in the video stream, and a second threshold based on a second statistical feature of the estimated picture encoding cost metrics of the same set of pictures. The scene analysis module is further to identify a scene change as occurring at the first picture based on the first sum of variances metric, the first estimated picture encoding cost metric, the first threshold, and the second threshold.
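
    Illustration (not part of the patent record): the two-threshold test described in this abstract can be sketched with the mean of each metric's recent history as the statistical feature. The choice of the mean, the ratio parameters, and the rule that both conditions must hold are assumptions of this sketch rather than details taken from the abstract.

```python
from statistics import mean


def is_scene_change(svar, pcost, svar_history, pcost_history,
                    svar_ratio=0.5, pcost_ratio=2.0):
    """Flag a scene change when both metrics depart from recent history.

    The thresholds use the mean of each history as the statistical feature;
    the ratios are hypothetical tuning values.
    """
    if not svar_history or not pcost_history:
        return False  # nothing to compare against yet
    svar_mean = mean(svar_history)
    svar_threshold = svar_ratio * svar_mean
    pcost_threshold = pcost_ratio * mean(pcost_history)
    svar_jump = abs(svar - svar_mean) > svar_threshold
    cost_jump = pcost > pcost_threshold
    return svar_jump and cost_jump
```

    For example, with preceding pictures whose sum of variances averages 100 and whose encoding cost averages 10, a picture measuring 300 and 40 is flagged, while one measuring 105 and 11 is not.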

    Method and device to identify motion vector candidates using a scaled motion search
    8.
    Granted patent
    Method and device to identify motion vector candidates using a scaled motion search (status: in force)

    Publication No.: US09232230B2

    Publication date: 2016-01-05

    Application No.: US13425522

    Filing date: 2012-03-21

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    IPC classification: H04N7/50; H04N19/53; H04N19/56

    CPC classification: H04N19/53; H04N19/56

    Abstract: A scaled motion search section can be used in a video processing device that processes a video input signal that includes a plurality of pictures. The scaled motion search section includes a downscaling module that downscales the plurality of pictures to generate a plurality of downscaled pictures, wherein the downscaling module includes a horizontal downscaling filter and a vertical downscaling filter, and wherein the vertical downscaling filter generates downscaled pixels for a macroblock pair using only pixels from the macroblock pair. A transfer function that models the scaled motion vectors is determined and used to identify a final set of motion vector candidates used in a larger-scale motion search.

    Adaptive single-field/dual-field video encoding
    9.
    Granted patent
    Adaptive single-field/dual-field video encoding (status: in force)

    Publication No.: US09560361B2

    Publication date: 2017-01-31

    Application No.: US13705422

    Filing date: 2012-12-05

    Applicant: Xu Gang Zhao; Ying Li

    Inventor: Xu Gang Zhao; Ying Li

    Abstract: A video processing device includes an interface to receive an input video stream and an interface to provide an encoded video stream. The input video stream includes a sequence of frames, and each frame comprises two fields. The video processing device further includes an encoder to encode the input video stream to generate the encoded video stream. The encoder is to dynamically switch between a first encoding mode and a second encoding mode responsive to a variable quantization parameter. In the first encoding mode, the encoder is to encode both fields, or the complete frame, of a corresponding frame of the sequence. In the second encoding mode, the encoder is to encode only one of the two fields of a corresponding frame of the sequence. This approach can achieve a specified low bit rate with reduced quantization effects while keeping the horizontal resolution unchanged.

    SCENE CHANGE DETECTION USING SUM OF VARIANCE AND ESTIMATED PICTURE ENCODING COST
    10.
    Patent application
    SCENE CHANGE DETECTION USING SUM OF VARIANCE AND ESTIMATED PICTURE ENCODING COST (status: in force)

    Publication No.: US20140376624A1

    Publication date: 2014-12-25

    Application No.: US13926185

    Filing date: 2013-06-25

    Applicant: Ying Li; Xu Gang Zhao

    Inventor: Ying Li; Xu Gang Zhao

    IPC classification: H04N19/87; H04N19/503

    Abstract: A video processing device includes a complexity estimation module to determine a first sum of variances metric and a first estimated picture encoding cost metric for a first picture of a video stream. The video processing device further includes a scene analysis module to determine a first threshold based on a first statistical feature of the sum of variances metrics of a set of one or more pictures preceding the first picture in the video stream, and a second threshold based on a second statistical feature of the estimated picture encoding cost metrics of the same set of pictures. The scene analysis module is further to identify a scene change as occurring at the first picture based on the first sum of variances metric, the first estimated picture encoding cost metric, the first threshold, and the second threshold.