Abstract:
A metric representing the sum of variances for pixel blocks of a region of an image is used to identify the presence of a video feature of the image, and a transcoding operation is performed responsive to identifying the presence of the video feature. The identified video feature can include, but is not limited to, a scene change, the presence of a black border region or a caption region, or the complexity of the image. The transcoding operation can include, but is not limited to: coding the image as an Intra-frame; omitting the content corresponding to the black border region or the caption region from the transcoded image; allocating a relatively lower bit budget to the black border region or a relatively higher bit budget to the caption region during transcoding of the image; or setting the bit budget for rate control during transcoding.
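As a minimal sketch of the metric described above, the following Python computes a sum of variances over fixed-size pixel blocks and uses it to flag a candidate black border region. The 4x4 block size, the thresholds, and the names `sum_of_variances` and `looks_like_black_border` are illustrative assumptions; the abstract does not specify them.

```python
from statistics import pvariance


def sum_of_variances(region, block=4):
    """Sum the per-block pixel variances over a region (2-D list of luma values)."""
    rows, cols = len(region), len(region[0])
    total = 0.0
    for r in range(0, rows - block + 1, block):
        for c in range(0, cols - block + 1, block):
            pixels = [region[y][x]
                      for y in range(r, r + block)
                      for x in range(c, c + block)]
            total += pvariance(pixels)
    return total


def looks_like_black_border(region, svar_limit=1.0, luma_limit=24):
    """Hypothetical test: the region is nearly flat (low sum of variances) and dark."""
    flat = sum_of_variances(region) <= svar_limit
    mean_luma = sum(map(sum, region)) / (len(region) * len(region[0]))
    return flat and mean_luma <= luma_limit


# A uniform dark region versus a high-contrast checkerboard.
border = [[16] * 8 for _ in range(8)]
texture = [[255 * ((x + y) % 2) for x in range(8)] for y in range(8)]
```

Here `looks_like_black_border(border)` is true and `looks_like_black_border(texture)` is false. A transcoder could use such a flag to omit the region or to give it a lower bit budget, as the abstract describes.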
Abstract:
A video processing device includes an interface to receive an input video stream and an interface to provide an encoded video stream. The input video stream includes a sequence of frames. Each frame comprises two fields. The video processing device further includes an encoder to encode the input video stream to generate the encoded video stream. The encoder is to dynamically switch between a first encoding mode and a second encoding mode responsive to a variable quantization parameter. In the first encoding mode the encoder is to encode both fields or the complete frame of a corresponding frame of the sequence. In the second encoding mode the encoder is to encode only one field of the two fields of a corresponding frame of the sequence. This approach can achieve a specified low bit rate with reduced quantization effects while keeping the horizontal resolution unchanged.
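The mode switch described above could be sketched as follows. The abstract only states that switching is responsive to a variable quantization parameter; the direction of the switch (single-field coding when the QP is high), the hysteresis band, the threshold values, and the names `Mode`, `choose_mode`, and `top_field` are all assumptions made for illustration.

```python
from enum import Enum


class Mode(Enum):
    FULL_FRAME = 1    # first mode: both fields (or the complete frame) are encoded
    SINGLE_FIELD = 2  # second mode: only one of the two fields is encoded


def choose_mode(current, qp, enter_single=40, leave_single=34):
    """Pick the encoding mode for the next frame from the current QP.

    Assumed policy: drop to single-field coding when the QP climbs high,
    and return to full-frame coding once it falls back. The gap between
    the two thresholds avoids rapid toggling.
    """
    if current is Mode.FULL_FRAME and qp >= enter_single:
        return Mode.SINGLE_FIELD
    if current is Mode.SINGLE_FIELD and qp <= leave_single:
        return Mode.FULL_FRAME
    return current


def top_field(frame):
    """Keep the even rows only; the horizontal resolution is unchanged."""
    return frame[0::2]


def modes_for(qps, start=Mode.FULL_FRAME):
    """Trace the mode chosen for each QP value in a sequence."""
    mode, trace = start, []
    for qp in qps:
        mode = choose_mode(mode, qp)
        trace.append(mode)
    return trace
```

Selecting one field halves the number of rows to encode while leaving each row at full width, which matches the abstract's point about keeping the horizontal resolution unchanged.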
Abstract:
A scaled motion search section can be used in a video processing device that processes a video input signal that includes a plurality of pictures. The scaled motion search section includes a downscaling module that downscales the plurality of pictures to generate a plurality of downscaled pictures, wherein the downscaling module includes a horizontal downscaling filter and a vertical downscaling filter, and wherein the vertical downscaling filter generates downscaled pixels for a macroblock pair using only pixels from the macroblock pair. A transfer function that models the scaled motion vectors is determined and used to identify a final set of motion vector candidates used in a larger scale motion search.
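To illustrate the constraint on the vertical downscaling filter, the sketch below downscales one pixel column by a factor of two, processing each macroblock pair independently so that no filter tap reads a pixel outside the pair. The 32-row pair height, the (1, 2, 1) kernel, the edge clamping, and the function names are assumptions for illustration, not details taken from the abstract.

```python
PAIR_HEIGHT = 32  # assumption: two vertically stacked 16-row macroblocks


def downscale_pair_vertically(column, taps=(1, 2, 1), factor=2):
    """Downscale one pixel column that belongs to a single macroblock pair.

    Taps that would reach past the pair are clamped to its first or last
    row, so only pixels from this pair contribute to the output.
    """
    n = len(column)
    half = len(taps) // 2
    norm = sum(taps)
    out = []
    for center in range(0, n, factor):
        acc = 0
        for k, weight in enumerate(taps):
            idx = min(max(center + k - half, 0), n - 1)
            acc += weight * column[idx]
        out.append((acc + norm // 2) // norm)  # rounded integer division
    return out


def downscale_column(column, pair_height=PAIR_HEIGHT):
    """Downscale a full picture column one macroblock pair at a time."""
    out = []
    for start in range(0, len(column), pair_height):
        out.extend(downscale_pair_vertically(column[start:start + pair_height]))
    return out


# Two stacked pairs with very different content: no value bleeds across the boundary.
column = [0] * 32 + [200] * 32
scaled = downscale_column(column)
```

With this input, `scaled` is sixteen zeros followed by sixteen values of 200. A filter allowed to cross the pair boundary would instead blend row 31 into the first output of the second pair.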
Abstract:
A video processing device includes a rate control module to determine more accurate initial quantization parameters at each scene switching point and to adjust the quantization parameters in response to scene changes using a sum of variances metric and an estimated picture encoding cost metric from a coding complexity estimation block. To determine a first quantization parameter set, a sum of variances metric and an estimated picture encoding cost metric for an initial set of pictures of a video stream are used. A bit allocation module is to set a target bit allocation for intra-encoded pictures as substantially proportional to the sum of variances metric and substantially inversely proportional to the estimated picture encoding cost metric, and to set a target bit allocation for forward predictive and bi-predictive pictures as substantially proportional to the estimated picture encoding cost metric and substantially inversely proportional to the sum of variances metric.
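The two proportionality rules above can be written down directly. The sketch below is only one possible reading: the `scale` constant, the picture-type labels, and the function name `target_bits` are hypothetical, and a real rate controller would add clamping and buffer constraints that the abstract does not describe.

```python
def target_bits(picture_type, svar, pcost, scale=1000.0):
    """Return a target bit allocation for one picture.

    svar  : sum of variances metric for the picture
    pcost : estimated picture encoding cost metric for the picture
    scale : hypothetical tuning constant (not given in the abstract)
    """
    if svar <= 0 or pcost <= 0:
        raise ValueError("svar and pcost must be positive")
    if picture_type == "I":
        # Intra pictures: proportional to svar, inversely proportional to pcost.
        return scale * svar / pcost
    if picture_type in ("P", "B"):
        # Predicted pictures: proportional to pcost, inversely proportional to svar.
        return scale * pcost / svar
    raise ValueError(f"unknown picture type: {picture_type!r}")
```

For example, doubling `svar` doubles the target for an "I" picture and halves it for a "P" or "B" picture, while doubling `pcost` has the opposite effect.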
Abstract:
A video processing device includes a complexity estimation module to determine a first sum of variances metric and a first estimated picture encoding cost metric for a first picture of a video stream. The video processing device further includes a scene analysis module to determine a first threshold based on a first statistical feature for sum of variances metrics of a set of one or more pictures preceding the first picture in the video stream and a second threshold based on a second statistical feature for estimated picture encoding cost metrics of the set of one or more pictures. The scene analysis module is further to identify a scene change as occurring at the first picture based on the first sum of variances metric, the first estimated picture encoding cost metric, the first threshold, and the second threshold.
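A minimal sketch of this two-threshold test follows. The abstract does not say which statistical features are used or how the comparisons are combined; here each threshold is assumed to be the mean plus `k` standard deviations over the preceding pictures, and a scene change is flagged only when both metrics exceed their thresholds. The names `thresholds` and `is_scene_change` are hypothetical.

```python
from statistics import mean, pstdev


def thresholds(history, k=3.0):
    """Derive (svar_threshold, pcost_threshold) from preceding pictures.

    history: list of (svar, pcost) pairs for the pictures before the current one.
    Assumed statistical feature: mean plus k population standard deviations.
    """
    svars = [s for s, _ in history]
    pcosts = [p for _, p in history]
    return (mean(svars) + k * pstdev(svars),
            mean(pcosts) + k * pstdev(pcosts))


def is_scene_change(svar, pcost, history, k=3.0):
    """Flag a scene change when both metrics exceed their thresholds (assumed rule)."""
    svar_threshold, pcost_threshold = thresholds(history, k)
    return svar > svar_threshold and pcost > pcost_threshold


# Metrics for four pictures from a stable scene.
history = [(1000, 200), (1040, 210), (960, 190), (1000, 200)]
```

With this history, a picture measuring `(1010, 205)` is not flagged, while one measuring `(5000, 900)` is. Per the first abstract in this list, a flagged picture could then be coded as an Intra-frame.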