Abstract:
A perimeter around a detected object in a frame of image data can be generated in a first coordinate system. The perimeter can be converted from the first coordinate system into a second coordinate system having the same aspect ratio as the first coordinate system. A first metadata entry can include dimensions of image data in the second coordinate system. A second metadata entry can provide a location and dimensions of the converted perimeter in the second coordinate system. Additional metadata can indicate matching objects between frames, position of an object relative to other objects in a frame, a probability that an object is correctly detected, and a total number of objects detected across multiple frames of image data.
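The sketch below is only an illustration of the coordinate conversion and the two metadata entries described above, not the disclosed implementation; all function and field names (convert_perimeter, build_metadata, object_id, and so on) are assumptions chosen for clarity.

import math

def convert_perimeter(box, src_size, dst_size):
    """box = (x, y, w, h) in the source coordinate system.
    src_size and dst_size = (width, height) and must share an aspect ratio."""
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    assert math.isclose(src_w / src_h, dst_w / dst_h), "aspect ratios must match"
    scale = dst_w / src_w  # equals dst_h / src_h when aspect ratios match
    x, y, w, h = box
    return (x * scale, y * scale, w * scale, h * scale)

def build_metadata(box, src_size, dst_size, frame_index, object_id, confidence):
    converted = convert_perimeter(box, src_size, dst_size)
    return {
        # First metadata entry: dimensions of the image data in the
        # second (target) coordinate system.
        "image_dimensions": {"width": dst_size[0], "height": dst_size[1]},
        # Second metadata entry: location and dimensions of the converted
        # perimeter in the second coordinate system.
        "perimeter": {"x": converted[0], "y": converted[1],
                      "width": converted[2], "height": converted[3]},
        # Additional metadata: frame/object association and detection confidence.
        "frame_index": frame_index,
        "object_id": object_id,      # used to match the same object between frames
        "confidence": confidence,    # probability that the detection is correct
    }

# Example: a 1920x1080 frame mapped into a 640x360 coordinate system.
meta = build_metadata((480, 270, 192, 108), (1920, 1080), (640, 360),
                      frame_index=0, object_id=7, confidence=0.92)
print(meta)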
Abstract:
A multi-stage coding method includes receiving an input block of data for encoding and one or more previously coded samples associated with the input block. The input block is segmented into at least a first sub-region and a second sub-region. A prediction for the first sub-region is generated based on the one or more previously coded samples. Residual data for the first sub-region is obtained using the prediction for the first sub-region. A reconstruction of the first sub-region is generated using the residual data for the first sub-region and the prediction for the first sub-region. A prediction for the second sub-region is generated using the reconstruction of the first sub-region. Residual data for the second sub-region is obtained using the prediction for the second sub-region. The input block is encoded based in part on the residual data for the first sub-region and the residual data for the second sub-region.
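A minimal sketch of the two-stage flow above, not the codec's actual prediction or entropy-coding machinery. Assumptions made for brevity: a one-dimensional block split into two halves, DC (mean) prediction, and residuals kept losslessly so the first-stage reconstruction is exact.

import numpy as np

def encode_block(block, previously_coded):
    block = np.asarray(block, dtype=float)
    half = len(block) // 2
    first, second = block[:half], block[half:]

    # Stage 1: predict the first sub-region from previously coded samples.
    pred_first = np.full_like(first, np.mean(previously_coded))
    resid_first = first - pred_first
    recon_first = pred_first + resid_first  # reconstruction of the first sub-region

    # Stage 2: predict the second sub-region from the reconstructed first sub-region.
    pred_second = np.full_like(second, np.mean(recon_first))
    resid_second = second - pred_second

    # The encoded block is represented, in part, by the two residuals.
    return resid_first, resid_second

resid1, resid2 = encode_block([10, 12, 11, 13, 30, 31, 29, 32],
                              previously_coded=[9, 10, 11])
print(resid1, resid2)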
Abstract:
A video coding system may include an encoder that performs motion-compensated prediction on a video signal in a second format converted from an input format of the video signal. The video coding system may also include a decoder to decode portions of the encoded video, and a filtering system that filters portions of the decoded video, for example, by deblocking filtering or SAO filtering, using parameters derived from the video signal in the input format. A prediction system may include another format converter that converts the decoded video to the input format. The prediction system may select parameters of the motion-compensated prediction based at least in part on a comparison of the video signal in the input format to the decoded video in the input format.
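An illustrative sketch of selecting a prediction parameter (here, a candidate motion vector) by measuring distortion in the input format rather than the coding format. The format conversion is stood in for by a simple bit-depth change; a real system would use its own converters and filters, and all names here are assumptions.

import numpy as np

def to_coding_format(frame):   # e.g., 10-bit input converted to an 8-bit coding format
    return np.clip(frame // 4, 0, 255)

def to_input_format(frame):    # inverse converter back to the input format
    return np.clip(frame * 4, 0, 1023)

def select_motion_vector(block_input, ref_coding, candidates, pos):
    """Pick the candidate motion vector whose prediction, converted back to the
    input format, is closest to the original input-format block."""
    y, x = pos
    h, w = block_input.shape
    best_mv, best_cost = None, float("inf")
    for dy, dx in candidates:
        pred_coding = ref_coding[y + dy:y + dy + h, x + dx:x + dx + w]
        pred_input = to_input_format(pred_coding)  # comparison happens in the input format
        diff = block_input.astype(np.int64) - pred_input.astype(np.int64)
        cost = float(np.mean(diff ** 2))
        if cost < best_cost:
            best_mv, best_cost = (dy, dx), cost
    return best_mv, best_cost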
Abstract:
A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image, estimating a distortion for each filter, and choosing the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.
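A minimal sketch of the closed-loop filter selection described above, using placeholder filters rather than the patented ones: each candidate downsample filter is applied, the result is upsampled back, and the filter yielding the lowest reconstruction distortion is chosen.

import numpy as np

def downsample(chroma, kernel):
    # Horizontal 2:1 downsampling with a simple FIR kernel (illustrative only).
    pad = len(kernel) // 2
    padded = np.pad(chroma, ((0, 0), (pad, pad)), mode="edge")
    filtered = np.zeros_like(chroma, dtype=float)
    for i, k in enumerate(kernel):
        filtered += k * padded[:, i:i + chroma.shape[1]]
    return filtered[:, ::2]

def upsample(chroma_ds):
    # Nearest-neighbor upsampling as a stand-in for the real upsample filter.
    return np.repeat(chroma_ds, 2, axis=1)

def choose_downsample_filter(chroma, candidate_kernels):
    best_kernel, best_distortion, best_output = None, float("inf"), None
    for kernel in candidate_kernels:
        ds = downsample(chroma, np.asarray(kernel, dtype=float))
        # Closed-loop distortion estimate: upsample and compare to the original.
        distortion = float(np.mean((chroma - upsample(ds)) ** 2))
        if distortion < best_distortion:
            best_kernel, best_distortion, best_output = kernel, distortion, ds
    return best_kernel, best_output

chroma = np.random.default_rng(0).integers(0, 256, size=(4, 8)).astype(float)
kernel, ds = choose_downsample_filter(chroma, [[1.0], [0.25, 0.5, 0.25]])
print(kernel, ds.shape)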
Abstract:
Image and video processing techniques are disclosed for processing components of a color space individually by determining limits for each component based on the relationship between each component in a color space. These limits may then be used to clip each component such that the component values are within the determined range for that component. In this manner, more efficient processing of images and/or video may be achieved.
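An illustrative sketch, not the disclosed algorithm: per-component limits for Cb and Cr are derived from the requirement that the reconstructed R and B components stay within [0, 1] given the current luma value, and the chroma values are clipped to those limits. Full-range BT.709 constants are assumed, and the G constraint is omitted for brevity.

KR_COEF = 1.5748   # R = Y + 1.5748 * Cr
KB_COEF = 1.8556   # B = Y + 1.8556 * Cb

def clip_chroma(y, cb, cr):
    # Limits on Cr so that R = y + KR_COEF * cr lies in [0, 1].
    cr_min, cr_max = (0.0 - y) / KR_COEF, (1.0 - y) / KR_COEF
    # Limits on Cb so that B = y + KB_COEF * cb lies in [0, 1].
    cb_min, cb_max = (0.0 - y) / KB_COEF, (1.0 - y) / KB_COEF
    return (min(max(cb, cb_min), cb_max),
            min(max(cr, cr_min), cr_max))

# Example: a bright pixel whose Cr would push R above 1.0 gets clipped.
print(clip_chroma(0.9, 0.02, 0.4))   # Cr is reduced to (1 - 0.9) / 1.5748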
Abstract:
A method and apparatus for variable accuracy inter-picture timing specification for digital video encoding is disclosed. Specifically, the present invention discloses a system that allows the relative timing of nearby video pictures to be encoded in a very efficient manner. In one embodiment, the display time difference between a current video picture and a nearby video picture is determined. The display time difference is then encoded into a digital representation of the video picture. In a preferred embodiment, the nearby video picture is the most recently transmitted stored picture. For coding efficiency, the display time difference may be encoded using a variable length coding system or arithmetic coding. In an alternate embodiment, the display time difference is encoded as a power of two to reduce the number of bits transmitted.
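A small sketch of the power-of-two variant described above; the syntax and field names are illustrative, not the standardized or patented bitstream layout.

def encode_display_time_difference(current_time, reference_time):
    """Return the exponent e when the display time difference equals 2**e,
    so only e needs to be transmitted; fall back to the raw difference otherwise."""
    diff = current_time - reference_time
    if diff > 0 and (diff & (diff - 1)) == 0:        # diff is a power of two
        return {"mode": "power_of_two", "exponent": diff.bit_length() - 1}
    return {"mode": "raw", "difference": diff}

def decode_display_time_difference(code):
    if code["mode"] == "power_of_two":
        return 1 << code["exponent"]
    return code["difference"]

print(encode_display_time_difference(64, 56))   # diff = 8 -> exponent 3
print(encode_display_time_difference(64, 59))   # diff = 5 -> raw fallback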