摘要:
Disclosed is an apparatus (500) for generating a second compressed video stream (550) having a second resolution, from a first compressed video stream (540) having a first resolution. The apparatus comprises means (513) for extracting transform domain luma data and spatial domain chroma data from the first compressed video stream (540), means (514-516) for applying a transform domain operation to the luma data to form reconstructed transform domain luma data, means (518, 519, 560) for applying a spatial domain operation to the chroma data to form reconstructed spatial domain chroma data, and means for scaling the reconstructed transform domain luma data and reconstructed spatial domain chroma data to generate the second compressed video stream.
摘要:
Disclosed is an apparatus (500) for generating a second compressed video stream (550) having a second resolution, from a first compressed video stream (540) having a first resolution. The apparatus comprises means (513) for extracting transform domain luma data and spatial domain chroma data from the first compressed video stream (540), means (514-516) for applying a transform domain operation to the luma data to form reconstructed transform domain luma data, means (518, 519, 560) for applying a spatial domain operation to the chroma data to form reconstructed spatial domain chroma data, and means for scaling the reconstructed transform domain luma data and reconstructed spatial domain chroma data to generate the second compressed video stream.
摘要:
Disclosed is a method (800) and an apparatus (250) for generating a scaled motion vector for a particular output coding unit, the method comprising determining (802) statistics of motion data from an area-of-interest selecting (804) from a pre-defined set (805-807) of re-sampling filters a re-sampling filter dependent upon said determined statistics for the particular output coding unit, and applying the selected re-sampling filter to motion vectors from said area-of-interest to generate the scaled motion vector.
摘要:
Disclosed is a method (800) and an apparatus (250) for generating a scaled motion vector for a particular output coding unit, the method comprising determining (802) statistics of motion data from an area-of-interest selecting (804) from a pre-defined set (805-807) of re-sampling filters a re-sampling filter dependent upon said determined statistics for the particular output coding unit, and applying the selected re-sampling filter to motion vectors from said area-of-interest to generate the scaled motion vector.
摘要:
Disclosed is a method of image coding for joint decoding of images from different viewpoints using distributed coding techniques. The method receives a first set of features (205) and error correction bits (203) corresponding to a first image (201) obtained at a first viewpoint (122) and a second set of features (425) from a second image (254, 415) corresponding to a second viewpoint (124). An approximation (437) of said first image (201) at said first viewpoint (122) is determined (432, 434, 436) an based on the first and second sets of features (205, 425) and the second image at the second viewpoint. A reliability measure (445) of the approximation of the first image is then determined (450) by joint decoding (438) the approximation (437) using the error correction bits (203). The approximation of the first image is then refined iteratively (460, 438) based on the reliability measure (445) and image information (448) derived from the joint decoding.
摘要:
A system and method of generating a common ground plane from a plurality of image sequences includes detecting at least three observations for each image sequence, generating a plurality of rectified ground planes for the plurality of image sequences, determining a geometric property of the plurality of observations in the plurality of image sequences, determining a relative scaling factor of each of the plurality of rectified ground planes, and generating the common ground plane from the plurality of image sequences based on the rectified ground planes and the determined relative scaling factors.
摘要:
Methods, apparatuses (100, 400, 1000), and computer program products for generating an enhanced digital image (490, 495, 1022) comprising a plurality of pixels are disclosed. Using a first digital image (420, 1020) captured from a first camera (124) and parity bits (410, 415, 1010) generated from a second digital image captured by a second camera (122, 126), a third digital image (445, 447, 1045) is constructed. The second camera (122, 126) captures the second image at a resolution different to the resolution of the first camera (124) capturing the first image (420, 1020). A disparity map (455, 457, 1055) between the first image (420, 1020) and the third image (445, 447, 1045) is determined (450, 452, 1050). One of the first image (420, 1020) and the third image (445, 447, 1045) is enhanced (470, 472, 1070) dependent upon the determined disparity map (455, 457, 1055) to generate the enhanced digital image (490, 495, 1022).
摘要:
Methods, apparatuses (100, 400, 1000), and computer program products for generating an enhanced digital image (490, 495, 1022) comprising a plurality of pixels are disclosed. Using a first digital image (420, 1020) captured from a first camera (124) and parity bits (410, 415, 1010) generated from a second digital image captured by a second camera (122, 126), a third digital image (445, 447, 1045) is constructed. The second camera (122, 126) captures the second image at a resolution different to the resolution of the first camera (124) capturing the first image (420, 1020). A disparity map (455, 457, 1055) between the first image (420, 1020) and the third image (445, 447, 1045) is determined (450, 452, 1050). One of the first image (420, 1020) and the third image (445, 447, 1045) is enhanced (470, 472, 1070) dependent upon the determined disparity map (455, 457, 1055) to generate the enhanced digital image (490, 495, 1022).
摘要:
A method of decoding a frame (1110) of video data is disclosed. The data is encoded in a format having a first field (1031) comprising a plurality of encoded key frames and a second field (1032A; 1032B) comprising data facilitating error correction of an approximation of the frame to be decoded using the first field. The method decodes (1140; 1240) at least two key frames from the first field and then determines the approximation (1157; 1257) of the frame from the decoded key frames. The method then determines (1125; 1225) a reliability (1165; 1265) for each of at least parts of the approximation, and applies (1080; 1280) the data (1032A; 1032B) facilitating error correction to the approximation (1157; 1257) of the frame, based on the determined reliabilities for the parts to thereby form the decoded frame (1135; 1235=1110).
摘要:
An apparatus for use in video mixing of multiple video sources compressed in one or more video codecs includes a bitstream unpacker configured to receive and unpack each of the multiple video sources to provide intermediate video parameters including transform-domain coefficients, frame header information, macroblock header information, and motion vector data. The apparatus also includes an intermediate coefficient buffer coupled to the bitstream unpacker and a decision module coupled to the bitstream unpacker. The apparatus further includes a transform-domain coefficient downscaling module coupled to the intermediate coefficient buffer, a motion vector refinement module coupled to the bitstream unpacker, and a bitstream packer coupled to the decision module, the transform-domain coefficient downscaling module, and the motion vector refinement module. The bitstream packer is configured to output multiple video output streams in an output frame and the multiple output streams are compressed using the one or more video codecs.