摘要:
Provided are a method for motion estimation and an apparatus for processing an image. The method includes extracting a pattern from any one frame, classifying the extracted pattern as any one of predetermined basic patterns, counting basic patterns of regions included in macroblocks according to types of the basic patterns to set representative basic patterns of the macroblocks, calculating at least one of a sum of absolute difference (SAD) term coefficient and a smoothness constraint (SC) term coefficient from a correlation between representative basic patterns of macroblocks included in a macroblock group, and calculating a cost function result using at least one of the calculated SAD term coefficient and SC term coefficient.
摘要:
Multi-view video that is being streamed to a remote device in real time may be encoded. Frames of a real-world scene captured by respective video cameras are received for compression. A virtual viewpoint, positioned relative to the video cameras, is used to determine expected contributions of individual portions of the frames to a synthesized image of the scene from the viewpoint position using the frames. For each frame, compression rates for individual blocks of a frame are computed based on the determined contributions of the individual portions of the frame. The frames are compressed by compressing the blocks of the frames according to their respective determined compression rates. The frames are transmitted in compressed form via a network to a remote device, which is configured to render the scene using the compressed frames.
摘要:
Methods and systems for digitally enhancing the characteristics of an image sequence including video and motion picture images are provided. Highly accurate motion information represented by trajectories are generated through analyzing available image frames in the image sequence. Some embodiments of the present invention relate to generating multiple layer trajectories from an image sequence. Certain aspects may be applicable to the enhancement of three-dimensional (3D) image sequences including 3D video and 3D motion pictures.
摘要:
Objects are tracked within images. According to an example embodiment, video data is processed for tracking one or more objects. A computer circuit executes instructions to process a series of ordered video frames in a set of video data as follows. A region of interest is defined in an initial one of the video frames, and a particle mesh is formed from a set of feature points for an object in the defined region. The particle mesh is propagated to an adjacent video frame using motion vectors of points in the mesh to form a coarse boundary at the adjacent frame. Occlusion and scene boundaries are detected in the adjacent video frame, and the detected boundaries are used to set a boundary and a number of feature points within the boundary in the adjacent video frame to maintain mesh coherence and track the object in the subsequent video frames.
摘要:
A block of a video frame may be predicted and encoded using a local affine motion prediction scheme. At least three points within the video frame are selected, each point being adjacent to a respective corner of the current block to be encoded. For each point, a motion search is performed for a template block centered about the point within a reference frame to find a template reference block. The template reference block has a center point that forms an anchor point pair with the corresponding point adjacent to the current block. Using coordinates of the anchor point pairs, affine coefficients for an affine transformation are generated. Using the affine transformation, a prediction block for the current block is generated and used to encode the current block within a bitstream. Providing the local affine motion prediction scheme can improve the prediction of certain blocks with complex motion, increasing compression.
摘要:
The disclosed subject matter describes a new pattern description that can be part of an SEI message or a parameter set, and can be used to describe a temporal picture coding structure and associated bitrates and frame rates. The knowledge of the coding structure can benefit transraters, bitstream extractors, and digital video recorders. Decoders can utilize the coding structure information for example, to decode pictures in parallel, using multiple threads/cores.
摘要:
Parametric Motion Vector Prediction (PMVP) methodologies and components and systems for performing those methodologies are provided to more effectively and efficiently encode video content that includes complex motion such as zoom or rotation. By substituting the PMVP for a collocated MVP used in HEVC in order to reduce the amount of bit rate increase required when including the PMVP analysis in the bit stream. Further, compression of the motion vectors is provided in a three stage approach based on transformation, quantization and difference coding.
摘要:
A method and apparatus for performing hybrid multihypothesis prediction during video coding of a coding unit includes: processing a plurality of sub-coding units in the coding unit; and performing disparity vector (DV) derivation when the coding unit is processed by a 3D or multi-view coding tool or performing block vector (BV) derivation when the coding unit is processed by intra picture block copy (IntraBC) mode. The step of performing DV or BV derivation includes deriving a plurality of vectors for multihypothesis motion-compensated prediction of a specific sub-coding unit from at least one other sub-coding/coding unit. The one other sub-coding/coding unit is coded before the corresponding DV or BV is derived for multihypothesis motion-compensated prediction of the specific sub-coding unit. A linear combination of a plurality of pixel values derived from the plurality of vectors is used as a predicted pixel value of the specific sub-coding unit.
摘要:
Systems and methods of improving video encoding/decoding efficiency may be provided. A feature-based processing stream is applied to video data having a series of video frames. Computer-vision-based feature and object detection algorithms identify regions of interest throughout the video datacube. The detected features and objects are modeled with a compact set of parameters, and similar feature/object instances are associated across frames. Associated features/objects are formed into tracks, and each track is given a representative, characteristic feature. Similar characteristic features are clustered and then stored in a model library, for reuse in the compression of other videos. A model-based compression framework makes use of the preserved model data by detecting features in a new video to be encoded, relating those features to specific blocks of data, and accessing similar model information from the model library. The formation of model libraries can be specialized to include personal, “smart” model libraries, differential libraries, and predictive libraries. Predictive model libraries can be modified to handle a variety of demand scenarios.
摘要:
A progressive mesh decoding apparatus, a progressive mesh decoding method and a computer readable recording medium using the same are provided. In the progressive mesh decoding method, a first desired resolution mesh is progressively reconstructed from a base mesh through vertex addition. Vertex level information indicating a resolution level of a mesh corresponding to a predetermined process of the first desired resolution mesh reconstructing process is recorded for respective vertices added at the predetermined process. In response to a second desired resolution mesh having a second desired resolution level lower than the first desired resolution level being requested, the second desired resolution mesh is reconstructed by deleting vertices from the first desired resolution mesh, wherein vertices to be deleted in a predetermined process of the second desired resolution mesh reconstructing process are determined based on the vertex level information.