摘要:
One embodiment of the invention relates to a method of merging segments to form supersegments in an image. The image consists of a plurality of segments that are constituent portions of the image. At least one candidate segment(s) and at least one neighboring segment(s) for each candidate segment are identified. An error statistic for each pair, consisting of a candidate segment and a corresponding neighboring segment, is computed. A neighboring segment is determined that results in a smallest error statistic for a given candidate segment. A determination is also made as to whether the smallest error statistic is sufficiently small to merit merging of the corresponding pair of segments. The corresponding pair of segments is merged to create one supersegment. The supersegment is a new segment including all pixels formerly contained in one of the two segments that were merged.
摘要:
One embodiment disclosed relates to the use of object motion estimation to interlace a progressive video sequence. One of a plurality of consecutive frames is segmented and motion vectors for each segment are determined though object motion estimation. Interpolated motion vectors are used to construct at least one intermediate frame, and interlaced fields are extracted from the new sequence of frames that includes intermediate frames. An interlaced sequence with smooth, incremental motion is thus constructed from a progressive video sequence.
摘要:
A method and apparatus for temporally filtering a video sequence using motion compensation in which motion information captures the motion of objects is disclosed. Pixels from a current frame are aligned with matching pixels from previous and/or future frames according to the motion of the surrounding object of arbitrary shape. A weighted average of the corresponding pixels is taken for each pixel in the current frame to produce a filtered version of the current frame. The weights may be adjusted to compensate for similarities between the frames and for blur transitions near object boundaries. A lighting offset may also be used to prevent erroneous lighting shifts during filtering.
摘要:
One embodiment relates to a computer-implemented method of image segmentation using automated saddle point detection. An edge map is created by edge detection, and a distance map is generated based on the edge map. Saddle points are detected using the distance map. Connector pixels are determined using the saddle points, and connector pixels forming valid connecting paths are marked as edge pixels. Finally, flood filling is performed within edges to designate image segments. Other features, aspects and embodiments are also disclosed.
摘要:
A process and apparatus for identifying abrupt cuts or scene changes in any ordered sequence of images. In one specific embodiment, two or more consecutive images from a sequence are introduced to a segmenter as digital frames. The segmenter independently divides each of these frames into pixel regions or segments according to some common characteristic so that every pixel belongs to exactly one segment. A segment analysis unit then performs some statistical analysis on the segment data for each of the frames and generates composite statistics for each frame. A frame comparison unit then examines these composite statistics to determine whether these frames belong to a consistent scene of images. If the composite statistics for these frames differ sufficiently, the comparison unit declares the latter frame in the sequence to belong to a new scene. This information may then be transmitted back to the data source for the purpose of marking the scene change or for any other purpose.
摘要:
In a method and apparatus for predicting and coding motion vectors in a video compression scheme, an ordered list of segments for a reference frame is used to create a hierarchy of segments with a plurality of levels. Motion vectors for segments in the top level are entropy coded, and these vectors are used to predict vectors for segments at the next level. Residual vectors are entropy coded to correct these predictions, and the process of prediction from above and coding residuals continues recursively down through the hierarchy of segment levels. Information about the previous motion of segments may be exploited in the prediction process. In a complementary method and apparatus for decoding motion vectors, the same segment hierarchy is used to predict motion vectors by the same method used during encoding, and these predicted vectors are added to residual motion vectors to reconstruct the actual motion vectors.
摘要:
One embodiment relates to a computer-implemented method for the automated extraction of objects from a video stream. The method includes an automated procedure for creating a temporal graph, and an automated procedure for cutting the graph into graph partitions. The method further includes an automated procedure for mapping the graph partitions to pixels in frames of the video stream. Other features, aspects and embodiments are also disclosed.
摘要:
An automated method for classifying an object in a sequence of video frames. The object is tracked in multiple frames of the sequence of video frame, and feature descriptors are determined for the object for each of the multiple frames. Multiple classification scores are computed by matching said feature descriptors for the object for each of the multiple frames with feature descriptors for a candidate class in a classification database. Said multiple classification scores are aggregated to generate an estimated probability that the object is a member of the candidate class. Other embodiments, aspects and features are also disclosed.
摘要:
One embodiment disclosed relates to the use of object motion estimation to interlace a progressive video sequence. One of a plurality of consecutive frames is segmented and motion vectors for each segment are determined though object motion estimation. Interpolated motion vectors are used to construct at least one intermediate frame, and interlaced fields are extracted from the new sequence of frames that includes intermediate frames. An interlaced sequence with smooth, incremental motion is thus constructed from a progressive video sequence.
摘要:
One embodiment of the invention pertains to a method of determining a measure of image complexity. An image is subdivided the image into a plurality of small image regions. Multiple statistical tests are performed to determine the similarity of a pair of adjacent image regions. If said pair passes the multiple statistical tests, then the pair of adjacent image regions are grouped together into one new region. The resulting merged regions may be weighted according to geometry and/or color variance, and the weights may be summed to produce an image complexity measure.