Abstract:
A method and system for computing at least one of spatial and temporal relationships between at least first and second sequences of representations having respective first and second temporal progressions. The method includes employing the first and second temporal progressions to obtain at least one of the spatial and temporal relationships.
Abstract:
A system and method for increasing the space or time resolution of an image sequence by combining a plurality of input sources with different space-time resolutions, such that the single output displays increased accuracy and clarity without the calculation of motion vectors. This system of enhancement may be used with any optical recording device, including but not limited to digital video, analog video and still pictures of any format. The present invention supports features such as single-frame resolution increase, combination of a number of still pictures, the option to calibrate spatial or temporal enhancement or any combination thereof, and increased video resolution by using high-resolution still cameras as enhancement additions. It may optionally be implemented using a camera synchronization method.
Abstract:
A method and system for aligning in at least one of time and space temporally ordered sequences of images comprising receiving a plurality of sequences of images, each sequence including a multiplicity of images, each plurality of sequences defining a space-time volume, and providing an output indication relating at least one point in a space-time volume corresponding to one of the plurality of sequences to at least one point in a space-time volume corresponding to at least another one of the plurality of sequences.
Abstract:
A method for measuring bi-directional similarity between a first signal of a first size and a second signal of a second size includes matching at least some patches of the first signal with patches of the second signal for data completeness, matching at least some patches of the second signal with patches of the first signal for data coherence, calculating the bi-directional similarity measure as a function of the matched patches for coherence and the matched patches for completeness and indicating the similarity between the first signal and the second signal. Another method generates a second signal from a first signal where the second signal is different than the first signal by at least one parameter. The method includes attempting to maximize a bi-directional similarity measure between the second signal and the first signal.
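A minimal sketch of the completeness and coherence terms for 1-D signals follows. The abstract does not specify the patch distance or how the two terms are combined, so the sum of squared differences (SSD), the per-direction averaging, and the plain sum of the two terms are assumptions made here for illustration. All function names are hypothetical. The value returned is a dissimilarity: lower means the two signals are more similar in both directions.

```python
def patches(signal, size):
    """Return every contiguous patch of length `size` in a 1-D signal."""
    return [tuple(signal[i:i + size]) for i in range(len(signal) - size + 1)]


def ssd(p, q):
    """Sum of squared differences between two equal-length patches (assumed metric)."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def directed_distance(src, dst, size):
    """Average, over the patches of `src`, of the SSD to the best-matching patch in `dst`."""
    src_patches = patches(src, size)
    dst_patches = patches(dst, size)
    total = sum(min(ssd(p, q) for q in dst_patches) for p in src_patches)
    return total / len(src_patches)


def bidirectional_dissimilarity(first, second, size=3):
    """Completeness term plus coherence term (the plain sum is an assumption)."""
    # Completeness: patches of the first signal matched against the second.
    completeness = directed_distance(first, second, size)
    # Coherence: patches of the second signal matched against the first.
    coherence = directed_distance(second, first, size)
    return completeness + coherence


if __name__ == "__main__":
    full = [1, 2, 3, 4, 5, 6]
    summary = [1, 2, 3]
    print(bidirectional_dissimilarity(full, summary))
```

In this example the short signal is fully coherent with the long one, since its only patch occurs in the long signal, but it is not complete, since most patches of the long signal have no close match in it. A second signal generated to minimize this value would therefore have to trade off both terms.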
Abstract:
A method and apparatus for detecting moving objects in both two-dimensional and three-dimensional scenes. The method repetitively applies a two-dimensional transformation to a plurality of images representing a scene to identify misaligned regions within the images. Any residual motion represented by the misaligned regions that may be classified as a moving object within the scene is further processed by a three-dimensional technique that removes parallax motion from the residual motion. The result is motion contained in an epipolar flow field which is only due to a moving object within the scene.
Abstract:
A system for generating three-dimensional mosaics from a plurality of input images representing an imaged scene. The plurality of input images contains at least two images of a single scene, where at least two of the images have overlapping regions. The system combines the images using a parallax-based approach that generates a three-dimensional mosaic comprising an image mosaic representing a panoramic view of the scene and a shape mosaic representing the three-dimensional geometry of the scene. Specifically, in one embodiment, the system registers the input images along a parametric surface within the imaged scene and derives translation vectors useful in aligning the images into a two-dimensional image mosaic. Once registered, the system generates a shape mosaic representing objects within the scene.
Abstract:
A method includes finding regions of a reference signal which provide at least one of: local evidence scores and a global evidence score. The local evidence scores indicate local similarity of the regions of the reference signal to regions of a query signal, and the global evidence score defines the extent of a global similarity of the query signal to the reference signal. A media exploring device is also included which includes an importance encoder and a media explorer. The importance encoder generates importance scores of at least portions of digital media as a function of at least one of local evidence scores and global evidence scores. The media explorer enables exploring through the digital media according to (i) the importance scores and (ii) data associations/links induced by the evidence scores between different portions of the digital media. The device may also include a media player to play the digital media with adaptive speeds as a function of the importance scores. The device may also include a labeling/annotation module which inherits labels/annotations/markings according to the above-mentioned data associations.
Abstract:
A method implementable on a computing device includes exploiting data redundancy to combine high frequency information from at least two different scales of an input signal to generate a super resolution version of said input signal. An alternative method includes exploiting recurrence of data from an input signal in at least two different scales of at least one reference signal to extract and to combine high frequency information from a plurality of scales of said at least one reference signal to generate a super resolution version of said input signal. An alternative method includes generating a super resolution version of a single input video sequence in at least the temporal dimension by exploiting data recurrence within the input video sequence or with respect to an external database of example video sequences. A signal may be an image, a video sequence, an audio signal, etc.
Abstract:
A method includes matching at least portions of first and second signals using local self-similarity descriptors of the signals. The matching includes computing a local self-similarity descriptor for each one of at least a portion of points in the first signal, forming a query ensemble of the descriptors for the first signal and seeking an ensemble of descriptors of the second signal which matches the query ensemble of descriptors. This matching can be used for image categorization, object classification, object recognition, image segmentation, image alignment, video categorization, action recognition, action classification, video segmentation, video alignment, signal alignment, multi-sensor signal alignment, multi-sensor signal matching, optical character recognition, image and video synthesis, correspondence estimation, signal registration and change detection. It may also be used to synthesize a new signal with elements similar to those of a guiding signal synthesized from portions of the reference signal. Apparatus is also included.
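The abstract does not define how a local self-similarity descriptor is computed, so the following is only one plausible reading, sketched for a 1-D signal: the small patch centered at a point is compared with every patch in a larger surrounding region, and each squared difference is turned into a similarity in (0, 1]. The exponential mapping, the parameters `patch_radius`, `region_radius` and `scale`, and the function name are assumptions introduced here, not details taken from the source.

```python
import math


def self_similarity_descriptor(signal, center, patch_radius=1, region_radius=3, scale=1.0):
    """Similarity of the patch at `center` to each patch in its surrounding region.

    Returns one value per offset in [-region_radius, region_radius]; the entry
    for offset 0 is always 1.0 because the patch is compared with itself.
    """
    lowest = center - region_radius - patch_radius
    highest = center + region_radius + patch_radius
    if lowest < 0 or highest >= len(signal):
        raise ValueError("region around `center` falls outside the signal")

    def patch(i):
        return signal[i - patch_radius:i + patch_radius + 1]

    reference = patch(center)
    descriptor = []
    for offset in range(-region_radius, region_radius + 1):
        candidate = patch(center + offset)
        ssd = sum((a - b) ** 2 for a, b in zip(reference, candidate))
        # Assumed mapping from distance to similarity: exp(-SSD / scale).
        descriptor.append(math.exp(-ssd / scale))
    return descriptor


if __name__ == "__main__":
    striped = [0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0]
    brighter = [value + 10 for value in striped]
    print(self_similarity_descriptor(striped, 5))
    print(self_similarity_descriptor(brighter, 5))
```

Both calls print the same descriptor, because it records how the local pattern repeats around the point rather than the raw values. That property is what would let descriptors from two differently rendered signals be gathered into ensembles and compared.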