摘要:
A method and apparatus for accurately computing parallax information as captured by imagery of a scene. The method computes the parallax information of each point in an image by computing the parallax within windows that are offset with respect to the point for which the parallax is being computed. Additionally, parallax computations are performed over multiple frames of imagery to ensure accuracy of the parallax computation and to facilitate correction of occluded imagery.
摘要:
A method and apparatus for accurately computing parallax information as captured by imagery of a scene. The method computes the parallax information of each point in an image by computing the parallax within windows that are offset with respect to the point for which the parallax is being computed. Additionally, parallax computations are performed over multiple frames of imagery to ensure accuracy of the parallax computation and to facilitate correction of occluded imagery.
摘要:
A method and apparatus for accurately computing parallax information as captured by imagery of a scene. The method computes the parallax information of each point in an image by computing the parallax within windows that are offset with respect to the point for which the parallax is being computed. Additionally, parallax computations are performed over multiple frames of imagery to ensure accuracy of the parallax computation and to facilitate correction of occluded imagery.
摘要:
An embodiment of the invention is a system and process for true multi-image alignment that does not rely on the measurements of a reference image being distortion free. For instance, lens distortion is a common imaging phenomenon. When lens distortion is present, none of the images can be assumed to be ideal. In an embodiment of the invention, all the images are modeled as intensity measurements represented in their respective coordinate systems, each of which is related to a reference coordinate system through an interior camera transformation and an exterior view transformation. Motion parameters determined in accordance with an embodiment of the invention dictate the position of the input frames within the reference frame. A reference coordinate system is used, but not a reference image. Motion parameters are computed to warp all input images to a virtual image mosaic in the reference coordinate system of the reference frame. Each pixel in the virtual image mosaic may be predicted by intensities at corresponding pixel positions from more than one image. The error measure, which is the sum of the variances of predicted pixel intensities at each pixel location summed over the virtual image mosaic, is minimized. The embodiment of the invention advantageously maximally uses information present in all images.
摘要:
A method of constructing an image mosaic comprising the steps of selecting source images, aligning the source images, selecting source segments, enhancing the images, and merging the images to form the image mosaic is disclosed. An apparatus for constructing an image mosaic comprising means for selecting source images, means for aligning the source images, means for selecting source image segments, means for enhancing the images, and means for merging the images to form the image mosaic is also disclosed. The process may be performed automatically by the system or may be guided interactively by a human operator. Applications include the construction of photographic quality prints form video and digital camera images.
摘要:
A method and concomitant apparatus for comprehensively representing video information in a manner facilitating indexing of the video information. Specifically, a method according to the inveniton comprises the steps of dividing a continuous video stream into a plurality of video scenes; and at least one of the steps of dividing, using intra-scene motion analysis, at least one of the plurality of scenes into one or more layers; representing, as a mosaic, at least one of the pluraliy of scenes; computing, for at least one layer or scene, one or more content-related appearance attributes; and storing, in a database, the content-related appearance attributes or said mosaic representations.
摘要:
A method for detecting a moving target is disclosed that receives a plurality of images from at least one camera; receives a measurement of scale from one of a measurement device and a second camera; calculates the pose of the at least one camera over time based on the plurality of images and the measurement of scale; selects a reference image and an inspection image from the plurality of images of the at least one camera; and detects a moving target from the reference image and the inspection image based on the orientation of corresponding portions in the reference image and the inspection image relative to a location of an epipolar direction common to the reference image and the inspection image; and displays any detected moving target on a display. The measurement of scale can derived from a second camera or, for example, a wheel odometer. The method can also detect moving targets by combining the above epipolar method with a method based on changes in depth between the inspection image and the reference image and based on changes in flow between the inspection image and the reference image.
摘要:
The present invention provides an improved system and method for estimating range of the objects in the images from various distances. The method comprises receiving a set of images of the scene having multiple objects from at least one camera in motion. Due to the motion of the camera, each of the images are obtained at different camera locations Then an object visible in multiple images is selected. Data related to approximate camera positions and orientations and the images of the visible object are used to estimate the location of the object relative to a reference coordinate system. Based on the computed data, a projected location of the visible object is computed and the orientation angle of the camera for each image is refined. Additionally, pairs of cameras with various locations can then be chosen to obtain dense stereo for regions of the image at various ranges. The process is further structured so that as new images arrive, they are incorporated into the pose adjustment so that the dense stereo results can. be updated.
摘要:
The present invention is embodied in a method for representing and analyzing spatiotemporal data in order to make qualitative yet semantically meaningful distinctions among various regions of the data at an early processing stage. In one embodiment of the invention, successive frames of image data are analyzed to classify spatiotemporal regions as being stationary, exhibiting coherent motion, exhibiting incoherent motion, exhibiting scintillation and so lacking in structure as to not support further inference. The exemplary method includes filtering the image data in a spatiotemporal plane to identify regions that exhibit various spatiotemporal characteristics. The output data provided by these filters is then used to classify the data.
摘要:
The present invention provides an improved method for estimating range of objects in images from various distances comprising receiving a set of images of the scene having multiple objects from at least one camera in motion. Due to the motion of the camera, each of the images are obtained at different camera locations. Then an object visible in multiple images is selected. Data related to approximate camera positions and orientations and the images of the visible object are used to estimate the location of the object relative to a reference coordinate system. Based on the computed data, a projected location of the visible object is computed and the orientation angle of the camera for each image is refined. Additionally, pairs of cameras with various locations can obtain dense stereo for regions of the image at various ranges.