Abstract:
Methods and apparatus for visual saliency estimation for images and video are described. In an embodiment, a process includes decomposing, by a processor, an image into elements, wherein each element includes at least one pixel. The processor then calculates a first image measure indicative of each element's uniqueness in the image on a per element basis, and a second image measure indicative of each element's spatial distribution in the image on a per element basis. A per element saliency measure is provided by combining the first image measure and the second image measure, or by utilizing the first image measure, or by utilizing the second image measure.
Abstract:
Natural looking output images are computed from input images based on given user constraints. Pixels in the output images are assigned a shift such that the respective output pixel value is derived from the value of the input pixel whose location is related to that of the output pixel by the shift, at least one shift being non-zero. The shift is determined by an optimization process adapted to minimize a cost function that includes a data term on the shifts of single pixels and a smoothness term on the shifts of pixel pairs. The output image is computed by applying the optimized shift-map between the input and output pixels. The data term can include shift constraints that limit the location in the output images of selected input pixels, and saliency constraints, indicating a preference that selected pixels in the input images will or will not appear in the output image.
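The cost function described above, a data term on single-pixel shifts plus a smoothness term on pixel pairs, can be sketched on a one-dimensional signal. This is an illustrative toy under stated assumptions, not the patented procedure: the specific forms of `data_cost` and `smoothness_cost`, the `pinned` dictionary used to express shift constraints, and the weight `alpha` are all hypothetical, and the exhaustive search stands in for whatever optimizer an actual implementation would use.

```python
import itertools
import math

def apply_shift_map(src, shifts):
    """Output pixel i takes the value of input pixel i + shifts[i]."""
    return [src[i + t] for i, t in enumerate(shifts)]

def data_cost(i, t, n_src, pinned):
    """Data term on one pixel's shift (assumed form).

    pinned maps an output index to the input index it must show,
    which models a shift constraint as an infinite penalty.
    """
    j = i + t
    if not 0 <= j < n_src:
        return math.inf
    if i in pinned and pinned[i] != j:
        return math.inf
    return 0.0

def smoothness_cost(src, i, t_left, t_right):
    """Smoothness term on the neighbor pair (i, i + 1) (assumed form).

    Where the shifts differ, compare what pixel i + 1 shows with what it
    would show if it shared its left neighbor's shift.
    """
    if t_left == t_right:
        return 0.0
    shown = i + 1 + t_right
    expected = i + 1 + t_left
    if not (0 <= shown < len(src) and 0 <= expected < len(src)):
        return math.inf
    return float((src[shown] - src[expected]) ** 2)

def energy(src, shifts, pinned, alpha=1.0):
    """Total cost: weighted data term plus smoothness term."""
    data = sum(data_cost(i, t, len(src), pinned) for i, t in enumerate(shifts))
    smooth = sum(smoothness_cost(src, i, shifts[i], shifts[i + 1])
                 for i in range(len(shifts) - 1))
    return alpha * data + smooth

# Shrink a 5-pixel signal to 4 pixels while keeping both end pixels.
src = [1, 1, 5, 9, 9]
pinned = {0: 0, 3: 4}  # output 0 shows input 0, output 3 shows input 4
best = min(itertools.product([0, 1], repeat=4),
           key=lambda s: energy(src, s, pinned))
print(best, apply_shift_map(src, best))
```

The search places the single change of shift where the input is locally constant, which is the behavior the smoothness term is meant to encourage. Exhaustive enumeration is only feasible for a toy case like this one.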
Abstract:
A system for generating a stereoscopic panoramic image pair having left and right panoramic mosaic images for use in facilitating stereoscopic viewing of a panoramic scene, the system comprising a panoramic image generator and a strip separation value generation module. The panoramic image generator is configured to mosaic together respective left and right image strips from respective ones of a plurality of images to form the respective left and right panoramic images. The panoramic image generator is configured to use strip separation values for the respective images to determine the separation of the respective left and right image strips in generating the respective left and right panoramic images. The strip separation value generation module is configured to generate, for respective ones of said images, the strip separation values so as to provide at least one selected disparity for objects in the panoramic scene.
Abstract:
A computer-implemented method, system, and techniques for summarization, searching, and indexing of video are provided. Data related to objects detected in the video in a selected time interval is received, and the objects are grouped into clusters such that each cluster includes objects that are similar with respect to a selected feature or a combination of features. A video summary is generated based on the computed clusters.
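The clustering step can be sketched as follows. The abstract does not name a clustering algorithm or a feature, so this example assumes a simple greedy scheme: each object in the selected time interval joins the nearest existing cluster if its feature vector lies within a distance threshold of that cluster's centroid, and otherwise starts a new cluster. The record fields `id`, `time`, and `feature`, and the functions `cluster_objects` and `summarize`, are hypothetical names for illustration.

```python
import math

def cluster_objects(objects, t_start, t_end, threshold):
    """Greedily cluster objects detected within [t_start, t_end] by feature distance."""
    clusters = []  # each cluster: {"centroid": [...], "members": [...]}
    for obj in objects:
        if not t_start <= obj["time"] <= t_end:
            continue  # outside the selected time interval
        nearest, nearest_d = None, threshold
        for cluster in clusters:
            d = math.dist(obj["feature"], cluster["centroid"])
            if d <= nearest_d:
                nearest, nearest_d = cluster, d
        if nearest is None:
            clusters.append({"centroid": list(obj["feature"]), "members": [obj]})
        else:
            nearest["members"].append(obj)
            n = len(nearest["members"])
            dims = len(obj["feature"])
            # Recompute the centroid as the mean feature of all members.
            nearest["centroid"] = [
                sum(m["feature"][k] for m in nearest["members"]) / n
                for k in range(dims)
            ]
    return clusters

def summarize(clusters):
    """A stand-in summary: the object ids grouped per cluster."""
    return [[m["id"] for m in c["members"]] for c in clusters]

detections = [
    {"id": "a", "time": 1, "feature": (0.0, 0.0)},
    {"id": "b", "time": 2, "feature": (0.1, 0.0)},
    {"id": "c", "time": 3, "feature": (5.0, 5.0)},
    {"id": "d", "time": 50, "feature": (0.0, 0.0)},  # outside the interval
    {"id": "e", "time": 4, "feature": (5.2, 5.0)},
]
summary = summarize(cluster_objects(detections, t_start=0, t_end=10, threshold=1.0))
print(summary)
```

A real system would presumably render each cluster as video rather than as a list of ids; the grouping is the part this sketch is meant to show.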
Abstract:
A method for identifying a blur profile of a multi image display with a first image separating mask. The method comprises displaying a calibration pattern through a second image separating mask, allowing an evaluator to provide a visual estimation indicating a blur brought about to the calibration pattern by the second image separating mask, and generating a blur profile of at least the first image separating mask according to the visual estimation. The first and second image separating masks have a substantially similar optical profile.
Abstract:
Method and apparatus for generating images of a scene from image data of the scene and displaying the images to provide a sense of depth. In some embodiments of the method and apparatus the generated images are mosaics.
Abstract:
A computer-implemented method and system determine camera movement of a new frame relative to a sequence of frames of images containing at least one dynamic object and for which relative camera movement is assumed. Color values of the pixels in the new frame are predicted from changes in the color values of sets of pixels in different frames of the sequence, where the respective locations of all pixels in each set are adjusted so as to neutralize the effect of camera movement between the frames containing those pixels. The predicted color values are used to determine camera movement as the relative movement between the new frame and the predicted frame. An embodiment of the invention maintains an aligned space-time volume of frames for which camera movement is neutralized and adds each new frame to the aligned space-time volume after neutralizing camera movement in the new frame.
Abstract:
In a system and method for generating a synopsis video from a source video, at least three different source objects are selected according to one or more defined constraints, each source object being a connected subset of image points from at least three different frames of the source video. One or more synopsis objects are sampled from each selected source object by temporal sampling using image points derived from specified time periods. For each synopsis object a respective time for starting its display in the synopsis video is determined, and for each synopsis object and each frame a respective color transformation for displaying the synopsis object may be determined. The synopsis video is displayed by displaying selected synopsis objects at their respective time and color transformation, such that in the synopsis video at least three points that each derive from different respective times in the source video are displayed simultaneously.
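The effect of assigning each synopsis object its own start time can be shown with a small sketch. It assumes each object is summarized by the source frame where it begins and its length in frames, and that start times in the synopsis are already given; how those start times are chosen is not modeled here. The function `synopsis_frame` and the example object names are hypothetical.

```python
def synopsis_frame(objects, start_times, t):
    """Return {object_id: source_frame} for the objects visible at synopsis frame t.

    objects maps an id to (source_start_frame, length_in_frames);
    start_times maps an id to the synopsis frame where its display begins.
    """
    visible = {}
    for oid, (src_start, length) in objects.items():
        offset = t - start_times[oid]
        if 0 <= offset < length:
            visible[oid] = src_start + offset
    return visible

# Three objects that occur far apart in the source video.
objects = {"car": (100, 30), "person": (900, 40), "dog": (2500, 20)}
start_times = {"car": 0, "person": 5, "dog": 10}

frame_12 = synopsis_frame(objects, start_times, t=12)
print(frame_12)
```

At synopsis frame 12 all three objects are on screen together even though their pixels come from three different source times, which is the simultaneous display the abstract describes.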