摘要:
The handling of occlusions in stereo imaging is disclosed. In one implementation, an association between a discontinuity in one stereo image and an occlusion in a second stereo image is utilized. In such an implementation, the first and second stereo images are segmented. A mapping of a discontinuity within the second stereo image is used to form at least part of a boundary of an occlusion in the first stereo image. The mapped discontinuity is found at a boundary between two segments in the second stereo image, and once mapped, divides a segment in the first stereo image into two patches. An energy calculation is made in an iterative manner, alternating with changes to a solution with the disparities and occlusions of the patches. Upon minimization, disparities and occlusions at the patch and pixel level are available.
摘要:
Systems and methods of segmenting images are disclosed herein. The similarity of images in a set of images is compared. A group of images is selected from the set of images. The images in the group of images are selected based on compared similarities among the images. An informative image is selected from the group of images. User-defined semantic information of the informative image is received. The group of images as a graph is modeled as a graph. Each image in the group of images denotes a node in the graph. Edges of the graph denote a foreground relationship between images or a background relationship between images. One or more images in the group of images are automatically segmented by propagating the semantic information of the informative image to images in the group of images having a corresponding graph node that is related to a graph node corresponding to the informative image. Segmentation results can be refined according to user provided image semantics.
摘要:
A search includes comparing a query image provided by a user to a plurality of stored images of faces stored in a stored image database, and determining a similarity of the query image to the plurality of stored images. One or more resultant images of faces, selected from among the stored images, are displayed to the user based on the determined similarity of the stored images to the query image provided by the user. The resultant images are displayed based at least in part on one or more facial features.
摘要:
A Poisson-quantization noise model for modeling noise in low-light conditions is described. In one aspect, image information is received. A Poisson-quantization noise model is then generated from a Poisson noise model and a quantization noise model. Poisson-quantization noise is then estimated in the image information using the Poisson-quantization noise model.
摘要:
Salience-preserving image fusion is described. In one aspect, multi-channel images are fused into a single image. The fusing operations are based on importance-weighted gradients. The importance weighted gradients are measured using respective salience maps for each channel in the multi-channel images.
摘要:
Systems and methods of segmenting images are disclosed herein. The similarity of images in a set of images is compared. A group of images is selected from the set of images. The images in the group of images are selected based on compared similarities among the images. An informative image is selected from the group of images. User-defined semantic information of the informative image is received. The group of images as a graph is modeled as a graph. Each image in the group of images denotes a node in the graph. Edges of the graph denote a foreground relationship between images or a background relationship between images. One or more images in the group of images are automatically segmented by propagating the semantic information of the informative image to images in the group of images having a corresponding graph node that is related to a graph node corresponding to the informative image. Segmentation results can be refined according to user provided image semantics.
摘要:
Progressive cut interactive object segmentation is described. In one implementation, a system analyzes strokes input by the user during iterative image segmentation in order to model the user's intention for refining segmentation. In the user intention model, the color of each stroke indicates the user's expectation of pixel label change to foreground or background, the location of the stroke indicates the user's region of interest, and the position of the stroke relative to a previous segmentation boundary indicates a segmentation error that the user intends to refine. Overexpansion of pixel label change is controlled by penalizing change outside the user's region of interest while overshrinkage is controlled by modeling the image as an eroded graph. In each iteration, energy consisting of a color term, a contrast term, and a user intention term is minimized to obtain a segmentation map.
摘要:
Systems and methods for video completion by motion field transfer are described. In one aspect, a spatio-temporal target patch of an input video data sequence is filled in or replaced by motion field transfer from a spatio-temporal source patch of the input video data sequence. Color is propagated to corresponding portions of the spatio-temporal target patch by treating the transferred motion information as directed edges These motion field transfer and color propagation operations result in a video completed spatio-temporal target patch. The systems and methods present the video data sequence, which now includes the video completed spatio-temporal target patch, to user for viewing.
摘要:
Some implementations provide techniques and arrangements to address intrapersonal variations encountered during facial recognition. For example, some implementations employ an identity data set having a plurality of images representing different intrapersonal settings. A predictive model may associate one or more input images with one or more images in the identity data set. Some implementations may use an appearance-prediction approach to compare two images by predicting an appearance of at least one of the images under an intrapersonal setting of the other image. Further, some implementations may utilize a likelihood-prediction approach for comparing images that generates a classifier for an input image based on an association of an input image with the identity data set.
摘要:
Some implementations provide techniques and arrangements to address intrapersonal variations encountered during facial recognition. For example, some implementations transform at least a portion of an image from a first intrapersonal condition to a second intrapersonal condition to enable more accurate comparison with another image. Some implementations may determine a pose category of an input image and may modify at least a portion of the input image to a different pose category of another image for comparing the input image with the other image. Further, some implementations provide for compression of data representing at least a portion of the input image to decrease the dimensionality of the data.