Abstract:
A system and method recognize and track human motion from different motion classes. In a learning stage, a discriminative model is learned to project motion data from a high dimensional space to a low dimensional space while enforcing discrimination between motions of different motion classes in the low dimensional space. Additionally, the low dimensional data may be clustered into motion segments, and motion dynamics may be learned for each motion segment. In a tracking stage, a representation of human motion comprising at least one class of motion is received. The tracker recognizes and tracks the motion based on the learned discriminative model and the learned dynamics.
Abstract:
Decomposing a set of unlabeled images of a collection of objects, acquired under different imaging conditions, into disjoint subsets corresponding to individual objects requires clustering. Appearance-based methods for clustering a set of images of 3-D objects acquired under varying illumination conditions can be based on the concept of illumination cones. The clustering problem is equivalent to finding convex polyhedral cones in the high-dimensional image space. To efficiently determine the conic structures hidden in the image data, the concept of conic affinity can be used, which measures the likelihood that a pair of images belongs to the same underlying polyhedral cone. Other algorithms can use an affinity measure that operates directly on the image gradients, comparing their magnitudes and orientations.
Abstract:
A system and a method are disclosed for an adaptive discriminative generative model with a probabilistic interpretation. As applied to visual tracking, the discriminative generative model separates the target object from the background more accurately and efficiently than conventional methods. A computationally efficient algorithm constantly updates the discriminative model over time. The discriminative generative model adapts to accommodate dynamic appearance variations of the target and background. Experiments show that the discriminative generative model effectively tracks target objects undergoing large pose and lighting changes.
Abstract:
An advantage of the present invention is that it detects objects appropriately. The object detection apparatus has a plurality of cameras for determining the distance to objects; a distance determination unit that determines those distances; a histogram generation unit that records the frequency of pixels at each distance; an object distance determination unit that determines the most likely distance; a probability mapping unit that assigns probabilities to the pixels based on the difference in distance; a kernel detection unit that determines a kernel region as a group of pixels; a periphery detection unit that determines a peripheral region as a group of pixels selected from those close to the kernel region; and an object specifying unit that specifies the object region where the object is present with a predetermined probability.
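The histogram, object-distance, probability-mapping, and kernel-detection steps can be sketched roughly as follows. This is a minimal illustration, not the apparatus itself: the function names, the bin width, the Gaussian fall-off, and the threshold are all assumptions made for the example, and the "difference in distance" is assumed here to mean the difference between a pixel's distance and the most likely object distance.

```python
import math
from collections import Counter


def most_likely_distance(distances, bin_width=0.5):
    """Histogram the per-pixel distances and return the centre of the fullest bin."""
    bins = Counter(int(d // bin_width) for d in distances)
    best_bin, _count = max(bins.items(), key=lambda item: item[1])
    return (best_bin + 0.5) * bin_width


def probability_map(distances, object_distance, sigma=0.5):
    """Assign each pixel a score in (0, 1] that decays with its distance difference.

    The Gaussian shape is an assumption; the abstract does not specify the mapping.
    """
    return [
        math.exp(-((d - object_distance) ** 2) / (2.0 * sigma ** 2))
        for d in distances
    ]


def kernel_region(probabilities, threshold=0.8):
    """Return the indices of pixels whose probability meets the (assumed) threshold."""
    return [i for i, p in enumerate(probabilities) if p >= threshold]


# Hypothetical per-pixel distances in metres for a tiny one-row image.
pixel_distances = [2.1, 2.2, 2.3, 2.4, 5.0, 7.9, 2.2]
peak = most_likely_distance(pixel_distances)
kernel = kernel_region(probability_map(pixel_distances, peak))
```

A periphery step could then examine pixels adjacent to the kernel with a lower threshold; that part is omitted here because the abstract gives no detail on how closeness is measured.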
Abstract:
A system and a method are disclosed for clustering images of objects seen from different viewpoints. That is, given an unlabelled set of images of N objects, an unsupervised algorithm groups the images into N disjoint subsets such that each subset contains images of only a single object. The clustering method makes use of a broad geometric framework that exploits the interplay between the geometry of appearance manifolds and the symmetry of the 2D affine group.
Abstract:
Visual tracking over a sequence of images is formulated by defining an object class and one or more background classes. The most discriminant features available in the images are then used to select a portion of each image as belonging to the object class. Fisher's linear discriminant method is used to project high-dimensional image data onto a lower-dimensional space, e.g., a line, and perform classification in the lower-dimensional space. The projection function is incrementally updated.
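As a rough illustration of projecting onto a line and classifying there, the sketch below computes a two-class Fisher discriminant for 2-D feature vectors in batch form. It is an assumption-laden toy: the function names, the midpoint threshold, and the restriction to two dimensions are choices made for the example, and the incremental update of the projection mentioned in the abstract is not shown.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a list of 2-D points."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(2)]


def scatter(points, mu):
    """2x2 scatter matrix of the points around their mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for p in points:
        d = [p[0] - mu[0], p[1] - mu[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s


def fisher_direction(object_pts, background_pts):
    """Return w = S_w^{-1} (mu_object - mu_background) and a midpoint threshold."""
    mu_o, mu_b = mean(object_pts), mean(background_pts)
    s_o, s_b = scatter(object_pts, mu_o), scatter(background_pts, mu_b)
    sw = [[s_o[i][j] + s_b[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    if abs(det) < 1e-12:
        raise ValueError("within-class scatter matrix is singular")
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [mu_o[0] - mu_b[0], mu_o[1] - mu_b[1]]
    w = [dot(inv[0], diff), dot(inv[1], diff)]
    threshold = 0.5 * (dot(w, mu_o) + dot(w, mu_b))
    return w, threshold


def classify(x, w, threshold):
    """Project x onto the line spanned by w and compare with the threshold."""
    return "object" if dot(w, x) > threshold else "background"
```

With this choice of w, the object mean always projects above the background mean when the scatter matrix is positive definite, which is why a single threshold on the projected value is enough for the two-class case.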
Abstract:
A method for representing images for pattern classification extends the conventional Isomap method with Fisher Linear Discriminant (FLD) or Kernel Fisher Linear Discriminant (KFLD) for classification. The extended Isomap method estimates the geodesic distance of data points corresponding to images for pattern classification, and uses pairwise geodesic distances as feature vectors. The method applies FLD to the feature vectors to find an optimal projection direction to maximize the distances between cluster centers of the feature vectors. The method may apply KFLD to the feature vectors instead of FLD.
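The geodesic-distance step can be illustrated with a small sketch in the spirit of conventional Isomap: connect each point to its nearest neighbours, then take shortest-path lengths through that graph. The function name, the choice of k, and the use of Floyd-Warshall are assumptions for the example; row i of the returned matrix plays the role of the feature vector for point i, to which FLD or KFLD would then be applied (not shown).

```python
import math


def geodesic_distances(points, k=2):
    """Approximate geodesic distances via shortest paths on a k-nearest-neighbour graph."""
    n = len(points)
    dist = [[0.0 if i == j else math.inf for j in range(n)] for i in range(n)]

    # Connect every point to its k nearest neighbours (edges are made symmetric).
    for i in range(n):
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: math.dist(points[i], points[j]),
        )
        for j in others[:k]:
            d = math.dist(points[i], points[j])
            dist[i][j] = dist[j][i] = d

    # Floyd-Warshall: relax every pair through every intermediate node.
    for m in range(n):
        for i in range(n):
            for j in range(n):
                through_m = dist[i][m] + dist[m][j]
                if through_m < dist[i][j]:
                    dist[i][j] = through_m
    return dist


# Hypothetical data: five points along an L-shaped path.
path_points = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2)]
features = geodesic_distances(path_points, k=2)
```

For the two end points of the L-shaped path, the geodesic distance follows the bend and is therefore longer than the straight-line Euclidean distance, which is the property the geodesic representation is meant to capture.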
Abstract:
A face recognition system and method project an input face image and a set of reference face images from an input space to a high dimensional feature space in order to obtain more representative features of the face images. The Kernel Fisherfaces of the input face image and the reference face images are calculated, and are used to project the input face image and the reference face images to a face image space lower in dimension than both the input space and the high dimensional feature space. The input face image and the reference face images are represented as points in the face image space, and the distances between the input face point and each of the reference image points are used to determine whether or not the input face image resembles a particular face image among the reference face images.
Abstract:
A system and a method are disclosed for adaptive probabilistic tracking of an object within a motion video. The method utilizes a time-varying Eigenbasis together with dynamic, observation, and inference models. The Eigenbasis serves as a model of the target object. The dynamic model represents the motion of the object and defines possible locations of the target based upon previous locations. The observation model provides a measure of the distance of an observation of the object relative to the current Eigenbasis. The inference model predicts the most likely location of the object based upon past and present observations. The method is effective with or without training samples. A computer-based system provides a means for implementing the method. The effectiveness of the system and method is demonstrated through simulation.
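One plausible reading of the observation model is a reconstruction error: the norm of what remains of an observation after projecting it onto the subspace spanned by the Eigenbasis. The sketch below shows that idea only. The function names are hypothetical, the basis vectors are assumed to be orthonormal, and the time-varying update of the Eigenbasis and the dynamic and inference models are not shown.

```python
import math


def distance_to_eigenbasis(observation, mean, basis):
    """Norm of the residual after projecting onto an orthonormal basis.

    `basis` is a list of orthonormal vectors (an assumption of this sketch).
    """
    centred = [o - m for o, m in zip(observation, mean)]
    residual = list(centred)
    for u in basis:
        coeff = sum(c * ui for c, ui in zip(centred, u))
        residual = [r - coeff * ui for r, ui in zip(residual, u)]
    return math.sqrt(sum(r * r for r in residual))


def closest_candidate(candidates, mean, basis):
    """Index of the candidate observation nearest to the subspace."""
    return min(
        range(len(candidates)),
        key=lambda i: distance_to_eigenbasis(candidates[i], mean, basis),
    )
```

In a tracker, the candidates would be image patches sampled at the locations proposed by the dynamic model, and a small distance would indicate a patch that the current Eigenbasis explains well.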