摘要:
A robot includes a face extracting section for extracting features of a face included in an image captured by a CCD camera, and a face recognition section for recognizing the face based on a result of face extraction by the face extracting section. The face extracting section is implemented by Gabor filters that filter images using a plurality of filters that have orientation selectivity and that are associated with different frequency components. The face recognition section is implemented by a support vector machine that maps the result of face recognition to a non-linear space and that obtains a hyperplane that separates in that space to discriminate a face from a non-face. The robot is allowed to recognize a face of a user within a predetermined time under a dynamically changing environment.
摘要:
A robot apparatus (1) includes leg blocks (3A to 3D), head block (4), etc. as a moving part (16), a motion controller (102), learning unit (103), prediction unit (104) and a drive unit (105). When the moving part (106), any of the blocks, is operated from outside, the learning unit (103) learns a time-series signal generated due to the external operation. The motion controller (102) and drive unit (105) control together the moving part (106) based on a signal generated at the moving part (106) due to an external force applied to the robot apparatus (1) and a signal having already been learned by the learning unit (103) to make an action taught by the user. The prediction unit (105) predicts whether the moving part (106) makes the taught action according to the initial signal generated at the moving part (106) due to the applied external force. Thus, the robot apparatus (1) can learn an action taught by the user and determine an external force-caused signal to make the taught action.
摘要:
Provided is a learning device including: an acquisition section that acquires a plurality of image pairs in which the same subjects appear and a plurality of image pairs in which different subjects appear; a setting section that sets feature points on one image and the other image of each image pair; a selection section that selects a plurality of prescribed feature points, which are set at the same positions of the one image and the other image, so as to thereby select a feature extraction filter for each prescribed feature point; an extraction section that extracts the features of the prescribed feature points of each of the one image and the other image by using the plurality of feature extraction filters; a calculation section that calculates a correlation between the features; and a learning section that learns a same-subject classifier on the basis of the correlation and label information.
摘要:
A gesture input device includes an input unit to which image information representing an action is input. The gesture input device also includes a detection unit that detects the action based on a shape of the input image information. The gesture input device further includes a prediction unit that predicts one or more gestures based on a detection result of the action. In addition, the gesture input device includes a notification unit that notifies an action to be performed next to input the predicted one or more gestures.
摘要:
An information processing apparatus, which creates a tree structure used by a recognition apparatus which recognizes specific information using the tree structure, including a memory unit which stores data including the information to be recognized and data not including the information so as to correspond to a label showing whether or not the data includes the information, a recognition device which recognizes the information and outputs a high score value when the data including the information is input, and a grouping unit which performs grouping of the recognition devices using a score distribution obtained when the data is input into the recognition devices.
摘要:
Systems and methods for implementing a superpixel boosted top-down image recognition framework are provided. The framework utilizes superpixels comprising contiguous pixel regions sharing similar characteristics. Feature extraction methods described herein provide non-redundant image feature vectors for classification model building. The provided framework differentiates a digitized image into a plurality of superpixels. The digitized image is characterized through image feature extraction methods based on the plurality of superpixels. Image classification models are generated from the extracted image features and ground truth labels and may then be used to classify other digitized images.
摘要:
An image processing apparatus includes a detector, a setting unit, and an image generator. The detector detects a target object image region from a first image. When one or more predetermined parameters are applicable to a target object within the region detected by the detector, the setting unit sets the relevant target object image region as a first region. The image generator then generates a second image by applying predetermined processing to either the image portion within the first region, or to the image portions in a second region containing image portions within the first image that are not contained in the first region.
摘要:
A learning apparatus includes an image generator, a feature point extractor, a feature value calculator, and a classifier generator. The image generator generates, from an input image, images having differing scale coefficients. The feature point extractor extracts feature points from each image generated by the image generator. The feature value calculator calculates feature values for the feature points by filtering the feature points using a predetermined filter. The classifier generator generates one or more classifiers for detecting a predetermined target object from an image by means of statistical learning using the feature values.
摘要:
Systems and methods for implementing a multi-label image recognition framework for classifying digital images are provided. The provided multi-label image recognition framework utilizes an iterative, multiple analysis path approach to model training and image classification tasks. A first iteration of the multi-label image recognition framework generates confidence maps for each label, which are shared by the multiple analysis paths to update the confidence maps in subsequent iterations. The provided multi-label image recognition framework permits model training and image classification tasks to be performed more accurately than conventional single-label image recognition frameworks.
摘要:
An image processing apparatus includes: an image feature outputting unit that outputs each of image features in correspondence with a time of the frame; a foreground estimating unit that estimates a foreground image at a time s by executing a view transform as a geometric transform on a foreground view model and outputs an estimated foreground view; a background estimating unit that estimates a background image at the time s by executing a view transform as a geometric transform on a background view model and outputs an estimated background view; a synthesized view generating unit that generates a synthesized view by synthesizing the estimated foreground and background views; a foreground learning unit that learns the foreground view model based on an evaluation value; and a background learning unit that learns the background view model based on the evaluation value by updating the parameter of the foreground view model.