Abstract:
An image including a face is input (S201), a plurality of local features are detected from the input image, a region of a face in the image is specified using the plurality of detected local features (S202), and an expression of the face is determined on the basis of differences between the detection results of the local features in the region of the face and detection results which are calculated in advance as references for respective local features in the region of the face (S204).
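The determination step (S204) can be illustrated with a minimal sketch. It assumes that each local feature's detection result reduces to a single number, that the references come from a neutral face, and that an expression is chosen by the nearest stored difference pattern. The feature names, the template values, and the nearest-template rule are all hypothetical and are not taken from the abstract.

```python
import numpy as np

# Hypothetical local features; the abstract does not name them.
FEATURES = ["eye_corner", "mouth_corner", "brow"]

def classify_expression(detected, reference, templates):
    """Pick the expression whose expected change best matches the observed one.

    detected, reference: per-feature detection results (same order as FEATURES).
    templates: dict mapping an expression name to its expected difference vector.
    """
    # Difference between the current detection results and the references.
    diff = np.asarray(detected, dtype=float) - np.asarray(reference, dtype=float)
    # Nearest-template rule (an assumption made for this sketch).
    best = min(
        templates,
        key=lambda name: np.linalg.norm(diff - np.asarray(templates[name], dtype=float)),
    )
    return best, diff

reference = [0.0, 0.0, 0.0]  # references calculated in advance (assumed neutral face)
templates = {
    "neutral": [0.0, 0.0, 0.0],
    "smile": [0.2, 0.8, 0.0],
    "surprise": [0.5, 0.1, 0.9],
}
label, diff = classify_expression([0.1, 0.7, 0.05], reference, templates)
print(label)
```

Working on differences rather than raw detection results is what lets the same templates be reused for faces whose reference values differ.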
Abstract:
A plurality of pieces of learning data, each associated with the class to which it belongs, are input. For each piece of learning data, a statistical amount of the attribute values of the elements in each of k specific parts, k being equal to or larger than 1, is calculated. Each piece of learning data is mapped into a k-dimensional feature space as a vector having the k calculated statistical amounts as elements. Based on the mapped pieces of learning data and the classes to which they belong, parameters for classifying input data into one of the plurality of classes are learned in the k-dimensional feature space. Using these parameters, pattern classification can be performed at high speed and with high accuracy.
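As a rough illustration of this pipeline, the sketch below assumes that each piece of learning data is a 2-D array, that each part is a rectangle, that the statistical amount is the mean, and that the learned parameters are per-class centroids in the k-dimensional space. None of these choices comes from the abstract, and the function names are hypothetical.

```python
import numpy as np

def part_statistics(sample, parts):
    """Map one sample to a k-dimensional vector: one mean per rectangular part."""
    return np.array([sample[r0:r1, c0:c1].mean() for (r0, r1, c0, c1) in parts])

def learn_centroids(samples, labels, parts):
    """Learn one centroid per class in the k-dimensional feature space."""
    feats = np.stack([part_statistics(s, parts) for s in samples])
    labels = np.asarray(labels)
    return {int(c): feats[labels == c].mean(axis=0) for c in np.unique(labels)}

def classify(sample, centroids, parts):
    """Assign the class whose centroid is nearest to the sample's feature vector."""
    f = part_statistics(sample, parts)
    return min(centroids, key=lambda c: np.linalg.norm(f - centroids[c]))

def make(top, bottom):
    """Build a toy 4x4 sample with constant top and bottom halves."""
    a = np.empty((4, 4))
    a[:2] = top
    a[2:] = bottom
    return a

parts = [(0, 2, 0, 4), (2, 4, 0, 4)]  # k = 2: top half and bottom half
samples = [make(0.9, 0.1), make(0.8, 0.2), make(0.1, 0.9), make(0.2, 0.8)]
labels = [0, 0, 1, 1]

centroids = learn_centroids(samples, labels, parts)
pred = classify(make(0.7, 0.3), centroids, parts)
print(pred)
```

Because classification only needs k statistics per input, its cost depends on k rather than on the number of raw elements, which is one plausible reading of the speed claim.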
Abstract:
An image sensing apparatus according to an embodiment of the present invention includes an image input unit (100) for inputting an image, a detection unit (200) for detecting a state of movement of the image input unit (100) in an image input operation, a storage unit (310, 9000) for storing a plurality of images input by the image input unit (100) and movement information corresponding to the state of movement detected by the detection unit (200), and an image generating unit (320) for generating an image from an arbitrary viewpoint position on the basis of the plurality of images and the movement information stored in the storage unit.
Abstract:
The present invention relates to an information processing apparatus comprising: obtaining means (100) for obtaining an image of an object captured by image sensing means; face region detection means (110) for detecting a face region of the object from the image; eye region detection means (110) for detecting an eye region of the object; generation means (120) for generating a high-resolution image and a low-resolution image of the face region detected by the face region detection means; first extraction means (130) for extracting a first feature amount indicating a direction of a face existing in the face region from the low-resolution image; second extraction means (140) for extracting a second feature amount indicating a direction of an eye existing in the eye region from the high-resolution image; and estimation means (150) for estimating a gaze direction of the object from the first feature amount and the second feature amount.
Abstract:
An image-processing apparatus is provided that executes accurate facial expression recognition even for a subject whose facial expression is hard to recognize. A person's face region is extracted from an image input from an image input unit. A predetermined partial region that changes between a first state and a second state of the facial expression is extracted from the extracted face region. A facial expression evaluation value is calculated using an evaluation value calculation formula. When the calculated facial expression evaluation value exceeds a threshold value, it is determined that the facial expression is in the second state. If the difference between the maximum value and the minimum value of the calculated facial expression evaluation value within a predetermined time is smaller than a predetermined value, the evaluation value calculation formula or its parameter is changed to increase the difference.
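The adaptive behavior described above can be sketched as follows. The sketch assumes the evaluation value calculation formula is a simple product `gain * raw_change`, that the "predetermined time" is a fixed number of recent frames, and that the parameter change is a multiplicative increase of the gain. The class name and all default values are hypothetical.

```python
from collections import deque

class ExpressionEvaluator:
    """Threshold an evaluation value and adapt the formula when its range is too small."""

    def __init__(self, gain=1.0, threshold=0.5, min_range=0.2, window=5, gain_step=1.5):
        self.gain = gain                      # parameter of the (assumed) formula
        self.threshold = threshold            # second-state decision threshold
        self.min_range = min_range            # required max-min spread within the window
        self.gain_step = gain_step            # multiplicative gain increase
        self.history = deque(maxlen=window)   # evaluation values in the recent window

    def update(self, raw_change):
        # Assumed evaluation formula: a scaled measure of change in the partial region.
        value = self.gain * raw_change
        self.history.append(value)
        in_second_state = value > self.threshold
        # Once the window is full, check the spread of the evaluation values.
        if len(self.history) == self.history.maxlen:
            spread = max(self.history) - min(self.history)
            if spread < self.min_range:
                # Change the parameter so that later values vary more widely.
                self.gain *= self.gain_step
                self.history.clear()
        return value, in_second_state

ev = ExpressionEvaluator()
before = [ev.update(x)[1] for x in [0.30, 0.35, 0.32, 0.38, 0.40]]
value, after = ev.update(0.40)
print(before, ev.gain, after)
```

In this toy run a subject whose raw change never crosses the threshold is detected only after the gain has been raised, which mirrors the motivation given for hard-to-recognize subjects.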
Abstract:
A binocular camera is disclosed which can realize a panoramic view and a stereoscopic view during image sensing. Also disclosed is a binocular camera which has two image sensing optical systems, a circuit for synthesizing right and left sensed parallax image signals into a panoramic image or a three-dimensional image, and a display for displaying the synthesized image signal.
Abstract:
An information processing apparatus includes an image input unit (100) which inputs image data containing a face, a face position detection unit (101) which detects, from the image data, the position of a specific part of the face, and a facial expression recognition unit (102) which detects a feature point of the face from the image data on the basis of the detected position of the specific part and determines facial expression of the face on the basis of the detected feature point. The feature point is detected at a detection accuracy higher than detection of the position of the specific part. Detection of the position of the specific part is robust to a variation in the detection target.
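The two-stage idea, a robust but coarse position detection followed by a more accurate feature point detection near that position, can be illustrated with a toy coarse-to-fine search. In this sketch, block averaging stands in for the robust detector and a local maximum search stands in for the feature point detector. Both are placeholder assumptions rather than the detectors of the apparatus, and the function names are hypothetical.

```python
import numpy as np

def coarse_position(image, factor=4):
    """Coarse stage: find the strongest block in a block-averaged image."""
    h, w = image.shape
    cropped = image[: h - h % factor, : w - w % factor]
    blocks = cropped.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))
    br, bc = np.unravel_index(np.argmax(blocks), blocks.shape)
    # Return the block centre in full-resolution coordinates.
    return int(br) * factor + factor // 2, int(bc) * factor + factor // 2

def refine_position(image, center, radius=3):
    """Fine stage: search at full resolution within a window around the coarse position."""
    r, c = center
    r0, c0 = max(r - radius, 0), max(c - radius, 0)
    window = image[r0 : r + radius + 1, c0 : c + radius + 1]
    dr, dc = np.unravel_index(np.argmax(window), window.shape)
    return r0 + int(dr), c0 + int(dc)

# Toy response map with a single peak at (9, 6) and weaker neighbours.
img = np.zeros((16, 16))
img[9, 6] = 1.0
for rr, cc in [(8, 6), (10, 6), (9, 5), (9, 7)]:
    img[rr, cc] = 0.5

coarse = coarse_position(img)
fine = refine_position(img, coarse)
print(coarse, fine)
```

The coarse stage tolerates variation because it pools over a neighbourhood, while the fine stage recovers pixel-level accuracy within a small search region.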