Abstract:
The position of a body part area of a personal image in an input image is detected. The position of the origin of a coordinate system for an ornament image is defined according to the detected position of the body part area. Based on the position of the defined origin, the input image is output with the ornament image arranged in it. When the personal image moves in the input image, the ornament image follows that movement. Even when both the personal image and the ornament image move, the ornament image can be kept from interfering with the personal image, so the personal image is displayed clearly. Moreover, synchronizing the movement of the ornament image with the movement of the personal image makes the input image look more interesting.
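As a minimal sketch of how the ornament origin might follow the detected body part area, the Python below derives the origin from a bounding box. The box format `(x, y, w, h)`, the function names `ornament_origin` and `place_ornament`, and the default offset (one box height above the box center) are assumptions made for illustration; the abstract does not specify them.

```python
def ornament_origin(part_box, offset=(0.0, -1.0)):
    """Return the ornament coordinate origin for a detected body part box.

    part_box is (x, y, w, h) in image coordinates (y grows downward).
    offset is expressed in box widths and heights from the box center;
    the default (hypothetical) value places the origin above the box.
    """
    x, y, w, h = part_box
    center_x = x + w / 2
    center_y = y + h / 2
    return (center_x + offset[0] * w, center_y + offset[1] * h)


def place_ornament(part_box, ornament_points, offset=(0.0, -1.0)):
    """Translate ornament points from ornament coordinates into the image."""
    ox, oy = ornament_origin(part_box, offset)
    return [(ox + px, oy + py) for px, py in ornament_points]


# The same ornament in two frames: the box moves by (10, 5), and so does it.
frame1 = place_ornament((100, 100, 40, 40), [(0, 0), (5, 5)])
frame2 = place_ornament((110, 105, 40, 40), [(0, 0), (5, 5)])
```

Because the ornament is expressed relative to an origin tied to the detected area, keeping the offset outside the box is one simple way to keep the ornament from covering the person.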
Abstract:
A face detection device includes a face learning dictionary, which holds learned information for identification between a facial image and a non-facial image. An image input unit inputs a subject image. An edge image extraction unit extracts an edge image from the subject image. A partial image extraction unit, based on the edge image, extracts partial images that are candidates to contain facial images from the subject image. A face/non-face identification unit references the learning dictionary to identify whether or not each extracted partial image contains a facial image. Face detection of high precision, which reflects learned results, is performed.
Abstract:
An image processing method for detecting an object in an input image using a template image includes: inputting a specified image for each of the template image and the input image; calculating an edge normal direction vector of the specified image; generating an evaluation vector from the edge normal direction vector; subjecting the evaluation vector to an orthogonal transformation; performing a product-sum calculation on corresponding spectral data of the orthogonally transformed evaluation vectors obtained for the template image and the input image; and subjecting the result to an inverse orthogonal transformation to generate a similarity value map. The formula of the similarity value, the orthogonal transformation, and the inverse orthogonal transformation each have linearity. In this pattern recognition, the components of the similarity value do not undergo positive/negative reversal when the brightness of the background varies.
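The transform, multiply, inverse-transform chain can be illustrated with a small Python sketch. It makes several assumptions not stated in the abstract: the signal is one-dimensional rather than a 2D image, the orthogonal transformation is a plain discrete Fourier transform, and the evaluation vector is the double-angle form exp(2iθ) of the edge normal angle θ, chosen here because it is unchanged when θ flips by π (as happens when background brightness reverses the edge polarity). All function names are hypothetical.

```python
import cmath
import math


def evaluation_vector(theta):
    """Map an edge normal angle to a unit complex number; None means no edge."""
    return 0j if theta is None else cmath.exp(2j * theta)


def dft(signal, inverse=False):
    """Naive O(n^2) discrete Fourier transform (forward or inverse)."""
    n = len(signal)
    sign = 1 if inverse else -1
    out = [sum(x * cmath.exp(sign * 2j * math.pi * k * i / n)
               for i, x in enumerate(signal)) for k in range(n)]
    return [v / n for v in out] if inverse else out


def similarity_map(template_angles, input_angles):
    """Circular cross-correlation of evaluation vectors via the DFT."""
    t_spec = dft([evaluation_vector(a) for a in template_angles])
    i_spec = dft([evaluation_vector(a) for a in input_angles])
    # Product-sum step: multiply corresponding spectral components.
    product = [t.conjugate() * i for t, i in zip(t_spec, i_spec)]
    return [v.real for v in dft(product, inverse=True)]


template = [0.3, 1.2, 2.0, None, None, None, None, None]
scene = [None, None, None, 0.3, 1.2, 2.0, None, None]  # template shifted by 3
flipped = [None if a is None else a + math.pi for a in scene]

sim = similarity_map(template, scene)
sim_flipped = similarity_map(template, flipped)
best_shift = max(range(len(sim)), key=sim.__getitem__)
```

In this toy case the similarity map peaks at the shift where the template sits in the scene, and reversing every edge normal leaves the map unchanged.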
Abstract:
The present media editing device makes it easy to generate media that include messages on a communication terminal such as a mobile terminal. A moving image data storage part stores moving image data recorded by a user. A region extraction part extracts any region including the user from the moving image data. A front determination part detects whether or not the user in the extracted region is facing the front. A sound detection part detects the presence or absence of a sound signal of a predetermined level or higher. A frame selection part determines starting and ending frames based on the results output from the front determination part and the sound detection part. An editing part clips out the media based on the starting and ending frames thus determined and performs, for example, an image conversion process. A transmission data storage part stores the edited media as transmission data.
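One way to picture the frame selection part is the Python sketch below. The abstract does not say how the two detection results are combined, so the rule used here (start at the first frame where the user faces the front while sound is present, end at the last such frame) is an assumption, as are the function name `select_frames` and the per-frame boolean lists.

```python
def select_frames(facing_front, sound_present):
    """Return (start, end) frame indices, or None if no frame qualifies.

    facing_front[i] and sound_present[i] are the per-frame results of the
    front determination and sound detection steps (hypothetical inputs).
    """
    qualifying = [i for i, (front, sound)
                  in enumerate(zip(facing_front, sound_present))
                  if front and sound]
    if not qualifying:
        return None
    return qualifying[0], qualifying[-1]


front = [False, True, True, True, True, False]
sound = [False, False, True, True, False, False]
clip = select_frames(front, sound)  # frames 2 through 3 qualify
```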
Abstract:
A hand gesture recognizing device is provided which can correctly recognize hand gestures at high speed without requiring users to be equipped with tools. A gesture of a user is stereoscopically filmed by a photographing device 1 and then stored in an image storage device 2. A feature image extracting device 3 transforms colors of the stereoscopic image data read from the image storage device 2 in accordance with color transformation tables created by a color transformation table creating device 13, separates the feature image of the user into corresponding channels, and outputs it. A spatial position calculating device 4 calculates spatial positions of feature parts of the user by utilizing parallax of the feature image output from the feature image extracting device 3. A region dividing device 5 defines the space around the user with spatial region codes. A hand gesture detecting device 6 detects how the hands of the user move in relation to the spatial region codes. A category is detected first on the basis of the detected hand gesture, and then a sign language word in that category is specified.
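The use of parallax to obtain a spatial position can be sketched with the standard pinhole stereo relation Z = f·B/d. The Python below assumes a rectified camera pair with known focal length in pixels, baseline in meters, and principal point; none of these parameters, nor the function name `spatial_position`, come from the abstract.

```python
def spatial_position(x_left, x_right, y, focal_px, baseline_m, cx, cy):
    """Triangulate a feature seen at x_left and x_right on the same image row.

    Assumes rectified pinhole cameras (hypothetical setup). Returns
    (X, Y, Z) in meters in the left camera frame.
    """
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = focal_px * baseline_m / disparity
    x = (x_left - cx) * z / focal_px
    y_m = (y - cy) * z / focal_px
    return (x, y_m, z)


# A hand feature 50 px right of center in the left view, 25 px in the right.
pos = spatial_position(370, 345, 240, focal_px=500, baseline_m=0.1, cx=320, cy=240)
```

With these example numbers the disparity is 25 pixels, which places the feature about 2 m from the cameras and 0.2 m to the right of the optical axis.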
Abstract:
A 3D shape measurement method and a device using the method eliminate harmful influences of periodic inconstancy in the phase shift method. Optical intensity patterns that follow sinusoidal periodic functions are projected onto an object while their phases are shifted. The 3D shape of the object is measured from the image picked up from the object. In this method, a plurality of optical intensity patterns following periodic functions with different wavelengths are projected onto the object so as not to interfere with each other. The least common multiple of the wavelengths of the periodic functions is larger than the extent having periodic inconstancy within the image pickup area.
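The role of the least common multiple can be illustrated with a short Python sketch. It assumes a four-step phase shift (shifts of 0, π/2, π, 3π/2), integer wavelengths of 5 and 7 units, and a brute-force search for the position consistent with both wrapped phases; the abstract fixes none of these, and the function names are hypothetical.

```python
import math


def fringe_intensities(x, wavelength, offset=0.5, amplitude=0.4):
    """Simulate the four phase-shifted sinusoidal intensities seen at x."""
    phase = 2 * math.pi * x / wavelength
    return [offset + amplitude * math.cos(phase + k * math.pi / 2)
            for k in range(4)]


def wrapped_phase(i0, i1, i2, i3):
    """Four-step phase shift formula; the result lies in [0, 2*pi)."""
    return math.atan2(i3 - i1, i0 - i2) % (2 * math.pi)


def absolute_position(phase_a, phase_b, wl_a, wl_b):
    """Resolve the position within lcm(wl_a, wl_b) from two wrapped phases."""
    pos_a = phase_a / (2 * math.pi) * wl_a  # position modulo wl_a
    pos_b = phase_b / (2 * math.pi) * wl_b  # position modulo wl_b
    period = wl_a * wl_b // math.gcd(wl_a, wl_b)
    best_x, best_err = None, None
    for k in range(period // wl_a):
        x = pos_a + k * wl_a
        err = (x - pos_b) % wl_b
        err = min(err, wl_b - err)  # circular distance to pos_b
        if best_err is None or err < best_err:
            best_x, best_err = x, err
    return best_x


true_x = 23.4
phase_5 = wrapped_phase(*fringe_intensities(true_x, 5))
phase_7 = wrapped_phase(*fringe_intensities(true_x, 7))
recovered = absolute_position(phase_5, phase_7, 5, 7)
```

Each wavelength alone only fixes the position modulo 5 or modulo 7, but together they fix it anywhere within 35 units, which is why the least common multiple has to exceed the ambiguous extent.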
Abstract:
In a broadly applicable face image extraction device and method that define a face by position and size at high speed in target images of various types, an edge extraction part 1 extracts an edge part from a target image and generates an edge image. A template storage part 2 stores in advance a template composed of a plurality of concentric shapes of different sizes. A voting result storage part 3 has a voting storage region for each size of the concentric shapes of the template, so as to store the results of the voting processing carried out by a voting part 4. The voting part 4 carries out the voting processing using the template at each pixel in the edge image, and stores the result in the corresponding voting storage region. After the voting processing, an analysis part 5 performs cluster evaluation based on the voting results stored in the voting storage regions, and then defines the face in the target image by position and size.
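The voting step can be sketched in Python as follows. The sketch assumes the concentric shapes are circles, keeps one vote plane per radius, and replaces the cluster evaluation of the analysis part with a simple search for the single strongest cell; these simplifications and all names are illustrative, not taken from the abstract.

```python
import math


def vote(edge_pixels, width, height, radii, tolerance=0.5):
    """Each edge pixel votes for every cell lying on a circle around it."""
    votes = {r: [[0] * width for _ in range(height)] for r in radii}
    for ex, ey in edge_pixels:
        for r in radii:
            reach = int(r + tolerance) + 1
            for cy in range(max(0, ey - reach), min(height, ey + reach + 1)):
                for cx in range(max(0, ex - reach), min(width, ex + reach + 1)):
                    if abs(math.hypot(cx - ex, cy - ey) - r) < tolerance:
                        votes[r][cy][cx] += 1
    return votes


def strongest_candidate(votes):
    """Return (center_x, center_y, radius) of the cell with the most votes."""
    return max((plane[cy][cx], cx, cy, r)
               for r, plane in votes.items()
               for cy in range(len(plane))
               for cx in range(len(plane[0])))[1:]


# Synthetic edge image: a rasterized ring of radius 5 centered at (10, 10).
ring = [(x, y) for y in range(21) for x in range(21)
        if abs(math.hypot(x - 10, y - 10) - 5) < 0.5]
face = strongest_candidate(vote(ring, 21, 21, [3, 5, 7]))
```

Keeping a separate vote plane per radius is what lets the peak report size as well as position.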
Abstract:
An object of the present invention is to provide a device and a method for recognizing hand shape and position even if a hand image to be provided for recognition is rather complicated in shape, and a recording medium having a program for carrying out the method recorded thereon. A hand image normalization part 11 deletes the wrist region from each of a plurality of images that vary in hand shape and position, then normalizes the images in hand orientation and size to generate hand shape images. An eigenspace calculation part 13 calculates eigenvalues and eigenvectors from the hand shape images by an analysis based on an eigenspace method. An eigenspace projection part 15 calculates eigenspace projection coordinates by projecting the hand shape images onto an eigenspace having the eigenvectors as a basis. A hand image normalization part 21 deletes the wrist region from an input hand image, and generates an input hand shape image by normalizing the input hand image in the same way as the hand shape images. An eigenspace projection part 22 calculates eigenspace projection coordinates for the input hand shape image by projecting it onto the eigenspace having the eigenvectors as the basis. A hand shape image selection part 23 compares the eigenspace projection coordinates calculated for the input hand shape image with each of the eigenspace projection coordinates calculated for the hand shape images, and then determines which of the hand shape images is closest to the input hand shape image. A shape/position output part 24 outputs shape information and position information on the determined hand shape image.
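The projection and comparison steps can be sketched in plain Python. The sketch assumes images are already normalized and flattened into short vectors, obtains eigenvectors of the sample covariance by power iteration with deflation, and uses squared Euclidean distance between projection coordinates; these choices and the function names are illustrative assumptions, not the method as claimed.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def top_eigenvectors(cov, k, iters=200):
    """Leading k eigenvectors of a symmetric matrix by power iteration.

    Assumes k does not exceed the rank of cov (otherwise the norm is zero).
    """
    d = len(cov)
    cov = [row[:] for row in cov]
    basis = []
    for _ in range(k):
        v = [float(i + 1) for i in range(d)]  # fixed, arbitrary start vector
        for _ in range(iters):
            w = [dot(row, v) for row in cov]
            norm = math.sqrt(dot(w, w))
            v = [x / norm for x in w]
        lam = dot(v, [dot(row, v) for row in cov])
        basis.append(v)
        for i in range(d):  # deflate: remove the component just found
            for j in range(d):
                cov[i][j] -= lam * v[i] * v[j]
    return basis


def build_eigenspace(samples, k):
    """Return the mean, the eigenvector basis, and each sample's coordinates."""
    n, d = len(samples), len(samples[0])
    mean = [sum(s[i] for s in samples) / n for i in range(d)]
    centered = [[x - m for x, m in zip(s, mean)] for s in samples]
    cov = [[sum(c[i] * c[j] for c in centered) / n for j in range(d)]
           for i in range(d)]
    basis = top_eigenvectors(cov, k)
    coords = [[dot(c, b) for b in basis] for c in centered]
    return mean, basis, coords


def closest_shape(image, mean, basis, coords):
    """Index of the stored hand shape nearest to the input in eigenspace."""
    centered = [x - m for x, m in zip(image, mean)]
    p = [dot(centered, b) for b in basis]
    dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in coords]
    return dists.index(min(dists))


shapes = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 1]]  # toy "hand shape images"
mean, basis, coords = build_eigenspace(shapes, k=2)
match = closest_shape([0.9, 0.1, 0.0, 0.1], mean, basis, coords)
```

Comparing a few projection coordinates instead of whole images is what keeps the matching step cheap when the stored set of hand shapes is large.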
Abstract:
A human tracking device according to the present invention stably tracks a human with good perception of the distance to the human and with high resistance against disturbance. A camera image is divided into a human region and a background region. Then, for each area of the image, it is judged whether or not the human region can be divided into a plurality of blob models corresponding to parts of a human. The parts of a human are preferably the head, trunk and legs. When the result of the judgment is “YES”, a plurality of human blob models are produced based on the human region. When the result of the judgment is “NO”, a single human blob model is produced based on the human region. The human is then tracked based on these human blob models.
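The YES/NO branch can be pictured with the Python sketch below. The abstract does not give the judgment criterion, so the rule used here (the region is divisible when its bounding box is at least `min_height` pixels tall) is purely an assumption, as are the head/trunk/legs height proportions and the function name `build_blob_models`.

```python
def build_blob_models(region, min_height=60, proportions=(0.15, 0.40, 0.45)):
    """Return a list of (name, (x, y, w, h)) blob models for a human region.

    region is the bounding box (x, y, w, h) of the segmented human region.
    The height threshold and the part proportions are hypothetical values.
    """
    x, y, w, h = region
    if h < min_height:
        # Judgment "NO": too small to split, so use a single blob model.
        return [("human", (x, y, w, h))]
    # Judgment "YES": stack head, trunk and legs blobs from top to bottom.
    blobs = []
    top = y
    for name, fraction in zip(("head", "trunk", "legs"), proportions):
        part_height = h * fraction
        blobs.append((name, (x, top, w, part_height)))
        top += part_height
    return blobs


far_person = build_blob_models((0, 0, 20, 40))
near_person = build_blob_models((0, 0, 40, 100))
```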
Abstract:
Two pictures of a subject, obtained through objective lenses from different viewpoints, are each rotated 90 degrees clockwise by dove prisms and then merged into a single picture by total reflection mirrors. The merged picture is reduced at a predetermined ratio by a condenser lens, and then projected onto a pickup plane of a CCD. It is therefore possible to obtain stereoscopic pictures having parallax as a single picture using a single camera without narrowing the effective fields of view of the right and left pictures of different viewpoints.