Abstract:
A 3D shape measurement method, and a device using the method, eliminate the harmful influence of periodic inconstancy in the phase shift method. Optical intensity patterns following sinusoidal periodic functions are projected onto an object while their phases are shifted, and the 3D shape of the object is measured from the image picked up from the object. In this method, a plurality of optical intensity patterns following periodic functions with different wavelengths are projected onto the object so as not to interfere with each other. The least common multiple of the wavelengths of the periodic functions is larger than the extent having periodic inconstancy within the image pickup area.
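The two ideas in this abstract, recovering a phase from shifted sinusoidal patterns and checking the least common multiple of the wavelengths, can be illustrated with a minimal sketch. It assumes integer wavelengths in pixels and N equally spaced phase shifts; the function names `wrapped_phase`, `unambiguous_range`, and `covers_field` are hypothetical and do not come from the patent.

```python
import math

def wrapped_phase(intensities):
    """Recover the wrapped phase phi in [0, 2*pi) from N >= 3 samples.

    Assumes sample k was taken under the pattern A + B*cos(phi + 2*pi*k/N).
    """
    n = len(intensities)
    s = sum(i * math.sin(2 * math.pi * k / n) for k, i in enumerate(intensities))
    c = sum(i * math.cos(2 * math.pi * k / n) for k, i in enumerate(intensities))
    # The sums reduce to -B*N/2*sin(phi) and B*N/2*cos(phi).
    return math.atan2(-s, c) % (2 * math.pi)

def unambiguous_range(wavelengths):
    """Least common multiple of integer wavelengths (in pixels)."""
    result = 1
    for w in wavelengths:
        result = result * w // math.gcd(result, w)
    return result

def covers_field(wavelengths, extent):
    """True if the combined period is larger than the extent to be resolved."""
    return unambiguous_range(wavelengths) > extent
```

For example, wavelengths of 16 and 18 pixels repeat jointly only every 144 pixels, so under these assumptions they would disambiguate an extent of up to 143 pixels.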
Abstract:
A hand gesture recognizing device is provided which can correctly recognize hand gestures at high speed without requiring users to be equipped with tools. A gesture of a user is stereoscopically filmed by a photographing device 1 and then stored in an image storage device 2. A feature image extracting device 3 transforms colors of the stereoscopic image data read from the image storage device 2 in accordance with color transformation tables created by a color transformation table creating device 13, and separates and outputs the feature image of the user in corresponding channels. A spatial position calculating device 4 calculates spatial positions of feature parts of the user by utilizing parallax of the feature image outputted from the feature image extracting device 3. A region dividing device 5 defines the space around the user with spatial region codes. A hand gesture detecting device 6 detects how the hands of the user move in relation to the spatial region codes. A category is detected first on the basis of the detected hand gesture, and then a sign language word in that category is specified.
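The abstract does not say how parallax is converted to a spatial position. One common way, shown here only as a sketch, is triangulation for a rectified pinhole stereo pair. The function name `triangulate` and all parameters (focal length in pixels, baseline in metres, principal point `cx`, `cy`) are assumptions for illustration, not details from the patent.

```python
def triangulate(x_left, x_right, y, focal_px, baseline_m, cx, cy):
    """Return (X, Y, Z) in metres for a feature seen in both images.

    Assumes rectified cameras, so the feature lies on the same row y in
    both images and the parallax is purely horizontal.
    """
    disparity = x_left - x_right
    if disparity <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    z = focal_px * baseline_m / disparity   # depth from parallax
    x = (x_left - cx) * z / focal_px        # back-project the column
    y_out = (y - cy) * z / focal_px         # back-project the row
    return (x, y_out, z)
```

With a 500-pixel focal length and a 0.1 m baseline, a parallax of 50 pixels corresponds to a depth of about 1 m.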
Abstract:
An image processing method detects an object in an input image using a template image. For each of the template image and the input image, the method inputs a specified image, calculates an edge normal direction vector of the specified image, generates an evaluation vector from the edge normal direction vector, and subjects the evaluation vector to orthogonal transformation. The method then performs a product sum calculation of corresponding spectral data for the orthogonally transformed evaluation vectors obtained for the template image and the input image, and subjects the result to inverse orthogonal transformation to generate a similarity value map. The formula of the similarity value, the orthogonal transformation, and the inverse orthogonal transformation each have linearity. The pattern recognition is one in which the component of the similarity value is not subjected to positive/negative reversal through variations in brightness of the background.
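The transform, multiply, inverse-transform pipeline can be sketched with NumPy, using the 2-D FFT as the orthogonal transformation. This is an illustrative assumption, not the patented formula: the evaluation vector here is simply the unit gradient direction stored as a complex number, and the names `evaluation_vectors` and `similarity_map` are hypothetical.

```python
import numpy as np

def evaluation_vectors(image, eps=1e-9):
    """Unit edge-normal direction per pixel, encoded as gx + 1j*gy."""
    gy, gx = np.gradient(image.astype(float))
    mag = np.hypot(gx, gy)
    vec = gx + 1j * gy
    # Pixels with no gradient contribute a zero vector.
    return np.where(mag > eps, vec / np.maximum(mag, eps), 0)

def similarity_map(template, image):
    """Correlate evaluation vectors in the frequency domain.

    Entry (r, c) is the sum of dot products between template vectors and
    image vectors when the template's top-left corner sits at (r, c),
    with circular wrap-around at the image borders.
    """
    t = evaluation_vectors(template)
    v = evaluation_vectors(image)
    t_padded = np.zeros_like(v)
    t_padded[: t.shape[0], : t.shape[1]] = t
    # Product sum of corresponding spectral data, then inverse transform.
    spectrum = np.conj(np.fft.fft2(t_padded)) * np.fft.fft2(v)
    return np.real(np.fft.ifft2(spectrum))
```

Because the FFT and the product sum are linear, the template spectrum can be computed once and reused across input images. The peak of the returned map marks the most likely template position.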
Abstract:
The present media editing device generates media including messages in an easy manner in a communication terminal such as a mobile terminal. Therein, a moving image data storage part stores moving image data recorded by a user. A region extraction part extracts any region including the user from the moving image data. A front determination part detects whether or not the user in the extracted region is facing the front. A sound detection part detects the presence or absence of a sound signal of a predetermined level or higher. A frame selection part determines starting and ending frames based on the results outputted from the front determination part and the sound detection part. An editing part performs, for example, an image conversion process by clipping out the media based on thus determined starting and ending frames. A transmission data storage part stores the resultantly edited media as transmission data.
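The abstract leaves the exact frame selection rule open. As a minimal sketch, assume the clip starts at the first frame where the user faces the front while sound is present, and ends at the last such frame. The function name `select_clip` and the per-frame boolean inputs are hypothetical.

```python
def select_clip(facing_front, sound_present):
    """Return (start, end) frame indices, inclusive, or None if no frame qualifies.

    facing_front[i] and sound_present[i] are the per-frame outputs of the
    front determination and sound detection steps.
    """
    both = [f and s for f, s in zip(facing_front, sound_present)]
    if True not in both:
        return None
    start = both.index(True)
    end = len(both) - 1 - both[::-1].index(True)
    return (start, end)
```

A real device would likely add tolerance for short dropouts in either signal; this sketch ignores that.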
Abstract:
A human tracking device according to the present invention stably tracks a human with good perception of the distance to the human and with high resistance against disturbance. A camera image is divided into a human region and a background region. It is then judged whether or not the human region can be divided into a plurality of blob models corresponding to parts of a human. The parts of a human are preferably the head, trunk and legs. When the result of the judgment is "YES", a plurality of human blob models are produced based on the human region. When the result of the judgment is "NO", a single human blob model is produced based on the human region. The human is then tracked based on these human blob models. In this way, the human can be stably tracked with good perception of the distance to the human and with high resistance against disturbance.
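The YES/NO branch can be sketched as follows. The judgment criterion (a minimum region height in pixels) and the split proportions for head, trunk and legs are assumptions made for illustration; the abstract does not specify them. `build_blob_models` and `centroid` are hypothetical names.

```python
def centroid(pixels):
    """Mean (x, y) of a non-empty list of pixel coordinates."""
    n = len(pixels)
    return (sum(x for x, _ in pixels) / n, sum(y for _, y in pixels) / n)

def build_blob_models(region, min_height=30):
    """Return one blob per body part if the region is tall enough, else one blob.

    region is a list of (x, y) pixels belonging to the human region,
    with y increasing downwards.
    """
    ys = [y for _, y in region]
    top = min(ys)
    height = max(ys) - top + 1
    if height < min_height:
        # Judgment "NO": the human is too small (far away) to split.
        return [{"part": "body", "centroid": centroid(region), "size": len(region)}]
    # Judgment "YES": split by assumed proportions (head 20%, trunk to 55%).
    head_end = top + height // 5
    trunk_end = top + (height * 11) // 20
    parts = {"head": [], "trunk": [], "legs": []}
    for x, y in region:
        if y < head_end:
            parts["head"].append((x, y))
        elif y < trunk_end:
            parts["trunk"].append((x, y))
        else:
            parts["legs"].append((x, y))
    return [
        {"part": name, "centroid": centroid(px), "size": len(px)}
        for name, px in parts.items()
        if px
    ]
```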
Abstract:
Two pictures of a subject, obtained through objective lenses from different viewpoints, are each rotated 90 degrees clockwise by Dove prisms and then merged into a single picture by total reflection mirrors. The merged picture is reduced at a predetermined ratio by a condenser lens and then projected onto the pickup plane of a CCD. It is therefore possible to obtain stereoscopic pictures having parallax as a single picture using a single camera, without narrowing the effective fields of view of the right and left pictures of different viewpoints.
Abstract:
A face detection device includes a face learning dictionary, which holds learned information for identification between a facial image and a non-facial image. An image input unit inputs a subject image. An edge image extraction unit extracts an edge image from the subject image. A partial image extraction unit, based on the edge image, extracts partial images that are candidates to contain facial images from the subject image. A face/non-face identification unit references the learning dictionary to identify whether or not each extracted partial image contains a facial image. Face detection of high precision, which reflects learned results, is performed.
Abstract:
In a broadly-applicable face image extraction device and method for defining a face by position and size in target images varied in type for face image extraction at high speed, an edge extraction part 1 extracts an edge part from a target image and generates an edge image. A template storage part 2 previously stores a template composed of a plurality of concentric shapes varied in size. A voting result storage part 3 has voting storage regions for each size of the concentric shapes of the template so as to store the result obtained by voting processing carried out by a voting part 4. The voting part 4 carries out the voting processing utilizing the template at each pixel in the edge image, and stores the result obtained thereby in the corresponding voting storage region. After the voting processing, an analysis part 5 performs cluster evaluation based on the voting results stored in the voting storage regions, and then defines the face in the target image by position and size.
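The voting step resembles a Hough-style accumulation and can be sketched in pure Python. Here the template is assumed to be a set of concentric circles, and the final analysis simply takes the highest-voted cell, which is a simplification swapped in for the cluster evaluation the abstract mentions. The names `circle_offsets`, `vote`, and `best_face` are hypothetical.

```python
import math
from collections import Counter

def circle_offsets(radius, samples=72):
    """Integer (dx, dy) offsets approximating a circle of the given radius."""
    return sorted({
        (round(radius * math.cos(2 * math.pi * k / samples)),
         round(radius * math.sin(2 * math.pi * k / samples)))
        for k in range(samples)
    })

def vote(edge_pixels, radii):
    """One voting storage region per template size.

    Centering a circle of radius r on each edge pixel adds one vote to
    every cell on that circle, so a true center collects many votes.
    """
    votes = {r: Counter() for r in radii}
    for r in radii:
        offsets = circle_offsets(r)
        for x, y in edge_pixels:
            for dx, dy in offsets:
                votes[r][(x + dx, y + dy)] += 1
    return votes

def best_face(votes):
    """Return (center, radius) of the single highest-voted cell."""
    count, radius, center = max(
        (count, r, center)
        for r, cells in votes.items()
        for center, count in cells.items()
    )
    return center, radius
```

Keeping a separate accumulator per radius is what lets the result report size as well as position.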
Abstract:
An object of the present invention is to provide a method of segmenting hand gestures which, when recognizing the hand gestures, automatically segments the hand gestures to be detected into words, or into apprehensible units structured by a plurality of words, without the user indicating where to segment. Transition feature data is stored in advance; it describes a feature of a transition gesture that is not observed during a gesture representing a word but is observed when transiting from one gesture to another. Thereafter, a motion of the image corresponding to the part of the body in which the transition gesture is observed is detected (step S106), the detected motion is compared with the transition feature data (step S107), and a time position where the transition gesture is observed is determined so as to segment the hand gestures (step S108).
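Steps S106 to S108 can be sketched under a strong simplifying assumption: the transition feature is taken to be a pause, meaning the detected motion magnitude stays below a threshold for a minimum number of frames. The abstract does not specify the actual transition features, and the names `find_transitions` and `segment` are hypothetical.

```python
def find_transitions(motion, threshold=0.1, min_frames=3):
    """Return frame indices (run midpoints) where a pause matches the stored feature.

    motion[i] is the detected motion magnitude of the observed body part
    in frame i. threshold and min_frames stand in for the transition
    feature data.
    """
    boundaries = []
    run_start = None
    # A sentinel closes any low-motion run that reaches the last frame.
    for i, m in enumerate(list(motion) + [float("inf")]):
        if m < threshold:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_frames:
                boundaries.append((run_start + i - 1) // 2)
            run_start = None
    return boundaries

def segment(n_frames, boundaries):
    """Split frames [0, n_frames) into half-open (start, end) units at the boundaries."""
    cuts = [0] + list(boundaries) + [n_frames]
    return [(a, b) for a, b in zip(cuts, cuts[1:]) if b > a]
```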