摘要:
Disclosed herein is an apparatus and method for extracting correspondences between aerial images. The apparatus includes a line extraction unit, a line direction determination unit, a building top area extraction unit, and a correspondence extraction unit. The line extraction unit extracts lines corresponding buildings from aerial images. The line direction determination unit defines the directions of the lines as x, y and z axis directions based on a two-dimensional (2D) coordinate system. The building top area extraction unit rotates lines in the x and y axis directions so that the lines are arranged in parallel with the horizontal and vertical directions of the 2D image, and then extracts building top areas from rectangles. The correspondence extraction unit extracts correspondences between the aerial images by comparing the locations of the building top areas extracted from the aerial images.
摘要:
A method for reconstructing a 3D shape model of an object by using multi-view image information, includes: inputting multi-view images obtained by photographing the object from multiple viewpoints in a voxel space, and extracting silhouette information and color information of the multi-view images; reconstructing visual hulls by silhouette intersection using the silhouette information; and approximating polygons of cross-sections of the visual hulls to a natural geometric shape of the object by using the color information. Further, the method includes expressing a 3D geometric shape of the object by connecting the approximated polygons to create a mesh structure; extracting color textures of a surface of the object by projecting meshes of the mesh structure to the multi-view image; and creating a 3D shape model by modeling natural shape information and surface color information of the object.
摘要:
Disclosed are an apparatus and a method for automatic control of multiple cameras capable of supporting an effective camera view angle in a broadcast, a movie, etc. The automatic control apparatus of multiple cameras includes: a first main camera; a first camera driver controlling an operation of the first main camera; a second main camera; a second camera driver controlling an operation of the second main camera; at least one auxiliary camera; at least one third camera driver controlling an operation of the at least one auxiliary camera; and an interoperation processor changing a view angle of the at least one auxiliary camera by controlling the at least one third camera driver in accordance with a view angle changing reference changed by changing the view angle of the first main camera, the second main camera, or the first and second main cameras.
摘要:
Provided are a portable multi-view image acquisition system and a multi-view image preprocessing method. The portable multi-view image acquisition system may include: a portable studio including a plurality of cameras movable up, down, left and right; and a preprocessor performing a preprocessing including a subject separation from a multi-view image that is photographed by the plurality of cameras.
摘要:
Disclosed herein is a 3D urban modeling apparatus and method. The 3D urban modeling apparatus includes a calibration unit for calibrating data about a translation and a rotation of at least one capturing device at a time that input aerial images and terrestrial images were captured. A building model generation unit generates at least one 3D building model based on the aerial images and the terrestrial images to which results of the calibration have been applied. A terrain model generation unit generates a 3D terrain model by converting an input digital elevation model into a 3D mesh. A texture extraction unit extracts textures related to the building model and the terrain model from the aerial images and the terrestrial images. A model matching unit generates a 3D urban model by matching the building model with the terrain model, which are based on the textures, with each other.
摘要:
A multimedia application system uses metadata for sensory devices. The system includes: a sensory-device engine for generating a sensory device command (SDC) for controlling the sensory devices based on sensory effect information (SEI) generated to represent sensory effects by using the sensory devices depending on video contents, user preference information (UPI) of the sensory devices and device capability information (DCI) indicative of reproducing capability of the sensory devices; and a sensory-device controller for controlling sensory devices to perform sensory effect reproduction in response to the generated SDC.
摘要:
Provided is an apparatus and method for determining stereo disparity based on two-path dynamic programming and GGCP. The apparatus includes a pre-processing unit for analyzing texture distribution of an input image by using a Laplacian of Gaussian (LOG) filter and dividing the input image into a homogeneous region and a non-homogeneous region; a local matching unit for determining candidate disparities to be included in an each pixel of all pixels; a local post-processing unit for removing candidate disparities in a pixel of low reliability by performing a visibility test betweens candidate disparities in each pixel to improve the reliability of the candidate disparity; and a global optimizing unit for determining a final disparity for candidate disparities in an each pixel by performing a dynamic programming.
摘要:
A head motion tracking method for three-dimensional facial model animation, the head motion tracking method includes acquiring initial facial motion to be fit to an image of a three-dimensional model from an image inputted by a video camera; creating a silhouette of the three-dimensional model and projecting the silhouette; matching the silhouette created from the three-dimensional model with a silhouette acquired by a statistical feature point tracking scheme; and obtaining a motion parameter for the image of the three-dimensional model through motion correction using a texture to perform three-dimensional head motion tracking. In accordance with the present invention, natural three-dimensional facial model animation based on a real image acquired with a video camera can be performed automatically, thereby reducing time and cost.
摘要:
Provided is a method for generating a three-dimensional (3D) mesh based on unorganized sparse 3D points to generate a mesh model that displays a 3D surface by using unorganized sparse 3D points extracted from a plurality of two-dimensional image. The 3D mesh generating method based on unorganized sparse 3D points includes the steps of: receiving a plurality of unorganized sparse 3D points, a plurality of two-dimensional (2D) corresponding point information, and images; generating an initial mesh by using the received 2D corresponding information; removing an abnormal face from the initial mesh; checking if unused 2D corresponding point information exists among the received 2D corresponding point information; if unused 2D corresponding point information exists, reorganizing the initial mesh by performing a constrained Delaunay triangulation; and if unused 2D corresponding point information does not exist in the result of the confirmation, generating a final mesh.
摘要:
Provided are system and method for generating a 3D facial model and animation using one video camera. The system includes a pre-processing part, a facial model generating part, a transferring part, a projecting part, an error calculating part, and a mesh transforming part. The pre-processing part sets correspondence relations with other meshes, generates an average 3D facial model, and generates a geometrical model and a texture dispersion model. The facial model generating part projects the average 3D facial onto an expressionless facial image frame that stares a front side to generate a performer's 3D facial model. The transferring part transfers a 3D facial model template having an animation-controlled model to the performer's 3D facial model to generate the performer's 3D facial model. The projecting part projects the performer's 3D facial model onto a facial animation video frame including a facial expression. The error calculating part calculates an error projected by the projecting part. The mesh transforming part moves or rotationally converts a joint in such a direction as to minimize the error.