Abstract:
Disclosed herein is an apparatus and method for estimating the joint structure of a human body. The apparatus includes a multi-view image acquisition unit for receiving multi-view images acquired by capturing a human body. A human body foreground separation unit extracts a foreground region corresponding to the human body from the acquired multi-view images. A human body shape restoration unit restores voxels indicating geometric space occupation information of the human body using the foreground region corresponding to the human body, thus generating voxel-based three-dimensional (3D) shape information of the human body. A skeleton information extraction unit generates 3D skeleton information from the generated voxel-based 3D shape information of the human body. A skeletal structure estimation unit estimates positions of respective joints from a skeletal structure of the human body using both the generated 3D skeleton information and anthropometric information.