Low-cost 3D motion capture system using passive optical markers and monocular vision

Abstract This paper presents a low-cost 3D motion capture system using a single camera and passive optical markers. 3D motion capture systems usually demand multiple cameras. Thus, those systems require high-cost hardware devices and large computing power. However, such high-cost systems prevent personal or mobile systems using 3D motion capturing. To overcome this problem, we propose a low-cost motion capture system. The structure of our system is based on a single camera and passive optical markers, which enables the system to be low-cost. As the components of the proposed systems, we introduce 2D optical marker recognition, marker size recognition, and mapping for 3D data extraction. To show the effectiveness and efficiency, we conduct preliminary experiments in terms of accuracy and time-complexity. Experiment results indicate that the proposed system provides 3D motion capturing using a single camera enough to utilize it in 3D interaction.

[1]  Youngmo Han 2D-to-3D Visual Human Motion Converting System for Home Optical Motion Capture Tool and 3-D Smart TV , 2015, IEEE Systems Journal.

[2]  Sang Hwa Lee,et al.  Interactive e-learning system using pattern recognition and augmented reality , 2009, IEEE Transactions on Consumer Electronics.

[3]  Alberto Rovetta,et al.  Depth image-based hand tracking in complex scene , 2015 .

[4]  Hoon Yoo Axially moving a lenslet array for high-resolution 3D images in computational integral imaging. , 2013, Optics express.

[5]  Myungjin Cho,et al.  Improved depth extraction method of 3D objects using computational integral imaging reconstruction based on multiple windowing techniques , 2015 .

[6]  Andrea Sanna,et al.  A 6-DOF ARTag-based tracking system , 2010, IEEE Transactions on Consumer Electronics.

[7]  Taku Komura,et al.  A Virtual Reality Dance Training System Using Motion Capture Technology , 2011, IEEE Transactions on Learning Technologies.

[8]  Darius Burschka,et al.  Advances in Computational Stereo , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Hoon Yoo,et al.  QR-code finder recognition using four directional scanning method , 2012 .

[10]  Nicholas Ayache,et al.  Artificial vision for mobile robots - stereo vision and multisensory perception , 1991 .

[11]  Xu Chen,et al.  A Robot Indoor Position and Orientation Method based on 2D Barcode Landmark , 2011, J. Comput..

[12]  Qiong Liu,et al.  Linking Digital Media to Physical Documents: Comparing Content- and Marker-Based Tags , 2010, IEEE Pervasive Computing.

[13]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  B. Bethke,et al.  Real-time indoor autonomous vehicle test environment , 2008, IEEE Control Systems.

[15]  Jian Su,et al.  A positioning method for the feature points of a target board image adopting singular value decomposition , 2013 .

[16]  Mark Fiala,et al.  Designing Highly Reliable Fiducial Markers , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Kun Li,et al.  Markerless Shape and Motion Capture From Multiview Video Sequences , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Heesung Jun,et al.  Vision-based location positioning using augmented reality for indoor navigation , 2008, IEEE Transactions on Consumer Electronics.

[20]  Hoon Yoo Depth extraction for 3D objects via windowing technique in computational integral imaging with a lenslet array , 2013 .

[21]  Ankit Chaudhary,et al.  Robust gesture recognition using Kinect: A comparison between DTW and HMM , 2015 .