Efficacy comparison of clustering systems for limb detection

This paper presents a comparison of applying different clustering algorithms on a point cloud constructed from the depth maps captured by a RGBD camera such as Microsoft Kinect. The depth sensor is capable of returning images, where each pixel represents the distance to its corresponding point not the RGB data. This is considered as the real novelty of the RGBD camera in computer vision compared to the common video-based and stereo-based products. Depth sensors captures depth data without using markers, 2D to 3D-transition or determining feature points. The captured depth map then cluster the 3D depth points into different clusters to determine the different limbs of the human-body. The 3D points clustering is achieved by different clustering techniques. Our Experiments show good performance and results in using clustering to determine different human-body limbs.

[1]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[2]  Günther Greiner,et al.  Automatic reconstruction of personalized avatars from 3D face scans , 2011, Comput. Animat. Virtual Worlds.

[3]  Hans-Peter Seidel,et al.  A data-driven approach for real-time full body pose reconstruction from a depth camera , 2011, Vision.

[4]  Kin Fun Li,et al.  A Web-Based Sign Language Translator Using 3D Video Processing , 2011, 2011 14th International Conference on Network-Based Information Systems.

[5]  Peter J. Rousseeuw,et al.  Clustering by means of medoids , 1987 .

[6]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[7]  Saeid Nahavandi,et al.  Measuring depth accuracy in RGBD cameras , 2013, 2013, 7th International Conference on Signal Processing and Communication Systems (ICSPCS).

[8]  Lei Wei,et al.  Low cost multimodal facial recognition via kinect sensors , 2012 .

[9]  Ruigang Yang,et al.  Accurate 3D pose estimation from a single depth image , 2011, 2011 International Conference on Computer Vision.

[10]  Nan Jiang,et al.  Unsupervised human skeleton extraction from Kinect depth images , 2012, ICIMCS '12.

[11]  Marjorie Skubic,et al.  Evaluation of an inexpensive depth camera for passive in-home fall risk assessment , 2011, 2011 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth) and Workshops.

[12]  Ricardo Gutierrez-Osuna,et al.  Web GIS in practice X: a Microsoft Kinect natural user interface for Google Earth navigation , 2011, International journal of health geographics.

[13]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[14]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[15]  Andrew W. Fitzgibbon,et al.  Efficient regression of general-activity human poses from depth images , 2011, 2011 International Conference on Computer Vision.

[16]  Tilak Dutta,et al.  Evaluation of the Kinect™ sensor for 3-D kinematic measurement in the workplace. , 2012, Applied ergonomics.

[17]  Xuan Song,et al.  Unsupervised skeleton extraction and motion capture from 3D deformable matching , 2013, Neurocomputing.

[18]  Charles Elkan,et al.  Using the Triangle Inequality to Accelerate k-Means , 2003, ICML.

[19]  Andrew D. Wilson Using a depth camera as a touch sensor , 2010, ITS '10.

[20]  Gonzalo López-Abente,et al.  Lung cancer risk and pollution in an industrial region of Northern Spain: a hospital-based case-control study , 2011, International journal of health geographics.

[21]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[22]  Young-Il Kim,et al.  Methods for generating TLPs (typical load profiles) for smart grid-based energy programs , 2011, 2011 IEEE Symposium on Computational Intelligence Applications In Smart Grid (CIASG).

[23]  Saeid Nahavandi,et al.  Kinect crowd interaction , 2012 .

[24]  Didier Stricker,et al.  3D shape scanning with a Kinect , 2011, SIGGRAPH '11.

[25]  Saeid Nahavandi,et al.  Real Time Ergonomic Assessment for Assembly Operations Using Kinect , 2013, 2013 UKSim 15th International Conference on Computer Modelling and Simulation.

[26]  Dilip Kumar Pratihar,et al.  A Comparative Study of Fuzzy C-Means Algorithm and Entropy-Based Fuzzy Clustering Algorithms , 2011, Comput. Informatics.

[27]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.