RFSOM - Extending Self-Organizing Feature Maps with Adaptive Metrics to Combine Spatial and Textural Features for Body Pose Estimation

In this work we propose an online approach to compute a more precise assignment between parts of an upper human body model to RGBD image data. For this, a Self-Organizing Map (SOM) will be computed using a set of features where each feature is weighted by a relevance factor (RFSOM). These factors are computed using the generalized matrix learning vector quantization (GMLVQ) and allow to scale the input dimensions according to their relevance. With this scaling it is possible to distinguish between the different body parts of the upper body model. This method leads to a more precise positioning of the SOM in the 2.5D point cloud, a more stable behavior of the single neurons in their specific body region, and hence, to a more reliable pose model for further computation. The algorithm was evaluated on different data sets and compared to a Self-Organizing Map trained with the spatial dimensions only using the same data sets.

[1]  Thomas Martinetz,et al.  Self-Organizing Maps for Pose Estimation with a Time-of-Flight Camera , 2009, Dyn3D.

[2]  Reinhard Koch,et al.  Dynamic 3D Imaging, DAGM 2009 Workshop, Dyn3D 2009, Jena, Germany, September 9, 2009. Proceedings , 2009, Dyn3D.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[5]  Michael Biehl,et al.  Adaptive Relevance Matrices in Learning Vector Quantization , 2009, Neural Computation.

[6]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[7]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[9]  N. Otsu A threshold selection method from gray level histograms , 1979 .