Human detection in MOUT scenarios using covariance descriptors and supervised manifold learning

Military Operations in Urban Terrain (MOUT) require the capability to perceive and to analyse the situation around a patrol in order to recognize potential threats. As in MOUT scenarios threats usually arise from humans one important task is the robust detection of humans. Detection of humans in MOUT by image processing systems can be very challenging, e.g., due to complex outdoor scenes where humans have a weak contrast against the background or are partially occluded. Porikli et al. introduced covariance descriptors and showed their usefulness for human detection in complex scenes. However, these descriptors do not lie on a vector space and so well-known machine learning techniques need to be adapted to train covariance descriptor classifiers. We present a novel approach based on manifold learning that simplifies the classification of covariance descriptors. In this paper, we apply this approach for detecting humans. We describe our human detection method and evaluate the detector on benchmark data sets generated from real-world image sequences captured during MOUT exercises.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Dariu Gavrila,et al.  Multi-cue Pedestrian Detection and Tracking from a Moving Vehicle , 2007, International Journal of Computer Vision.

[3]  W. Förstner,et al.  A Metric for Covariance Matrices , 2003 .

[4]  W. Krüger,et al.  Image exploitation for MISAR , 2007, SPIE Defense + Commercial Sensing.

[5]  Yvonne Hübner,et al.  Assessment and optimization of methods for tracking people in riot control scenarios , 2008, Security + Defence.

[6]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[8]  Q. Shi,et al.  Gaussian Process Latent Variable Models for , 2011 .

[9]  Jian Zhang,et al.  An Experimental Evaluation of Local Features for Pedestrian Classification , 2007, 9th Biennial Conference of the Australian Pattern Recognition Society on Digital Image Computing Techniques and Applications (DICTA 2007).

[10]  Lawrence K. Saul,et al.  Think Globally, Fit Locally: Unsupervised Learning of Low Dimensional Manifold , 2003, J. Mach. Learn. Res..

[11]  H. Karcher Riemannian center of mass and mollifier smoothing , 1977 .

[12]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[13]  Francesco Camastra,et al.  Machine Learning for Audio, Image and Video Analysis - Theory and Applications , 2007, Advanced Information and Knowledge Processing.

[14]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[15]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Fan R. K. Chung,et al.  Lectures on Spectral Graph Theory , 2001 .

[17]  Lawrence Cayton,et al.  Algorithms for manifold learning , 2005 .

[18]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[19]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[20]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[21]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[22]  Fatih Murat Porikli,et al.  Region Covariance: A Fast Descriptor for Detection and Classification , 2006, ECCV.

[23]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[24]  Xavier Pennec,et al.  Intrinsic Statistics on Riemannian Manifolds: Basic Tools for Geometric Measurements , 2006, Journal of Mathematical Imaging and Vision.

[25]  Aarnout Brombacher,et al.  Probability... , 2009, Qual. Reliab. Eng. Int..

[26]  Akira Utsumi,et al.  Human detection using geometrical pixel value structures , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[27]  P. Meer,et al.  Covariance Tracking using Model Update Based on Means on Riemannian Manifolds , 2005 .

[28]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[29]  A. Elgammal,et al.  Inferring 3D body pose from silhouettes using activity manifold learning , 2004, CVPR 2004.

[30]  L. Skovgaard A Riemannian geometry of the multivariate normal model , 1984 .

[31]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[32]  Fatih Murat Porikli,et al.  Fast Construction of Covariance Matrices for Arbitrary Size Image Windows , 2006, 2006 International Conference on Image Processing.

[33]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[34]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[35]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[36]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[37]  Wolfgang Krüger Robust and efficient map-to-image registration with line segments , 2001, Machine Vision and Applications.

[38]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[39]  Xavier Pennec,et al.  A Riemannian Framework for Tensor Computing , 2005, International Journal of Computer Vision.