People and Mobile Robot Classification Through Spatio-Temporal Analysis of Optical Flow

The goal of this work is to distinguish between humans and robots in a mixed human-robot environment. We analyze the spatio-temporal patterns of optical flow-based features along several frames. We consider the Histogram of Optical Flow (HOF) and the Motion Boundary Histogram (MBH) features, which have shown good results on people detection. The spatio-temporal patterns are composed of groups of feature components that have similar values on previous frames. The groups of features are fed into the FuzzyBoost algorithm, which at each round selects the spatio-temporal pattern (i.e. feature set) having the lowest classification error. The search for patterns is guided by grouping feature dimensions, considering three algorithms: (a) similarity of weights from dimensionality reduction matrices, (b) Boost Feature Subset Selection (BFSS) and (c) Sequential Floating Feature Selection (SFSS), which avoid the brute force approach. The similarity weights are computed by the Multiple Metric Learning for large Margin Nearest Neighbor (MMLMNN), a linear dimensionality algorithm that provides a type of Mahalanobis metric Weinberger and Saul, J. MaCh. Learn. Res.10 (2009) 207–244. The experiments show that FuzzyBoost brings good generalization properties, better than the GentleBoost, the Support Vector Machines (SVM) with linear kernels and SVM with Radial Basis Function (RBF) kernels. The classifier was implemented and tested in a real-time, multi-camera dynamic setting.

[1]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Trevor Hastie,et al.  Additive Logistic Regression : a Statistical , 1998 .

[3]  Terrance E. Boult,et al.  Into the woods: visual surveillance of noncooperative and camouflaged targets in complex outdoor settings , 2001, Proc. IEEE.

[4]  Günther Palm,et al.  Biomimetic Neural Learning for Intelligent Robots - Intelligent Systems, Cognitive Robotics, and Neuroscience , 2005, Biomimetic Neural Learning for Intelligent Robots.

[5]  Shai Avidan SpatialBoost: Adding Spatial Reasoning to AdaBoost , 2006, ECCV.

[6]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[7]  Tieniu Tan,et al.  Silhouette Analysis-Based Gait Recognition for Human Identification , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Tomaso A. Poggio,et al.  Pedestrian detection using wavelet templates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Martin Drauschke,et al.  Feature Subset Selection with Adaboost and ADTboost , 2008 .

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[12]  Bernt Schiele,et al.  Multi-cue onboard pedestrian detection , 2009, CVPR.

[13]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[14]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[15]  Hongzhe Li,et al.  Group additive regression models for genomic data analysis. , 2008, Biostatistics.

[16]  Joseph J. Lim,et al.  Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[18]  Plinio Moreno,et al.  Feature Set Search Space for FuzzyBoost Learning , 2011, IbPRIA.

[19]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[20]  Aidong Zhang,et al.  Boost Feature Subset Selection: A New Gene Selection Algorithm for Microarray Dataset , 2006, International Conference on Computational Science.

[21]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[22]  Bernt Schiele,et al.  New features and insights for pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Yiannis Aloimonos,et al.  A Roadmap to the Integration of Early Visual Modules , 2007, International Journal of Computer Vision.

[24]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[25]  TanTieniu,et al.  Silhouette Analysis-Based Gait Recognition for Human Identification , 2003 .

[26]  Plinio Moreno,et al.  Boosting with Temporal Consistent Learners: An Application to Human Activity Recognition , 2007, ISVC.

[27]  Yuan Li,et al.  Learning sparse features in granular space for multi-view face detection , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[28]  Alexandre Bernardino,et al.  An Improved Labelling for the INRIA Person Data Set for Pedestrian Detection , 2013, IbPRIA.

[29]  Wolfgang Förstner,et al.  Comparison of Adaboost and ADTboost for Feature Subset Selection , 2018, PRIS.

[30]  Jin Hyeong Park,et al.  Scale-Space Based Weak Regressors for Boosting , 2007, ECML.

[31]  Martin A. Riedmiller,et al.  Appearance-Based Robot Discrimination Using Eigenimages , 2006, RoboCup.

[32]  J. Andrade-Cetto,et al.  Ubiquitous networking robotics in urban settings , 2006 .

[33]  David A. Forsyth,et al.  Tracking People by Learning Their Appearance , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Antonio Torralba,et al.  Sharing Visual Features for Multiclass and Multiview Object Detection , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Günther Palm,et al.  Visual Robot Detection in RoboCup Using Neural Networks , 2004, RoboCup.

[36]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[37]  Michael T. Manry,et al.  Feature Selection Using a Piecewise Linear Network , 2006, IEEE Transactions on Neural Networks.

[38]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[39]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[40]  Mubarak Shah,et al.  TemporalBoost for event recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[41]  Alexandre Bernardino,et al.  Optical Flow Based Detection in Mixed Human Robot Environments , 2009, ISVC.

[42]  Luis Enrique Sucar,et al.  Human silhouette recognition with Fourier descriptors , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[43]  J. Santos-Victor,et al.  Introducing fuzzy decision stumps in boosting through the notion of neighbourhood , 2012 .

[44]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.