Notes on Expected Computational Cost of Classifiers Cascade: A Geometric View

A cascade of classifiers, working within a detection procedure, extracts and uses different number of features depending on the window under analysis. Windows with background regions can be typically recognized as negative with just a few features, whereas windows with target objects (or resembling them) might require thousands of features. The central point of attention for this paper is a quantity that describes the average computational cost of an operating cascade, namely — the expected value of the number of features the cascade uses. This quantity can be calculated explicitly knowing the probability distribution underlying the data and the properties of a particular cascade (detection and false alarm rates of its stages), or it can be accurately estimated knowing just the latter. We show three purely geometric examples that demonstrate how training a cascade with sensitivity / FAR constraints imposed per each stage can lead to non-optimality in terms of the computational cost. We do not propose a particular algorithm to overcome the pitfalls of stage-wise training, instead, we sketch an intuition showing that non-greedy approaches can improve the resulting cascades.

[1]  João Gama,et al.  Cascade Generalization , 2000, Machine Learning.

[2]  Oscar Deniz,et al.  Sample Selection for Training Cascade Detectors , 2015, PloS one.

[3]  Jianguo Li,et al.  Learning SURF Cascade for Fast and Accurate Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Jonathan Brandt,et al.  Robust object detection via soft cascade , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7]  Franklin C. Crow,et al.  Summed-area tables for texture mapping , 1984, SIGGRAPH.

[8]  Nuno Vasconcelos,et al.  Boosting algorithms for detector cascade learning , 2014, J. Mach. Learn. Res..

[9]  Anton van den Hengel,et al.  Training Effective Node Classifiers for Cascade Classification , 2013, International Journal of Computer Vision.

[10]  Tat-Jen Cham,et al.  Fast training and selection of Haar features using statistics in boosting-based face detection , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[12]  Y. Freund,et al.  Discussion of the Paper \additive Logistic Regression: a Statistical View of Boosting" By , 2000 .

[13]  Anton van den Hengel,et al.  Optimally Training a Cascade Classifier , 2010, ArXiv.

[14]  Tomaso A. Poggio,et al.  A general framework for object detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).