Deconstructing Binary Classifiers in Computer Vision

This paper further develops the novel notion of deconstructive learning and proposes a practical model for deconstructing a broad class of binary classifiers commonly used in vision applications. Specifically, the problem studied in this paper is: Given an image-based binary classifier \({\mathbf {C}}\) as a black-box oracle, how much can we learn of its internal working by simply querying it? To formulate and answer this question computationally, we propose a novel framework that explicitly identifies and delineates the computer vision and machine learning components, and we propose an effective deconstruction algorithm for deconstructing binary classifiers with the typical two-component design that employ support vector machine or cascade of linear classifiers as their internal feature classifiers. The deconstruction algorithm simultaneously searches over a collection of candidate feature spaces by probing the spaces for the decision boundaries, using the labels provided by the given classifier. In particular, we demonstrate that it is possible to ascertain the type of kernel function used by the classifier and the number of support vectors (and the subspace spanned by the support vectors) using only image queries and ascertain the unknown feature space too. Furthermore, again using only simple image queries, we are able to completely deconstruct OpenCV’s pedestrian detector, ascertain the exact feature used, the type of classifier employed and recover the (almost) exact linear classifier.

[1]  Christopher Meek,et al.  Adversarial learning , 2005, KDD '05.

[2]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[3]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Jeffrey Ho,et al.  Deconstructing Kernel Machines , 2014, ECML/PKDD.

[5]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[7]  Jack Poulson,et al.  Scientific computing , 2013, XRDS.

[8]  John Langford,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[9]  Antonio Torralba,et al.  HOGgles: Visualizing Object Detection Features , 2013, 2013 IEEE International Conference on Computer Vision.

[10]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[12]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[13]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[14]  Luc Van Gool,et al.  Pedestrian detection at 100 frames per second , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[16]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Guillermo Sapiro,et al.  Discriminative learned dictionaries for local image analysis , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Sanjoy Dasgupta,et al.  Analysis of a greedy active learning strategy , 2004, NIPS.

[19]  Maria-Florina Balcan,et al.  Margin Based Active Learning , 2007, COLT.

[20]  Fei-FeiLi,et al.  One-Shot Learning of Object Categories , 2006 .

[21]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.