KernelBoost: Supervised Learning of Image Features For Classification

We propose a fully-supervised approach to training classifiers that automatically learn features directly from image data. This drops the dependency on hand-designed filters and features, which is generally a trial-and-error process and often yields far-from-optimal results. Our approach relies on the Gradient Boosting framework, learning discriminative features at each stage in the form of convolutional filters. It depends on just few easy-to-tune parameters, it is simple and general, and we show it outperforms state-of-the-art methods in tasks ranging from pixel classification in very different types of images to object detection.

[1]  Qiang Ji,et al.  Learning discriminant features for multi-view face and eye detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[4]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Rainer Lienhart,et al.  An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[6]  Qiang Wu,et al.  Object Detection Based on Co-occurrence GMuLBP Features , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[7]  Max W. K. Law,et al.  Three Dimensional Curvilinear Structure Detection Using Optimally Oriented Flux , 2008, ECCV.

[8]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Graham W. Taylor,et al.  Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Guillermo Sapiro,et al.  Non-local sparse models for image restoration , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  Abdesselam Bouzerdoum,et al.  Adaptive hierarchical architecture for visual recognition. , 2010, Applied optics.

[12]  Nuno Vasconcelos,et al.  Learning Optimal Embedded Cascades , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, CVPR.

[14]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Y-Lan Boureau,et al.  Learning Convolutional Feature Hierarchies for Visual Recognition , 2010, NIPS.

[17]  Hongyuan Zha,et al.  A General Boosting Method and its Application to Learning Ranking Functions for Web Search , 2007, NIPS.

[18]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[19]  Zhuowen Tu,et al.  Auto-Context and Its Application to High-Level Vision Tasks and 3D Brain Image Segmentation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  David G. Lowe,et al.  Multiclass Object Recognition with Sparse, Localized Features , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[22]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[23]  Eduardo A. Hoff,et al.  Grain size measurement by image analysis: An application in the ceramic and in the metallic industries , 2005 .

[24]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[25]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Geoffrey E. Hinton Learning to represent visual input , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[27]  Vincent Lepetit,et al.  Accurate and Efficient Linear Structure Segmentation by Leveraging Ad Hoc Features with Learned Filters , 2012, MICCAI.

[28]  Shimon Ullman,et al.  The chains model for detecting parts by their context , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[30]  Luca Maria Gambardella,et al.  Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images , 2012, NIPS.

[31]  Yair Weiss,et al.  Learning object detection from a small number of examples: the importance of good features , 2004, CVPR 2004.

[32]  Honglak Lee,et al.  Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.