论文信息 - KernelBoost: Supervised Learning of Image Features For Classification

KernelBoost: Supervised Learning of Image Features For Classification

We propose a fully-supervised approach to training classifiers that automatically learn features directly from image data. This drops the dependency on hand-designed filters and features, which is generally a trial-and-error process and often yields far-from-optimal results. Our approach relies on the Gradient Boosting framework, learning discriminative features at each stage in the form of convolutional filters. It depends on just few easy-to-tune parameters, it is simple and general, and we show it outperforms state-of-the-art methods in tasks ranging from pixel classification in very different types of images to object detection.

Vincent Lepetit | Pascal Fua | Carlos Becker | Roberto Rigamonti

[1] Qiang Ji,et al. Learning discriminant features for multi-view face and eye detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2] Mei-Chen Yeh,et al. Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3] Rich Caruana,et al. An empirical comparison of supervised learning algorithms , 2006, ICML.

[4] Marc'Aurelio Ranzato,et al. Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5] Rainer Lienhart,et al. An extended set of Haar-like features for rapid object detection , 2002, Proceedings. International Conference on Image Processing.

[6] Qiang Wu,et al. Object Detection Based on Co-occurrence GMuLBP Features , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[7] Max W. K. Law,et al. Three Dimensional Curvilinear Structure Detection Using Optimally Oriented Flux , 2008, ECCV.

[8] Christoph H. Lampert,et al. Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Graham W. Taylor,et al. Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10] Guillermo Sapiro,et al. Non-local sparse models for image restoration , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11] Abdesselam Bouzerdoum,et al. Adaptive hierarchical architecture for visual recognition. , 2010, Applied optics.

[12] Nuno Vasconcelos,et al. Learning Optimal Embedded Cascades , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Juergen Gall,et al. Class-specific Hough forests for object detection , 2009, CVPR.

[14] Matti Pietikäinen,et al. Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16] Y-Lan Boureau,et al. Learning Convolutional Feature Hierarchies for Visual Recognition , 2010, NIPS.

[17] Hongyuan Zha,et al. A General Boosting Method and its Application to Learning Ranking Functions for Web Search , 2007, NIPS.

[18] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[19] Zhuowen Tu,et al. Auto-Context and Its Application to High-Level Vision Tasks and 3D Brain Image Segmentation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] David G. Lowe,et al. Multiclass Object Recognition with Sparse, Localized Features , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[22] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[23] Eduardo A. Hoff,et al. Grain size measurement by image analysis: An application in the ceramic and in the metallic industries , 2005 .

[24] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[25] Dan Roth,et al. Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Geoffrey E. Hinton. Learning to represent visual input , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[27] Vincent Lepetit,et al. Accurate and Efficient Linear Structure Segmentation by Leveraging Ad Hoc Features with Learned Filters , 2012, MICCAI.

[28] Shimon Ullman,et al. The chains model for detecting parts by their context , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .

[30] Luca Maria Gambardella,et al. Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images , 2012, NIPS.

[31] Yair Weiss,et al. Learning object detection from a small number of examples: the importance of good features , 2004, CVPR 2004.

[32] Honglak Lee,et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.