An approach for combining multiple descriptors for image classification

Efficient image descriptors have recently shown promise for image classification, and methods that combine multiple image features outperform methods based on a single feature. This work presents a simple and efficient approach for combining multiple image descriptors. We first employ a Naive-Bayes Nearest-Neighbor (NBNN) scheme to evaluate four widely used descriptors. For every descriptor, “Image-to-Class” distances are computed directly, without descriptor quantization. Because distances produced by different metrics can differ in nature and in numerical scale, a normalization step is needed to map them into a common domain before they are combined. Experiments on a challenging database indicate that z-score normalization followed by a simple sum-of-distances fusion significantly improves performance over using any individual feature. On the Caltech 101 dataset, our results also outperform previously reported results.
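
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the usual NBNN form of the Image-to-Class distance (the sum, over a query image's local descriptors, of the squared Euclidean distance to the nearest descriptor in the class), and it assumes that the z-score statistics are computed over the class distances of a single query image, which the abstract does not specify. The function names, descriptor names, class names, and distance values are all hypothetical.

```python
from statistics import mean, pstdev


def image_to_class_distance(query_descriptors, class_descriptors):
    """Sum, over the query's local descriptors, of the squared Euclidean
    distance to the nearest descriptor of the class (no quantization)."""
    total = 0.0
    for q in query_descriptors:
        total += min(
            sum((a - b) ** 2 for a, b in zip(q, c)) for c in class_descriptors
        )
    return total


def z_score(values):
    """Rescale a list of distances to zero mean and unit standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All distances are equal, so this descriptor carries no preference.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def fuse_and_classify(distances, classes):
    """Normalize each descriptor's distances, sum them per class, and
    return the class with the smallest fused distance.

    distances: dict mapping a descriptor name to a list of Image-to-Class
    distances, ordered like `classes`.
    """
    fused = [0.0] * len(classes)
    for per_class in distances.values():
        for i, z in enumerate(z_score(per_class)):
            fused[i] += z
    best = min(range(len(classes)), key=lambda i: fused[i])
    return classes[best], fused


# Hypothetical Image-to-Class distances for one query image. The two
# descriptors live on very different numerical scales.
classes = ["airplane", "face", "leopard"]
distances = {
    "descriptor_a": [0.82, 0.35, 0.91],
    "descriptor_b": [1200.0, 1100.0, 1000.0],
}

label, fused = fuse_and_classify(distances, classes)

# For comparison: summing the raw distances lets the large-scale
# descriptor dominate the decision.
raw = [sum(d[i] for d in distances.values()) for i in range(len(classes))]
raw_label = classes[min(range(len(classes)), key=lambda i: raw[i])]

print(label, raw_label)
```

With these toy numbers the normalized fusion selects "face", the class that descriptor_a strongly prefers, whereas the raw sum selects "leopard" because descriptor_b's larger scale swamps descriptor_a. This is the scale mismatch that the normalization step is meant to remove.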
