Performance evaluation of new colour histogram-based interest point detectors

Interest point detection is an active area in computer vision due to its importance in many applications. Measuring the pixel-wise difference between image pixel intensities is the mechanism of most detectors that have been proposed in literature. Recently, interest point detectors were proposed that incorporated the histogram representation instead of image pixel intensity. In this paper, research that extends histogram-based interest point detectors is introduced. Four colour-space representations were used to construct new detectors: HSV, Opponent, Transformed and Ohta colour spaces. Several experiments were performed to evaluate the new colour histogram-based detectors and compare them with previous detectors. First, the proposed detectors were evaluated in an image-matching task. Then, we studied and evaluated the performance of some of the local image descriptors that were extracted from the interest points and regions detected by the proposed detectors. Finally, the four top-ranked descriptors in the descriptor evaluation experiments were used to evaluate the new colour histogram-based detectors in an image-classification task using different object and scene image datasets. The experimental results demonstrate that our new detectors possess an increased ability to distinguish and more robust in regards to image matching, particularly with respect to textured scene images that involve transformations, such as illumination, viewpoint and blur changes. Furthermore, the descriptor performance may change depending on the detector and data set type. The image-classification results demonstrate that the proposed detectors exhibit higher classification accuracy for certain descriptors and data sets than the other detectors.

[1]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[2]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[3]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[5]  Vassilis Tsagaris,et al.  Multispectral image fusion method using perceptual attributes , 2004, SPIE Remote Sensing.

[6]  Joost van de Weijer,et al.  Boosting color saliency in image feature detection , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Cordelia Schmid,et al.  Indexing Based on Scale Invariant Interest Points , 2001, ICCV.

[8]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[9]  Aly A. Farag,et al.  CSIFT: A SIFT Descriptor with Color Invariant Characteristics , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[10]  Rachid Benmokhtar Robust human action recognition scheme based on high-level feature fusion , 2012, Multimedia Tools and Applications.

[11]  Dennis Koelma,et al.  The MediaMill TRECVID 2008 Semantic Video Search Engine , 2008, TRECVID.

[12]  Neil A. Thacker,et al.  Robust Recognition of Scaled Shapes using Pairwise Geometric Histograms , 1995, BMVC.

[13]  Liang Lin,et al.  Layered Graph Matching with Composite Cluster Sampling , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[15]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[16]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[18]  Farzin Mokhtarian,et al.  A Theory of Multiscale, Curvature-Based Shape Representation for Planar Curves , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Tony Lindeberg,et al.  Feature Detection with Automatic Scale Selection , 1998, International Journal of Computer Vision.

[20]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Hwann-Tzong Chen,et al.  Histogram-based interest point detectors , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[23]  Jiri Matas,et al.  Object Recognition using the Invariant Pixel-Set Signature , 2000, BMVC.

[24]  Thomas G. Dietterich,et al.  Image features and learning algorithms for biological, generic and social object recognition , 2009 .

[25]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[26]  Jan-Michael Frahm,et al.  Comparative Evaluation of Binary Features , 2012, ECCV.

[27]  Marthie de Kock Artificial Intelligence for Maximizing Content‐Based Image Retrieval , 2009 .

[28]  Luc Van Gool,et al.  Affine/ Photometric Invariants for Planar Intensity Patterns , 1996, ECCV.

[29]  Reinhard Koch,et al.  Matching of affinely invariant regions for visual servoing , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[30]  Tinne Tuytelaars,et al.  Dense interest points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  Cordelia Schmid,et al.  A sparse texture representation using affine-invariant regions , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[32]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[33]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[34]  Zongmin Ma,et al.  Artificial Intelligence for Maximizing Content Based Image Retrieval , 2008 .

[35]  Vincent Lepetit,et al.  A fast local descriptor for dense matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Xin Li,et al.  Interactive object-based image retrieval and annotation on iPad , 2013, Multimedia Tools and Applications.

[37]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[38]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[40]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[41]  Hongyang Chao,et al.  Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition , 2013, IEEE Transactions on Multimedia.

[42]  Vincent Lepetit,et al.  DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Krystian Mikolajczyk,et al.  Evaluation of local detectors and descriptors for fast feature matching , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[44]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[45]  J. Koenderink,et al.  Representation of local geometry in the visual system , 1987, Biological Cybernetics.

[46]  K. Plataniotis,et al.  Color Image Processing : Methods and Applications , 2006 .

[47]  Mari Laine-Hernandez,et al.  Evaluating Local Feature Detectors in Salient Region Detection , 2013, SCIA.

[48]  Andrew Zisserman,et al.  Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[49]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  T. Kanade,et al.  Color information for region segmentation , 1980 .

[51]  Andrew Zisserman,et al.  An Affine Invariant Salient Region Detector , 2004, ECCV.

[52]  Cordelia Schmid,et al.  Semi-Local Affine Parts for Object Recognition , 2004, BMVC.

[53]  Yi Xie,et al.  Evaluation of local features and classifiers in BOW model for image classification , 2012, Multimedia Tools and Applications.

[54]  Gang Hua,et al.  Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields , 2012, Image Vis. Comput..

[55]  Prashant Parikh A Theory of Communication , 2010 .

[56]  Luc Van Gool,et al.  Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[57]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Tony Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[59]  Hui Wei,et al.  Improvements in image categorization using codebook ensembles , 2011, Image Vis. Comput..

[60]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[61]  Joost van de Weijer,et al.  Improving Color Constancy by Photometric Edge Weighting , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  Liang Lin,et al.  Representing and recognizing objects with massive local image patches , 2012, Pattern Recognit..