Return of the king: The Fourier transform based descriptor for visual object classification

Most of the state-of-arts visual object classification methods use bag of words model for image representation. In this method, patches extracted from images are described by different shape and texture descriptors such as SIFT, LBP, SURF, etc. In this paper we introduce a new descriptor based on weighted histograms of phase angles of local Fourier transform (FT). We compare the classification accuracies obtained by using the proposed descriptor to the ones obtained by other well-known descriptors on Caltech-4 and Coil-100 data sets. Experimental results show that our proposed descriptor provides good accuracies indicating that FT based local descriptor captures important characteristics of images that are useful for classification. When we combined image representations obtained by FT descriptor with the representations obtained by other descriptors, results even get better suggesting that tested descriptors encode differential complementary information.

[1]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[2]  Feng Zhou,et al.  Texture feature based on local Fourier transform , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[3]  Frédéric Jurie,et al.  Randomized Clustering Forests for Image Classification , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[5]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[6]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Kidiyo Kpalma,et al.  Texture features based on local Fourier histogram: self-compensation against rotation , 2008, J. Electronic Imaging.

[8]  Frédéric Jurie,et al.  Sampling Strategies for Bag-of-Features Image Classification , 2006, ECCV.

[9]  Krystian Mikolajczyk,et al.  On a Quest for Image Descriptors Based on Unsupervised Segmentation Maps , 2010, 2010 20th International Conference on Pattern Recognition.

[10]  Alexei A. Efros,et al.  Discovering object categories in image collections , 2005 .

[11]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[12]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[13]  U.A. Ahmad,et al.  Texture features based on Fourier transform and Gabor filters: an empirical comparison , 2007, 2007 International Conference on Machine Vision.

[14]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[15]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[16]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[17]  K. N. Dollman,et al.  - 1 , 1743 .

[18]  Cordelia Schmid,et al.  Combining efficient object localization and image classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[19]  David A. Forsyth,et al.  The effects of segmentation and feature choice in a translation model of object recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[20]  Marko Heikkilä,et al.  Description of interest regions with local binary patterns , 2009, Pattern Recognit..

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[23]  Matti Pietikäinen,et al.  Rotation Invariant Image Description with Local Binary Pattern Histogram Fourier Features , 2009, SCIA.