K-Means Clustering Based on Density for Scene Image Classification

K-means clustering has been extremely popular in scene image classification. However, due to the random selection of initial cluster centers, the algorithm cannot always provide the most optimal results. In this paper, we develop a density-based k-means clustering. First, we calculate the density and distance for each feature vector. Then choose those features with high density and large distance as initial cluster centers. The remaining steps are the same with k-means. In order to evaluate our proposed algorithm, we have conducted several experiments on two-scene image datasets: Fifteen Scene Categories dataset and UIUC Sports Event dataset. The results show that our proposed method has good repeatability. Compared with the traditional k-means clustering, it can achieve higher classification accuracy when applied in multiclass scene image classification.

[1]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[2]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[3]  Francesca Odone,et al.  Histogram intersection kernel for image classification , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[4]  Fei-Fei Li,et al.  What, where and who? Classifying events by scene and object recognition , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[5]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[9]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[10]  Stephen J. Redmond,et al.  A method for initialising the K-means clustering algorithm using kd-trees , 2007, Pattern Recognit. Lett..

[11]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[12]  Bernt Schiele,et al.  Natural Scene Retrieval Based on a Semantic Modeling Step , 2004, CIVR.

[13]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.