A novel method for image clustering

Image clustering has been attracting mounting focus on widely used fields, such as data compression, information retrieval, character recognition and so on, due to the emerging applications of various web-based and mobile-based image retrieval and services. To study this, based on Voronoi diagram, we propose a novel image clustering algorithm to effective discovery of image clusters in this paper. More specifically, based on Voronoi diagrams at first, a number of irregular grids are built across the whole plane. Furthermore, leveraging the good property of “the nearest neighbor” for the Voronoi diagrams, various irregular grids of plane are assigned by the points to different clusters. On the one hand, based on the density of grid points, it automatically adjusts the final suitable number of clustering; on the other hand, according to the changes of the centroids, it tunes the positions for the Voronoi's seeds. At last, the Voronoi cells finally become the result of clustering process. The empirical experiment results show that our proposed method not only can cluster image dataset effectively, but also can achieve the comparative performance with X-means algorithm and K-means algorithm. Moreover, our proposed method can outperform the effectiveness for both DBSCAN and OPTICS algorithms, which are classic density-based clustering algorithms towards larger-scale real-world applications.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Ji Wan,et al.  SOML: Sparse Online Metric Learning with Application to Image Retrieval , 2014, AAAI.

[3]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[4]  Rajeev Motwani,et al.  Maintaining variance and k-medians over data stream windows , 2003, PODS.

[5]  Wen Gao,et al.  Semi‐Supervised Learning in Reconstructed Manifold Space for 3D Caricature Generation , 2009, Comput. Graph. Forum.

[6]  Sudipto Guha,et al.  Streaming-data algorithms for high-quality clustering , 2002, Proceedings 18th International Conference on Data Engineering.

[7]  Shuangquan Wang,et al.  InferLoc: Calibration Free Based Location Inference for Temporal and Spatial Fine-Granularity Magnitude , 2012, 2012 IEEE 15th International Conference on Computational Science and Engineering.

[8]  Gunnar Rätsch,et al.  Input space versus feature space in kernel-based methods , 1999, IEEE Trans. Neural Networks.

[9]  Zhenyu Chen,et al.  Mining individual behavior pattern based on significant locations and spatial trajectories , 2012, 2012 IEEE International Conference on Pervasive Computing and Communications Workshops.

[10]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  CarSafe app: alerting drowsy and distracted drivers using dual cameras on smartphones , 2013, MobiSys '13.

[12]  Rong Jin,et al.  A Boosting Framework for Visuality-Preserving Distance Metric Learning and Its Application to Medical Image Retrieval , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Wei Liu,et al.  Semi-supervised distance metric learning for Collaborative Image Retrieval , 2008, CVPR.

[14]  Ye Xu,et al.  Multi-instance Metric Learning , 2011, 2011 IEEE 11th International Conference on Data Mining.

[15]  Yiqiang Chen,et al.  Surrounding context and episode awareness using dynamic Bluetooth data , 2012, UbiComp '12.

[16]  Jon Rigelsford,et al.  Pattern Recognition: Concepts, Methods and Applications , 2002 .

[17]  Ana L. N. Fred,et al.  Partitional vs Hierarchical Clustering Using a Minimum Grammar Complexity Approach , 2000, SSPR/SPR.

[18]  Chunyan Miao,et al.  Genetic sampling in eigenspace for 3D caricature synthesis , 2009, 2009 7th International Conference on Information, Communications and Signal Processing (ICICS).

[19]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[20]  Rina Panigrahy,et al.  Better streaming algorithms for clustering problems , 2003, STOC '03.

[21]  Shuangquan Wang,et al.  Inferring social contextual behavior from bluetooth traces , 2013, UbiComp.

[22]  Ye Xu,et al.  Preference, context and communities: a multi-faceted approach to predicting smartphone app usage patterns , 2013, ISWC '13.

[23]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[24]  Fanglin Chen,et al.  Unobtrusive sleep monitoring using smartphones , 2013, 2013 7th International Conference on Pervasive Computing Technologies for Healthcare and Workshops.

[25]  Shuangquan Wang,et al.  A supervised learning based semantic location extraction method using mobile phone data , 2012, 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE).

[26]  Chunyan Miao,et al.  Interactive 3D caricature generation based on double sampling , 2009, ACM Multimedia.

[27]  Ye Xu,et al.  Unbiased Metric Learning: On the Utilization of Multiple Datasets and Web Images for Softening Bias , 2013, 2013 IEEE International Conference on Computer Vision.

[28]  Yiqiang Chen,et al.  Personalized 3D Caricature Generation Based on PCA Subspace , 2008, PCM.

[29]  Wen Gao,et al.  Semi-supervised Learning of Caricature Pattern from Manifold Regularization , 2009, MMM.

[30]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[31]  Jiong Yang,et al.  STING: A Statistical Information Grid Approach to Spatial Data Mining , 1997, VLDB.

[32]  Shuangquan Wang,et al.  ContextSense: unobtrusive discovery of incremental social context using dynamic bluetooth data , 2014, UbiComp Adjunct.

[33]  Sudipto Guha,et al.  Clustering Data Streams: Theory and Practice , 2003, IEEE Trans. Knowl. Data Eng..

[34]  Andrew W. Moore,et al.  X-means: Extending K-means with Efficient Estimation of the Number of Clusters , 2000, ICML.