Novel hybrid object-based non-parametric clustering approach for grouping similar objects in specific visual domains

Abstract Current widely employed clustering approaches may not yield satisfactory results with regard to the characteristics and distribution of datasets and number of clusters to be sought, especially for visual domains in multidimensional space. This study establishes a novel clustering methodology using a pairwise similarity matrix, Clustering Visual Objects in Pairwise Similarity Matrix (CVOIPSM), for grouping similar objects in specific visual domains. A dimensionality reduction and feature extraction technique, along with a distance measuring method and a newly established algorithm, Clustering in Pairwise Similarity Matrix (CIPSM), are combined to develop the CVOIPSM methodology. CIPSM utilizes both Rk-means and an agglomerative, contractible, expandable (ACE) technique to calculate a membership degree based on maximizing inter-class similarity and minimizing intra-class similarity. CVOIPSM has been tested on several datasets, with average success rates on downsized subsamples between 87.5% and 97.75% and between 81% and 87% on the larger datasets. The difference in the success rates for small and large datasets is not statistically significant (p > 0.01). Moreover, this method automatically determines the likely number of clusters without any user dictation. The empirical results and the statistical significance test on these results ensure that CVOIPSM performs effectively and efficiently on specific visual domains, disclosing the interrelated patterns of similarities among objects.

[1]  Sanjay Kumar Dubey,et al.  Comparative Analysis of K-Means and Fuzzy C- Means Algorithms , 2013 .

[2]  Sanjoy Dasgupta,et al.  Incremental Clustering: The Case for Extra Clusters , 2014, NIPS.

[3]  François-Joseph Lapointe,et al.  Using the stability of objects to determine the number of clusters in datasets , 2017, Inf. Sci..

[4]  András Zempléni,et al.  Genetic algorithms and grid technologies in clustering, an example: Clustering of images , 2008, Qual. Reliab. Eng. Int..

[5]  Junhui Wang,et al.  Selection of the number of clusters via the bootstrap method , 2012, Comput. Stat. Data Anal..

[6]  Peter Eisert,et al.  2D-3D Mixed Face Recognition Schemes , 2008 .

[7]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[8]  Kaya Kuru,et al.  A Novel Hybrid Clustering Approach for Unsupervised Grouping of Similar Objects , 2014, HAIS.

[9]  In Seop Na,et al.  Automatic Classification for Various Images Collections Using Two Stages Clustering Method , 2013 .

[10]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[11]  Zhang Xiong,et al.  Incremental Clustering Using Information Bottleneck Theory , 2011, Int. J. Pattern Recognit. Artif. Intell..

[12]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[13]  Horst Bischof,et al.  Context-driven clustering by multi-class classification in an active learning framework , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[14]  Roberto Cipolla,et al.  MCBoost: Multiple Classifier Boosting for Perceptual Co-clustering of Images and Visual Features , 2008, NIPS.

[15]  Emilio Corchado,et al.  A survey of multiple classifier systems as hybrid systems , 2014, Inf. Fusion.

[16]  Supriya Kapoor,et al.  Facial Gesture Recognition Using Correlation And Mahalanobis Distance , 2010, ArXiv.

[17]  Poonam Fauzdar And Sujata Kindri Comparitive Analysis Of K Means And Fuzzy C Means Algorithm , 2013 .

[18]  Ashok Chandrashekar,et al.  Learning what is where from unlabeled images: joint localization and clustering of foreground objects , 2013, Machine Learning.

[19]  Pawel A Penczek,et al.  Iterative stable alignment and clustering of 2D transmission electron microscope images. , 2012, Structure.

[20]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[21]  Yongli Liu,et al.  Information bottleneck based incremental fuzzy clustering for large biomedical data , 2016, J. Biomed. Informatics.

[22]  Y. Rani,et al.  A Study of Hierarchical Clustering Algorithm , 2013 .

[23]  Viktor K. Prasanna,et al.  Semantic image clustering using object relation network , 2012, CVM.

[24]  Sanjay Silakari,et al.  Color Image Clustering using Block Truncation Algorithm , 2009, ArXiv.

[25]  Emilio Corchado,et al.  Recent trends in intelligent data analysis , 2014, Neurocomputing.

[26]  Brian Everitt,et al.  Cluster analysis , 1974 .

[27]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  M. Karthikeyan,et al.  Probability based document clustering and image clustering using content-based image retrieval , 2013, Appl. Soft Comput..

[29]  Mahesan Niranjan,et al.  Biomedical visual data analysis to build an intelligent diagnostic decision support system in medical genetics , 2014, Artif. Intell. Medicine.

[30]  Sakshi Patel,et al.  A study of hierarchical clustering algorithms , 2015, 2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom).

[31]  V. Kumar,et al.  An Efficient Algorithm for Segmentation Using Fuzzy Local Information C-Means Clustering , 2011 .

[32]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[33]  I. Pitas,et al.  Hierarchical Face Clustering using SIFT Image Features , 2007, 2007 IEEE Symposium on Computational Intelligence in Image and Signal Processing.

[34]  Nicholas A. Hamilton,et al.  Visualizing and clustering high throughput sub-cellular localization imaging , 2008, BMC Bioinformatics.

[35]  Ying Liu,et al.  Region-based image retrieval with high-level semantics using decision tree learning , 2008, Pattern Recognit..

[36]  C. P. Blesslin Elizabeth,et al.  Spectral Clustering of Images in LUV Color Space by Spatial-Color Pixel Classification , 2010 .

[37]  B. Everitt,et al.  Cluster Analysis Ed. 5 , 2011 .

[38]  C. Krishna Mohan,et al.  Efficient clustering approach using incremental and hierarchical clustering methods , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[39]  Maria-Florina Balcan,et al.  Robust hierarchical clustering , 2013, J. Mach. Learn. Res..

[40]  Chinki Chandhok A Novel Approach to Image Segmentation using Artificial Neural Networks and K-Means Clustering , 2012 .

[41]  Amedeo Napoli,et al.  Ontology-guided data preparation for discovering genotype-phenotype relationships , 2008, BMC Bioinformatics.

[42]  P. Corcoran,et al.  A Review of Face Recognition Techniques for In-Camera Applications , 2007, 2007 International Symposium on Signals, Circuits and Systems.

[43]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  B. S. Manjunath,et al.  Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.