A Novel Ensemble based Cluster Analysis using Similarity Matrices and Clustering Algorithm (SMCA)

today's world data analytics is gaining popularity due to user's motivation towards online data storage. This storage is not organized because of content types and data handling schemes complexity. User aims to retrieve data in lesser time with logical outcomes as desired can be achieved by applying data mining. Clustering in data mining is one of the known categorization approach used for formation of groups of similar elements having certain properties in common with other elements. This formation sometime creates noisy result in terms of formatted clusters. It depends on various factors such as distance measures, proximity values, objective functions, categorical or numerical attribute types etc. Over the last few years various schemes are suggested by different authors for improving the performance of tradition clustering algorithms. Among them, one is ensemble based clustering. Ensemble uses the mechanism for criteria selection from newly formed clusters with a defined portioning and joining methods to generate a single result instead of multiple solutions. The generation results are affected by various environmental parameters such as number of cluster, partitioning types, proximity values, objective function etc. This paper propose a novel SMCA based ensemble clustering algorithm for improvements over the existing issues defined in the paper. At the primary level of work and analytical evaluations, it shows the promising results in near future.

[1]  Mohamed S. Kamel,et al.  Cluster-Based Cumulative Ensembles , 2005, Multiple Classifier Systems.

[2]  Joydeep Ghosh,et al.  Transfer Learning with Cluster Ensembles , 2011, ICML Unsupervised and Transfer Learning.

[3]  Xiuzhen Cheng,et al.  An Ensemble Method of Discovering Sample Classes Using Gene Expression Profiling , 2007 .

[4]  Robert Neumayer,et al.  Clustering Based Ensemble Classification for Spam Filtering , 2006 .

[5]  Jordi Turmo,et al.  Non-Parametric Document Clustering by Ensemble Methods , 2008, Proces. del Leng. Natural.

[6]  Marcin Pełka Ensemble Approach for Clustering of Interval-Valued Symbolic Data , 2012 .

[7]  Parag Kulkarni,et al.  Incremental Learning: Areas and Methods - A Survey , 2012 .

[8]  Kurt Hornik,et al.  A CLUE for CLUster Ensembles , 2005 .

[9]  Chris H. Q. Ding,et al.  Hierarchical Ensemble Clustering , 2010, 2010 IEEE International Conference on Data Mining.

[10]  Abdolreza Mirzaei,et al.  A Novel Hierarchical-Clustering-Combination Scheme Based on Fuzzy-Similarity Relations , 2010, IEEE Transactions on Fuzzy Systems.

[11]  Yong Wang,et al.  An effective ensemble method for hierarchical clustering , 2012, C3S2E '12.

[12]  Sharon Willis,et al.  Uniformed Services University of the Health Sciences. , 2003, Academic medicine : journal of the Association of American Medical Colleges.

[13]  Ludmila I. Kuncheva,et al.  Experimental Comparison of Cluster Ensemble Methods , 2006, 2006 9th International Conference on Information Fusion.

[14]  Burak Eksioglu,et al.  Performance of an Ensemble Clustering Algorithm on Biological Data Sets , 2011 .