A Differential Evolution-Based Clustering for Probability Density Functions

Clustering for probability density functions (CDFs) has recently emerged as a new interest technique in statistical pattern recognition because of its potential in various practical issues. For solving the CDF problems, evolutionary techniques which are successfully applied in clustering for discrete elements have not been studied much in CDF. Therefore, this paper presents for the first time an application of the differential evolution (DE) algorithm for clustering of probability density functions (pdfs) in which the clustering problem is transformed into an optimization problem. In this optimization problem, the objective function is to minimize the internal validity measure-SF index, and the design variable is the name of the cluster in which pdfs are assigned to. To solve this optimization problem, a DE-based CDF is proposed. The efficiency and feasibility of the proposed approach are demonstrated through four numerical examples including analytical and real-life problems with gradually increasing the complexity of the problem. The obtained results mostly outperform several results of compared algorithms in the literature.

[1]  Karl O. Jones,et al.  Comparison of ant colony optimisation and differential evolution , 2007, CompSysTech '07.

[2]  C. Hung,et al.  A Comparative Study on Differential Evolution and Genetic Algorithms for Some Combinatorial Problems , 2009 .

[3]  Thao Nguyen-Trang,et al.  Fuzzy clustering of probability density functions , 2017 .

[4]  Ujjwal Maulik,et al.  Automatic Fuzzy Clustering Using Modified Differential Evolution for Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[5]  K. R. Harrigan An Application of Clustering for Strategic Group Analysis , 1985 .

[6]  Saku Kukkonen,et al.  Real-parameter optimization with differential evolution , 2005, 2005 IEEE Congress on Evolutionary Computation.

[7]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[8]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[10]  T. Nguyen-Thoi,et al.  A two-step approach for damage detection in laminated composite structures using modal strain energy method and an improved differential evolution algorithm , 2016 .

[11]  Thu Pham-Gia,et al.  Statistical Discrimination Analysis Using the Maximum Function , 2008, Commun. Stat. Simul. Comput..

[12]  Sung-Hyuk Cha Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions , 2007 .

[13]  Trung Nguyen-Thoi,et al.  An improved constrained differential evolution using discrete variables (D-ICDE) for layout optimization of truss structures , 2015, Expert Syst. Appl..

[14]  V. Ho-Huu,et al.  Optimal design of truss structures with frequency constraints using improved differential evolution algorithm based on an adaptive mutation scheme , 2016 .

[15]  Xiaodong Li,et al.  Time series forecasting by evolving artificial neural networks using genetic algorithms, differential evolution and estimation of distribution algorithm , 2011 .

[16]  T. Pham-Gia,et al.  Clustering probability distributions , 2010 .

[17]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[18]  Joni-Kristian Kämäräinen,et al.  Differential Evolution Training Algorithm for Feed-Forward Neural Networks , 2003, Neural Processing Letters.

[19]  Angela Montanari,et al.  Model-based clustering of probability density functions , 2013, Adv. Data Anal. Classif..

[20]  Prabir Kumar Jena,et al.  Differential Evolution: An Inverse Approach for Crack Detection , 2013 .

[21]  Enrique H. Ruspini,et al.  A New Approach to Clustering , 1969, Inf. Control..

[22]  Tung Khac Truong,et al.  An improved differential evolution based on roulette wheel selection for shape and size optimization of truss structures with frequency constraints , 2016, Neural Computing and Applications.

[23]  Liangpei Zhang,et al.  Automatic Fuzzy Clustering Based on Adaptive Multi-Objective Differential Evolution for Remote Sensing Imagery , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[24]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[25]  René Vidal,et al.  Unsupervised Riemannian Clustering of Probability Density Functions , 2008, ECML/PKDD.

[26]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[27]  Võ Văn Tài,et al.  Clustering for probability density functions based on Genetic Algorithm , 2016 .

[28]  V. Ho-Huu,et al.  An effective reliability-based improved constrained differential evolution for reliability-based design optimization of truss structures , 2016, Adv. Eng. Softw..

[29]  V. Ho-Huu,et al.  Modified genetic algorithm-based clustering for probability density functions , 2017 .

[30]  Lukasz Ciszak,et al.  Application of clustering and association methods in data cleaning , 2008, 2008 International Multiconference on Computer Science and Information Technology.

[31]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Gianfranco Chicco,et al.  Application of clustering algorithms and self organising maps to classify electricity customers , 2003, 2003 IEEE Bologna Power Tech Conference Proceedings,.

[33]  M. Anusha,et al.  Evolutionary Clustering Algorithm Using Criterion-Knowledge-Ranking for Multi-objective Optimization , 2016, 2016 World Conference on Futuristic Trends in Research and Innovation for Social Welfare (Startup Conclave).

[34]  Wen-Liang Hung,et al.  An automatic clustering algorithm for probability density functions , 2015 .

[35]  Zhongying Zhao,et al.  Probability model selection and parameter evolutionary estimation for clustering imbalanced data without sampling , 2016, Neurocomputing.

[36]  David P. Hofmeyr Clustering by Minimum Cut Hyperplanes , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Wen Gao,et al.  Shape-based Adult Image Detection , 2006, Int. J. Image Graph..

[38]  S. Austin,et al.  Clustering of fast-food restaurants around schools: a novel application of spatial statistics to the study of food environments. , 2005, American journal of public health.

[39]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[40]  V. Ho-Huu,et al.  An adaptive elitist differential evolution for optimization of truss structures with discrete design variables , 2016 .

[41]  Arthur Zimek,et al.  Density-Based Clustering Validation , 2014, SDM.

[42]  He-sheng Tang,et al.  Differential evolution strategy for structural system identification , 2008 .

[43]  T. Nguyen-Thoi,et al.  Static and frequency optimization of folded laminated composite plates using an adjusted Differential Evolution algorithm and a smoothed triangular plate element , 2015 .

[44]  Xiaodong Li,et al.  Time series forecasting by evolving artificial neural networks with genetic algorithms, differential evolution and estimation of distribution algorithm , 2011, Neural Computing and Applications.

[45]  V. Ho-Huu,et al.  Optimization of laminated composite plates for maximizing buckling load using improved differential evolution and smoothed finite element method , 2016 .

[46]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..