Comparative Study of Clustering Distance Measures to Determine Neural Network Architectures

This paper presents a comparative study of clustering distance measures to determine the architecture of a neural network. A number of different Distance Measures have been proposed to measure ‘distance’ for the cluster analysis, which will be applied on the dataset used to train the neural network. The goal of this study is to select the optimal number of clusters which depends on the best choice of clustering distance measure because the results of cluster analysis are used to determine the number of hidden layers and the number of hidden neurons. A particular criterion presented in this paper to link between the number of cluster obtained and the number of hidden layers of a neural network.

[1]  Chad L. Myers,et al.  Comparison of Profile Similarity Measures for Genetic Interaction Networks , 2013, PloS one.

[2]  Roberto Musmanno,et al.  Heuristic techniques to optimize neural network architecture in manufacturing applications , 2015, Neural Computing and Applications.

[3]  Dan W. Patterson,et al.  Artificial Neural Networks: Theory and Applications , 1998 .

[4]  E. F. Codd Derivability, redundancy and consistency of relations stored in large data banks , 2009, SGMD.

[5]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[6]  Oscar Castillo,et al.  A new approach for dynamic fuzzy logic parameter tuning in Ant Colony Optimization and its application in fuzzy control of a mobile robot , 2015, Appl. Soft Comput..

[7]  Anil K. Jain,et al.  A self-organizing network for hyperellipsoidal clustering (HEC) , 1996, IEEE Trans. Neural Networks.

[8]  Haidar M. Harmanani,et al.  A parallel genetic algorithm for the open-shop scheduling problem using deterministic and random moves , 2009, SpringSim '09.

[9]  Taskin Kavzoglu Determining Optimum Structure for Artificial Neural Networks , 1999 .

[10]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[11]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[12]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[13]  Aboul Ella Hassanien,et al.  Computational Intelligence in Biomedicine and Bioinformatics, Current Trends and Applications , 2008, Computational Intelligence in Biomedicine and Bioinformatics.

[14]  Chao He,et al.  Notice of Retraction Emission modeling of diesel engine fueled with biodiesel based on back propagation neural network , 2010 .

[15]  Fernando C. Lourenço,et al.  Binary-based similarity measures for categorical data and their application in Self- Organizing Maps , 2004 .

[16]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[17]  Teh Ying Wah,et al.  A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data , 2015, PloS one.

[18]  Raúl M. del Toro,et al.  OPTIMAL TUNING OF A NETWORKED LINEAR CONTROLLER USING A MULTI-OBJECTIVE GENETIC ALGORITHM AND ITS APPLICATION TO ONE COMPLEX ELECTROMECHANICAL PROCESS , 2009 .

[19]  Vipin Kumar,et al.  Similarity Measures for Categorical Data: A Comparative Evaluation , 2008, SDM.

[20]  P. Atkinson,et al.  Introduction Neural networks in remote sensing , 1997 .

[21]  Tijana Vujicic,et al.  Comparative Analysis of Methods for Determining Number of Hidden Neurons in Artificial Neural Network , 2016 .