Multi-objective Evolutionary Optimization of Neural Networks for Virtual Reality Visual Data Mining: Application to Hydrochemistry

A method for the construction of virtual reality spaces for visual data mining using multi-objective optimization with genetic algorithms on neural networks is presented. Two neural network layers (output and last hidden) are used for the construction of simultaneous solutions for: a supervised classification of data patterns and the computation of two unsupervised similarity structure preservation measures between the original data matrix and its image in the new space. A set of spaces is constructed from selected solutions along the Pareto front which enables the understanding of the internal properties of the data based on visual inspection of non-dominating spaces with different properties. This strategy represents a conceptual improvement over spaces computed by single-objective optimization. The presented approach is domain independent and is illustrated with an application to the study of hydrochemical properties of ice and water samples from the Arctic.

[1]  J.J. Valdes,et al.  Virtual reality visual data mining with nonlinear discriminant neural networks: application to leukemia and Alzheimer gene expression data , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[2]  Graham Kendall,et al.  Search Methodologies: Introductory Tutorials in Optimization and Decision Support Techniques , 2013 .

[3]  Julio J. Valdés,et al.  Permission is granted to quote short excerpts and to reproduce figures and tables from this report, provided that the source of such material is fully acknowledged. Data Mining of Gene Expression Changes in , 2003 .

[4]  Anil K. Jain,et al.  Discriminant analysis neural networks , 1993, IEEE International Conference on Neural Networks.

[5]  I. Borg Multidimensional similarity structure analysis , 1987 .

[6]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[7]  Anil K. Jain,et al.  Artificial neural network for nonlinear projection of multivariate data , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[8]  Kalyanmoy Deb,et al.  A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimisation: NSGA-II , 2000, PPSN.

[9]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[10]  William H. Press,et al.  Numerical recipes in C , 2002 .

[11]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[12]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[13]  Charles Gide,et al.  Cours d'économie politique , 1911 .

[14]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[15]  David Lowe,et al.  The optimised internal representation of multilayer classifier networks performs nonlinear discriminant analysis , 1990, Neural Networks.

[16]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[17]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[18]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[19]  Julio J. Valdés,et al.  Virtual Reality Representation of Information Systems and Decision Rules: An Exploratory Technique for Understanding Data and Knowledge Structure , 2003, RSFDGrC.

[20]  Timothy Masters,et al.  Advanced algorithms for neural networks: a C++ sourcebook , 1995 .