CAVE-SOM: Immersive visual data mining using 3D Self-Organizing Maps

Data mining techniques are becoming indispensable as the amount and complexity of available data is rapidly growing. Visual data mining techniques attempt to include a human observer in the loop and leverage human perception for knowledge extraction. This is commonly allowed by performing a dimensionality reduction into a visually easy-to-perceive 2D space, which might result in significant loss of important spatial and topological information. To address this issue, this paper presents the design and implementation of a unique 3D visual data mining framework - CAVE-SOM. The CAVE-SOM system couples the Self-Organizing Map (SOM) algorithm with the immersive Cave Automated Virtual Environment (CAVE). The main advantages of the CAVE-SOM system are: i) utilizing a 3D SOM to perform dimensionality reduction of large multi-dimensional datasets, ii) immersive visualization of the trained 3D SOM, iii) ability to explore and interact with the multi-dimensional data in an intuitive and natural way. The CAVE-SOM system uses multiple visualization modes to guide the visual data mining process, for instance the data histograms, U-matrix, connections, separations, uniqueness and the input space view. The implemented CAVE-SOM framework was validated on several benchmark problems and then successfully applied to analysis of wind-power generation data. The knowledge extracted using the CAVE-SOM system can be used for further informed decision making and machine learning.

[1]  Edward J. Wegman,et al.  Immersive Projection Technology for Visual Data Mining , 2002 .

[2]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[3]  Osmar R. Zaïane,et al.  On-line analytical processing while immersed in a CAVE , 2001, Proceedings Second International Workshop on User Interfaces in Data Intensive Systems. UIDIS 2001.

[4]  Kwong-Sak Leung,et al.  Expanding Self-Organizing Map for data visualization and cluster analysis , 2004, Inf. Sci..

[5]  Michael H. Böhlen,et al.  Visual Data Mining - Theory, Techniques and Tools for Visual Analytics , 2008, Visual Data Mining.

[6]  David H. Laidlaw,et al.  Experiments in Immersive Virtual Reality for Scientific Visualization , 2002, Comput. Graph..

[7]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[8]  José Alfredo Ferreira Costa,et al.  Gradient-based SOM clustering and visualisation methods , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[9]  Edward J. Wegman Affordable environments for 3D collaborative data visualization , 2000, Comput. Sci. Eng..

[10]  Tetsuro Ogi,et al.  Super High Definition Three-Dimensional Display Evironment Applied to Visual Data Mining , 2010, 2010 13th International Conference on Network-Based Information Systems.

[11]  Tony R. Martinez,et al.  Iterative Non-linear Dimensionality Reduction with Manifold Sculpting , 2007, NIPS.

[12]  Guoquan Jiang,et al.  The Research of Data Mining Based on Neural Networks , .

[13]  Zhiwen Liu,et al.  Explored research on data preprocessing and mining technology for clinical data applications , 2010, 2010 2nd IEEE International Conference on Information Management and Engineering.

[14]  Gianluca Bontempi,et al.  New Routes from Minimal Approximation Error to Principal Components , 2008, Neural Processing Letters.

[15]  Jean-Claude Thill,et al.  Visual Data Mining in Spatial Interaction Analysis with Self-Organizing Maps , 2009 .

[16]  Hang Li,et al.  Fuzzy self-organizing maps for data mining with incomplete data sets , 2010, 2010 International Conference on Computer Application and System Modeling (ICCASM 2010).

[17]  Erzsébet Merényi,et al.  Exploiting Data Topology in Visualization and Clustering of Self-Organizing Maps , 2009, IEEE Transactions on Neural Networks.

[18]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[19]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[20]  Julio J. Valdés,et al.  Evolutionary computation based nonlinear transformations to low dimensional spaces for sensor data fusion and Visual Data Mining , 2010, IEEE Congress on Evolutionary Computation.

[21]  Abdulmotaleb El-Saddik,et al.  Exploring the underlying structure of haptic-based handwritten signatures using visual data mining techniques , 2010, 2010 IEEE Haptics Symposium.

[22]  Michael H. Böhlen,et al.  3D visual data mining - goals and experiences , 2003, Comput. Stat. Data Anal..

[23]  Erik Granum,et al.  Methods for visual mining of data in Virtual Reality , 2001 .

[24]  Anil K. Jain,et al.  A non-linear projection method based on Kohonen's topology preserving maps , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[25]  Juha Vesanto,et al.  SOM-based data visualization methods , 1999, Intell. Data Anal..

[26]  Pak Chung Wong,et al.  Guest Editor's Introduction: Visual Data Mining , 1999, IEEE Computer Graphics and Applications.

[27]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[28]  Yoshihiro Okada,et al.  Treecube+3D-ViSOM: Combinational Visualization Tool for Browsing 3D Multimedia Data , 2007, 2007 11th International Conference Information Visualization (IV '07).

[29]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[30]  Daniel A. Keim,et al.  Information Visualization and Visual Data Mining , 2002, IEEE Trans. Vis. Comput. Graph..

[31]  Ahmed Seffah,et al.  From Visualization to Visual Mining: Application to Environmental Data , 2008, First International Conference on Advances in Computer-Human Interaction.