A new visualization tool for data mining techniques

Clustering techniques and classification trees are two of the main techniques used in data mining but, at present, there is still a lack of visualization methods for these tools. Many graphs associated with clustering, also with hierarchical clustering, do not give any information about the values of the centroids’ attributes and the relationships among them. In classification trees, graphical procedures can also be developed to help simplify their interpretation and to obtain a better understanding, but more visualization methods to support this tool are needed. This paper presents a novel visualization technique called sectors on sectors (SonS), and an extended version called multidimensional sectors on sectors (MDSonS), for improving the interpretation of several data mining algorithms. These methods are applied for visualizing the results of: (a) hierarchical clustering, which makes it possible to extract all the existing relationships among centroids’ attributes at any hierarchy level; (b) growing hierarchical self-organizing maps (GHSOM), a variant of the well-known self-organizing maps (SOM), by means of which it is possible to visualize, simultaneously, the data information at each hierarchy level compactly and extract relationships among variables; (c) classification trees, in which the SonS is used for representing the input data information for each class presented in each terminal node of a classification tree providing extra information for a better understanding of the problem. These methods are tested by means of several data sets (real and synthetic). The achieved results show the suitability and usefulness of the proposed approaches.

[1]  Edward M. Reingold,et al.  Tidier Drawings of Trees , 1981, IEEE Transactions on Software Engineering.

[2]  Andreas Rauber,et al.  Uncovering hierarchical structure in data using the growing hierarchical self-organizing map , 2002, Neurocomputing.

[3]  Chang-Sung Jeong,et al.  Reconfigurable disc trees for visualizing large hierarchical information space , 1998, Proceedings IEEE Symposium on Information Visualization (Cat. No.98TB100258).

[4]  Michael Berthold,et al.  Intelligent Data Analysis , 1999, Springer Berlin Heidelberg.

[5]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[6]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[7]  Evangelos E. Milios,et al.  LogView: Visualizing Event Log Clusters , 2008, 2008 Sixth Annual Conference on Privacy, Security and Trust.

[8]  Ben Shneiderman,et al.  Tree visualization with tree-maps: 2-d space-filling approach , 1992, TOGS.

[9]  Keith Andrews,et al.  Information Slices: Visualising and Exploring Large Hierarchies using Cascading, Semi-Circular Discs , 1998 .

[10]  Risto Mukkulainen,et al.  Script Recognition with Hierarchical Feature Maps , 1990 .

[11]  Teuvo Kohonen,et al.  Self-Organizing Maps, Third Edition , 2001, Springer Series in Information Sciences.

[12]  Jock D. Mackinlay,et al.  Cone Trees: animated 3D visualizations of hierarchical information , 1991, CHI.

[13]  José David Martín-Guerrero,et al.  Sectors on sectors (SonS): A new hierarchical clustering visualization tool , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[14]  Ganesh S. Oak Information Visualization Introduction , 2022 .

[15]  Mark Kesselman,et al.  Introduction to comparative politics : political challenges and changing agendas , 2013 .

[16]  Silvia Lanteri,et al.  Classification of olive oils from their fatty acid composition , 1983 .

[17]  Michael R. Berthold,et al.  Intelligent Data Analysis , 2000, Springer Berlin Heidelberg.

[18]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[19]  P. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 1999 .

[20]  Simon M. Lin,et al.  Applications of Tree-Maps to hierarchical biological data , 2002, Bioinform..

[21]  Heidrun Schumann,et al.  Information visualization using a new focus+context technique in combination with dynamic clustering of information space , 1999, NPIVM '99.

[22]  Ben Shneiderman,et al.  Visualization and analysis of microarray and gene ontology data with treemaps , 2004, BMC Bioinformatics.

[23]  Friedrich Leisch,et al.  Neighborhood graphs, stripes and shadow plots for cluster visualization , 2010, Stat. Comput..

[24]  Chun-Houh Chen,et al.  Handbook of Data Visualization (Springer Handbooks of Computational Statistics) , 2008 .

[25]  Tamara Munzner,et al.  H3: laying out large directed graphs in 3D hyperbolic space , 1997, Proceedings of VIZ '97: Visualization Conference, Information Visualization Symposium and Parallel Rendering Symposium.

[26]  José David Martín-Guerrero,et al.  Growing Hierarchical Sectors on Sectors , 2011, ESANN.

[27]  Danny Holten,et al.  Hierarchical Edge Bundles: Visualization of Adjacency Relations in Hierarchical Data , 2006, IEEE Transactions on Visualization and Computer Graphics.

[28]  Hans-Peter Kriegel,et al.  Towards an Effective Cooperation of the Computer and the User for Classification , 2000, KDD 2000.

[29]  Keith Andrews,et al.  A Comparative Study of Four Hierarchy Browsers using the Hierarchical Visualisation Testing Environment (HVTE) , 2007, 2007 11th International Conference Information Visualization (IV '07).

[30]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[31]  L. C. Vroomen,et al.  Cheops: a compact explorer for complex hierarchies , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[32]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[33]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[34]  Chun-Houh Chen,et al.  Handbook of Data Visualization , 2016 .

[35]  Andreas Rauber,et al.  The growing hierarchical self-organizing map , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[36]  Keith Andrews,et al.  Visual exploration of large hierarchies with information pyramids , 2002, Proceedings Sixth International Conference on Information Visualisation.