Radviz extensions with applications

RadViz (RV), a visualization tool developed at the Institute for Visualization and Perception Research at the University of Massachusetts Lowell, has proved to be very useful in a wide variety of applications. It has also been incorporated internationally into several generalized visualization systems. This research represents efforts to extend the ability of RV to visualize complex datasets. RV has the particular capability of being able to effectively display high dimensional datasets. This research has made use of this property by developing Vectorized RadViz (VRV). VRV provides the capability to enhance the power of RV by creating dimensional anchors assigned to individual values coming from each of the dimensions in the dataset. This tends to significantly increase the number of dimensions to be displayed, again, harnessing RV's strength to display high dimensional datasets. Repositioning these dimensional anchors expands the ability of RV to expose underlying characteristics of the dataset. This research shows how VRV can be applied to cluster ensembles and decision trees and how RV can display fuzzy clusters. Using the extent to which each record is a member of each cluster of the fuzzy cluster set RV is able to display their relationships. Fuzzy clusters were also compared to cluster ensembles using VRV. The basic features of each of the visualizations in this research were illustrated using the Iris dataset but also applied to larger scale microarray datasets. Note: Much of this research has already been published in references (1) and (2).

[1]  Silvia Miksch,et al.  Gravi++: Interactive Information Visualization to Explore Highly Structured Temporal Data , 2005, J. Univers. Comput. Sci..

[2]  Javier M. Moguerza,et al.  Detecting the Number of Clusters Using a Support Vector Machine Approach , 2002, ICANN.

[3]  Aidong Zhang,et al.  VizStruct: exploratory visualization for gene expression profiling , 2004, Bioinform..

[4]  Nathan Cooprider,et al.  Extension of star coordinates into three dimensions , 2007, Electronic Imaging.

[5]  Georges G. Grinstein,et al.  Intelligently resolving point occlusion , 2003, IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714).

[6]  Stefan Rüger,et al.  Info Navigator: A visualization tool for document searching and browsing , 2003 .

[7]  Georges G. Grinstein,et al.  Evidence for Proximal to Distal Appendage Amputation Site Effects from Global Gene Expression Correlations Found in Newt Microarrays , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[8]  Georges G. Grinstein,et al.  DNA visual and analytic data mining , 1997 .

[9]  Andreas Wierse,et al.  Information Visualization in Data Mining and Knowledge Discovery , 2001 .

[10]  Stefan Berchtold,et al.  Similarity clustering of dimensions for an enhanced visualization of multidimensional data , 1998, Proceedings IEEE Symposium on Information Visualization (Cat. No.98TB100258).

[11]  Matthew O. Ward,et al.  Clutter Reduction in Multi-Dimensional Data Visualization Using Dimension Reordering , 2004 .

[12]  Georges G. Grinstein,et al.  Vectorized Radviz and Its Application to Multiple Cluster Datasets , 2008, IEEE Transactions on Visualization and Computer Graphics.

[13]  Haim Levkowitz,et al.  From Visual Data Exploration to Visual Data Mining: A Survey , 2003, IEEE Trans. Vis. Comput. Graph..

[14]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[15]  Georges G. Grinstein,et al.  Visualizing Fuzzy Clusters Using RadViz , 2009, 2009 13th International Conference Information Visualisation.

[16]  Georges G. Grinstein,et al.  Heat Map Visualizations Allow Comparison of Multiple Clustering Results and Evaluation of Dataset Quality: Application to Microarray Data , 2007, 2007 11th International Conference Information Visualization (IV '07).

[17]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[18]  Ivan Bratko,et al.  VizRank: finding informative data projections in functional genomics by machine learning , 2005, Bioinform..

[19]  Georges G. Grinstein,et al.  High-Dimensional Visualization Support for Data Mining Gene Expression Data , 2001 .

[20]  Eser Kandogan,et al.  Visualizing multi-dimensional clusters, trends, and outliers using star coordinates , 2001, KDD '01.

[21]  Christian Döring,et al.  Fundamentals of Fuzzy Clustering , 2007 .

[22]  Georges G. Grinstein,et al.  Dimensional anchors: a graphic primitive for multidimensional multivariate information visualizations , 1999, NPIVM '99.

[23]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[24]  K. Marx,et al.  Applications of Machine Learning and High‐Dimensional Visualization in Cancer Detection, Diagnosis, and Management , 2004, Annals of the New York Academy of Sciences.

[25]  James C. Bezdek,et al.  Visual cluster validity for prototype generator clustering models , 2003, Pattern Recognit. Lett..

[26]  Daniel B. Carr,et al.  Some visualization challenges , 2001 .

[27]  Urska Cvek,et al.  2D and 3D Neural-Network Based Visualization of High-Dimensional Biomedical Data , 2007, 2007 11th International Conference Information Visualization (IV '07).

[28]  Blaz Zupan,et al.  Data and text mining Visualization-based cancer microarray data classification analysis , 2007 .

[29]  Frank Klawonn,et al.  Visual Inspection of Fuzzy Clustering Results , 2003 .

[30]  Ana L. N. Fred,et al.  Combining multiple clusterings using evidence accumulation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  André Csillaghy,et al.  SphereViz - Data Exploration in a Virtual Reality Environment , 2007, 2007 11th International Conference Information Visualization (IV '07).

[32]  ML Ujwal,et al.  A Machine Learning Approach to Pharmacological Profiling of the Quinone Scaffold in the NCI Database: A Compound Class Enriched in Those Effective Against Melanoma and Leukemia Cell Lines , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[33]  Giuseppe Santucci,et al.  Reducing InfoVis cluttering through non uniform sampling, displacement, and user perception , 2006, Electronic Imaging.

[34]  Ana L. N. Fred,et al.  Finding Consistent Clusters in Data Partitions , 2001, Multiple Classifier Systems.

[35]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[36]  Raquel M. Pillat,et al.  Experimental study on evaluation of multidimensional information visualization techniques , 2005, CLIHC '05.

[37]  Haim Levkowitz,et al.  Enhanced High Dimensional Data Visualization through Dimension Reduction and Attribute Arrangement , 2006, Tenth International Conference on Information Visualisation (IV'06).

[38]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[39]  János Abonyi,et al.  Aggregation and Visualization of Fuzzy Clusters Based on Fuzzy Similarity Measures , 2007 .

[40]  Yike Guo,et al.  New paradigms in information visualization. , 2000, SIGIR 2000.

[41]  Blaz Zupan,et al.  FreeViz - An intelligent multivariate visualization approach to explorative analysis of biomedical data , 2007, J. Biomed. Informatics.