VistaClara: an interactive visualization for exploratory analysis of DNA microarrays

We have created VistaClara to explre the effectiveness of applying an extended permutation matrix to the task of exploratory data analysis of multi-experiment microarray studies. The permutation matrix is a visualization technique for interactive exploratory analysis of tabular data that permits both row and column rearrangement, and fits well with the tabular forms of data characteristic of gene expression studies. However, this technique has been largely overlooked by current bioinformatics research. Our implementation supports direct incorporation of supplemental data and annotations into the matrix view. This enables visually searching for patterns in gene expression measurements that correlate with other types of relevant data (disease classes, clinical, histological, drug treatments, etc.). The heatmap visualization common in microarray analysis is extended to provide a novel alternative using size as well as color to graphically represent experimental values, thus allowing more effective quantitative comparisons. Methods to sort rows or columns by similarity extend the possible permutation operations, and allow more efficient searching for biologically relevant patterns in very large data sets. Based on overview+detail principles, a dynamic compressed heatmap view of the entire data set provides the user with overall context, including possible correlations not currently visible in the more detailed view. Combined, these techniques make it possible to perform highly interactive ad hoc visual explorations of microarray.

[1]  S. Rosenberg,et al.  Cloning of the gene coding for a shared human melanoma antigen recognized by autologous T cells infiltrating into tumor. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Ben Shneiderman,et al.  Image-Browser Taxonomy and Guidelines for Designers , 1995, IEEE Softw..

[3]  D. Slonim From patterns to pathways: gene expression data analysis comes of age , 2002, Nature Genetics.

[4]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[5]  Allan Kuchinsky,et al.  When worlds collide: Molecular biology as interdisciplinary collaboration , 2001, ECSCW.

[6]  Harri Siirtola,et al.  Interaction with the Reorderable Matrix , 1999, 1999 IEEE International Conference on Information Visualization (Cat. No. PR00210).

[7]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Ben Shneiderman,et al.  Image Browsers: Taxonomy, Guidelines, and Informal Specifications , 1994 .

[9]  Hans Hinterberger,et al.  Comparative multivariate visualization across conceptually different graphic displays , 1994, Seventh International Working Conference on Scientific and Statistical Database Management.

[10]  Ramana Rao,et al.  The table lens: merging graphical and symbolic representations in an interactive focus + context visualization for tabular information , 1994, CHI '94.

[11]  Debashis Ghosh,et al.  alpha-Methylacyl coenzyme A racemase as a tissue biomarker for prostate cancer. , 2002, JAMA.

[12]  M. Bittner,et al.  Human prostate cancer and benign prostatic hyperplasia: molecular dissection by gene expression profiling. , 2001, Cancer research.

[13]  Ramana Rao Reflections: seegomanifesto , 1999, INTR.

[14]  Allan Kuchinsky,et al.  Biological storytelling: a software tool for biological information organization based upon narrative structure , 2002, AVI '02.

[15]  Christopher G. Healey,et al.  Assisted navigation for large information spaces , 2002, IEEE Visualization, 2002. VIS 2002..

[16]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[17]  Jacques Bertin,et al.  Graphics and graphic information-processing , 1981 .