A Perceptually Optimized Scheme for Visualizing Gene Expression Ratios with Confidence Values

Gene expression data studies are often concerned with comparing experimental versus control conditions. Ratios of gene expression values, fold changes, are therefore commonly used as biologically meaningful markers. Visual representations are inevitable for the explorative analysis of data. Fold changes alone are no reliable markers, since low signal intensities may lead to unreliable ratios and should therefore be visually marked less important than the more trustworthy ratios of larger expression values. Methods: We propose a new visualization scheme showing ratios and their confidence together in one single diagram, enabling a more precise explorative assessment of gene expression data. Basis of the visualization scheme are near-uniform perceptible color scales improving the readability of the commonly used red-green color scale. A sub-sampling algorithm for optimizing color scales is presented. Instead of difficult to read bivariate color maps encoding two variables into a single color we propose the use of colored patches (rectangles) of different sizes representing the absolute values, while representing ratios by a univariate color map. Different pre-processing steps for visual bandwidth limitation and reliability value estimation are proposed. Results and Conlusions: The proposed bivariate visualization scheme shows a clear perceptible order in ratio and reliability values leading to better and clearer interpretable diagrams. The proposed color scales were specifically adapted to human visual perception. Psychophysical optimized color scales are superior to traditional sRGB red-green maps. This leads to an improved explorative assessment of gene expression data.

[1]  Colin Ware,et al.  Color sequences for univariate maps: theory, experiments and principles , 1988, IEEE Computer Graphics and Applications.

[2]  Ernst Wit,et al.  Statistics for microarrays , 2004 .

[3]  Maureen C. Stone,et al.  Color gamut mapping and the printing of digital color images , 1988, TOGS.

[4]  Maureen C. Stone,et al.  Color Printing for Computer Graphics , 1990 .

[5]  Korbinian Strimmer,et al.  Modeling gene expression measurement error: a quasi-likelihood approach , 2003, BMC Bioinformatics.

[6]  Rae A. Earnshaw,et al.  Computer graphics techniques: theory and practice , 1990 .

[7]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[8]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[9]  I Spence,et al.  Target detection in scientific visualization. , 2001, Journal of experimental psychology. Applied.

[10]  Philip K. Robertson,et al.  The Generation of Color Sequences for Univariate and Bivariate Mapping , 1986, IEEE Computer Graphics and Applications.

[11]  Haim Levkowitz,et al.  Color Theory and Modeling for Computer Graphics, Visualization, and Multimedia Applications , 1997 .

[12]  Malte Buchholz,et al.  Expression profiling of the influence of RAS mutants on the TGFB1‐induced phenotype of the pancreatic cancer cell line PANC‐1 , 2004, Genes, chromosomes & cancer.

[13]  Matthew Anderson,et al.  Proposal for a Standard Default Color Space for the Internet - sRGB , 1996, CIC.