Robin: An Intuitive Wizard Application for R-Based Expression Microarray Quality Assessment and Analysis

The wide application of high-throughput transcriptomics using microarrays has generated a plethora of technical platforms, data repositories, and sophisticated statistical analysis methods, leaving the individual scientist with the problem of choosing the appropriate approach to address a biological question. Several software applications that provide a rich environment for microarray analysis and data storage are available (e.g. GeneSpring, EMMA2), but these are mostly commercial or require an advanced informatics infrastructure. There is a need for a noncommercial, easy-to-use graphical application that helps the lab researcher find an appropriate method for analyzing microarray data without requiring programming skills or an expert understanding of the complex underlying statistics. We have developed Robin, a Java-based graphical wizard application that harnesses the advanced statistical analysis functions of the R/BioConductor project. Robin implements streamlined workflows that guide the user through all steps of two-color, single-color, or Affymetrix microarray analysis. It provides functions for thorough quality assessment of the data and automatically generates warnings to notify the user of potential outliers, low-quality chips, or low statistical power. The results are generated in a standard format that can be used directly with specialized analysis tools such as MapMan and PageMan as well as with generic spreadsheet applications. To further improve user friendliness, Robin includes both integrated help and comprehensive external documentation. To demonstrate the statistical power and ease of use of the workflows in Robin, we present a case study in which we apply Robin to analyze a two-color microarray experiment comparing gene expression in tomato (Solanum lycopersicum) leaves, flowers, and roots.
