Goulphar: rapid access and expertise for standard two-color microarray normalization methods

BackgroundRaw data normalization is a critical step in microarray data analysis because it directly affects data interpretation. Most of the normalization methods currently used are included in the R/BioConductor packages but it is often difficult to identify the most appropriate method. Furthermore, the use of R commands for functions and graphics can introduce mistakes that are difficult to trace. We present here a script written in R that provides a flexible means of access to and monitoring of data normalization for two-color microarrays. This script combines the power of BioConductor and R analysis functions and reduces the amount of R programming required.ResultsGoulphar was developed in and runs using the R language and environment. It combines and extends functions found in BioConductor packages (limma and marray) to correct for dye biases and spatial artifacts. Goulphar provides a wide range of optional and customizable filters for excluding incorrect signals during the pre-processing step. It displays informative output plots, enabling the user to monitor the normalization process, and helps adapt the normalization method appropriately to the data. All these analyses and graphical outputs are presented in a single PDF report.ConclusionGoulphar provides simple, rapid access to the power of the R/BioConductor statistical analysis packages, with precise control and visualization of the results obtained. Complete documentation, examples and online forms for setting script parameters are available from http://transcriptome.ens.fr/goulphar/.

[1]  Duccio Cavalieri,et al.  Standards for Microarray Data , 2002, Science.

[2]  Zlatko Trajanoski,et al.  CARMAweb: comprehensive R- and bioconductor-based web service for microarray data analysis , 2006, Nucleic Acids Res..

[3]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[4]  David P. Kreil,et al.  There is no silver bullet - a guide to low-level data transforms and normalisation methods for microarray data , 2005, Briefings Bioinform..

[5]  Robert E. W. Hancock,et al.  ArrayPipe: a flexible processing pipeline for microarray data , 2004, Nucleic Acids Res..

[6]  John N. Weinstein,et al.  Mistaken Identifiers: Gene name errors can be introduced inadvertently when using Excel in bioinformatics , 2004, BMC Bioinformatics.

[7]  J. Michael Cherry,et al.  Microarray data quality analysis: lessons from the AFGC project , 2004, Plant Molecular Biology.

[8]  D. Wilkins,et al.  The effect of normalization on microarray data analysis. , 2004, DNA and cell biology.

[9]  C. Pipper,et al.  [''R"--project for statistical computing]. , 2008, Ugeskrift for laeger.

[10]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[11]  Joaquín Dopazo,et al.  DNMAD: web-based diagnosis and normalization for microarray data , 2004, Bioinform..

[12]  Kathleen F. Kerr,et al.  Standardizing global gene expression analysis between laboratories and across platforms , 2005, Nature Methods.

[13]  Gordon K. Smyth,et al.  limmaGUI: A graphical user interface for linear modeling of microarray data , 2004, Bioinform..

[14]  Chiara Romualdi,et al.  MIDAW: a web tool for statistical analysis of microarray data , 2005, Nucleic Acids Res..

[15]  Jonathan Pevsner,et al.  SNOMAD (Standardization and NOrmalization of MicroArray Data): web-accessible gene expression data analysis , 2002, Bioinform..

[16]  John Quackenbush,et al.  Multiple-laboratory comparison of microarray platforms , 2005, Nature Methods.

[17]  Yipeng Wang,et al.  WebArray: an online platform for microarray data analysis , 2005, BMC Bioinformatics.

[18]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[19]  Claude Jacq,et al.  New Insights into the Pleiotropic Drug Resistance Network from Genome-Wide Characterization of the YRR1 Transcription Factor Regulation System , 2002, Molecular and Cellular Biology.