Array Analysis Manager—An automated DNA microarray analysis tool simplifying microarray data filtering, bias recognition, normalization, and expression analysis

Desoxyribonucleic acid (DNA) microarray experiments generate big datasets. To successfully harness the potential information within, multiple filtering, normalization, and analysis methods need to be applied. An in‐depth knowledge of underlying physical, chemical, and statistical processes is crucial to the success of this analysis. However, due to the interdisciplinarity of DNA microarray applications and experimenter backgrounds, the published analyses differ greatly, for example, in methodology. This severely limits the comprehensibility and comparability among studies and research fields. In this work, we present a novel end‐user software, developed to automatically filter, normalize, and analyze two‐channel microarray experiment data. It enables the user to analyze single chip, dye‐swap, and loop experiments with an extended dynamic intensity range using a multiscan approach. Furthermore, to our knowledge, this is the first analysis software solution, that can account for photobleaching, automatically detected by an artificial neural network. The user gets feedback on the effectiveness of each applied normalization regarding bias minimization. Standardized methods for expression analysis are included as well as the possibility to export the results in the Gene Expression Omnibus (GEO) format. This software was designed to simplify the microarray analysis process and help the experimenter to make educated decisions about the analysis process to contribute to reproducibility and comparability.

[1]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[2]  Tao Han,et al.  Microarray scanner calibration curves: characteristics and implications , 2005, BMC Bioinformatics.

[3]  David P. Kreil,et al.  Physico-chemical foundations underpinning microarray and next-generation sequencing experiments , 2013, Nucleic acids research.

[4]  Rafael A. Irizarry,et al.  quantro: a data-driven approach to guide the choice of an appropriate normalization method , 2015, Genome Biology.

[5]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for microarray meta-analysis , 2012, Nucleic acids research.

[6]  Petri Auvinen,et al.  Bayesian Hierarchical Model for Correcting Signal Saturation in Microarrays Using Pixel Intensities , 2006, Statistical applications in genetics and molecular biology.

[7]  Zhijin Wu,et al.  A review of statistical methods for preprocessing oligonucleotide microarrays , 2009, Statistical methods in medical research.

[8]  Johan Vallon-Christersson,et al.  Calibration and assessment of channel-specific biases in microarray data with extended dynamical range , 2004, BMC Bioinformatics.

[9]  Joris Vermeesch,et al.  Development of a novel ozone- and photo-stable HyPer5 red fluorescent dye for array CGH and microarray gene expression analysis with consistent performance irrespective of environmental conditions , 2008, BMC biotechnology.

[10]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[11]  Marc L Salit,et al.  Microarray Scanner Performance Over a Five-Week Period as Measured With Cy5 and Cy3 Serial Dilution Slides , 2008, Journal of research of the National Institute of Standards and Technology.

[12]  Chris A. Glasbey,et al.  Statistical estimation of gene expression using multiple laser scans of microarrays , 2006, Bioinform..

[13]  E. Hovig,et al.  Profound influence of microarray scanner characteristics on gene expression ratios: analysis and procedure for correction , 2004, BMC Genomics.

[14]  Frank Stahl,et al.  Impact of microarray technology in nutrition and food research. , 2005, Molecular nutrition & food research.

[15]  T. Scheper,et al.  The Impact of Photobleaching on Microarray Analysis , 2015, Biology.

[16]  Weida Tong,et al.  Reproducible and reliable microarray results through quality control: good laboratory proficiency and appropriate data analysis practices are essential. , 2008, Current opinion in biotechnology.

[17]  João Ricardo Sato,et al.  GEDI: a user-friendly toolbox for analysis of large-scale gene expression data , 2007, BMC Bioinformatics.

[18]  J. V. van Delft,et al.  Application of four dyes in gene expression analyses by microarrays , 2005, BMC Genomics.

[19]  Nicola Cannata,et al.  SaDA: From Sampling to Data Analysis—An Extensible Open Source Infrastructure for Rapid, Robust and Automated Management and Analysis of Modern Ecological High-Throughput Microarray Data , 2015, International journal of environmental research and public health.

[20]  A. Ehrenreich DNA microarray technology for the microbiologist: an overview , 2006, Applied Microbiology and Biotechnology.

[21]  Andrew Williams,et al.  Effects of scanning sensitivity and multiple scan algorithms on microarray data quality , 2010, BMC Bioinformatics.

[22]  Inge Jonassen,et al.  XHM: A system for detection of potential cross hybridizations in DNA microarrays , 2004, BMC Bioinformatics.

[23]  Benoit Macq,et al.  Combining Multiple Laser Scans of Spotted Microarrays by Means of a Two-Way ANOVA Model , 2012, Statistical applications in genetics and molecular biology.

[24]  Monika A. Ciuba,et al.  Photophysical processes in single molecule organic fluorescent probes. , 2014, Chemical Society reviews.

[25]  Jay C. D. Hinton,et al.  BABAR: an R package to simplify the normalisation of common reference design microarray-based transcriptomic datasets , 2010, BMC Bioinformatics.

[26]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..