Identification of biomarker genes for resistance to a pathogen by a novel method for meta-analysis of single-channel microarray datasets

The search for fast and reliable methods for extracting biomarker genes, e.g. those responsible for a plant's resistance to a particular pathogen, is one of the most important and most intensively studied data mining problems in bioinformatics. Here we describe a simple and efficient method for combining results from multiple single-channel microarray experiments in a meta-analysis. The new technique uses fuzzy set logic for the initial gene selection and the machine learning algorithm AdaBoost to retrieve the set of genes whose expression profiles differ most between the resistant and susceptible classes. As a proof of concept, we applied the method to a gene expression dataset composed of many independent microarray experiments on wheat head tissue, performed on wheat lines with various levels of resistance, to identify genes that are biomarkers of resistance to the fungus Fusarium graminearum. The resulting set of genes was validated by qPCR experiments.
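
The two-stage idea (a fuzzy pre-selection followed by boosting) can be illustrated with a minimal sketch. The abstract does not specify the membership functions, thresholds, or the AdaBoost configuration, so everything below is an assumption made for illustration: the piecewise-linear membership in a fuzzy set "highly expressed", the cutoff of 0.1, the decision-stump weak learner, and the toy expression matrix are all hypothetical and are not the authors' implementation.

```python
import math

def fuzzy_high(x, lo, hi):
    """Piecewise-linear membership in the fuzzy set 'highly expressed' (assumed form)."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)

def fuzzy_score(values, labels, lo, hi):
    """Absolute difference in mean membership between the two classes."""
    res = [fuzzy_high(v, lo, hi) for v, y in zip(values, labels) if y == 1]
    sus = [fuzzy_high(v, lo, hi) for v, y in zip(values, labels) if y == -1]
    return abs(sum(res) / len(res) - sum(sus) / len(sus))

def best_stump(X, y, w, genes):
    """Find the single-gene threshold rule with the lowest weighted error."""
    best = None
    for g in genes:
        col = [row[g] for row in X]
        vals = sorted(set(col))
        # Candidate thresholds are midpoints between consecutive distinct values.
        for t in [(a + b) / 2 for a, b in zip(vals, vals[1:])]:
            for sign in (1, -1):
                pred = [sign if v > t else -sign for v in col]
                err = sum(wi for wi, p, yi in zip(w, pred, y) if p != yi)
                if best is None or err < best[0]:
                    best = (err, g, pred)
    return best

def adaboost_gene_ranking(X, y, genes, rounds=5):
    """Rank genes by the total weight (alpha) of the stumps that use them."""
    n = len(y)
    w = [1.0 / n] * n
    importance = {}
    for _ in range(rounds):
        err, g, pred = best_stump(X, y, w, genes)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)
        importance[g] = importance.get(g, 0.0) + alpha
        # Increase the weight of misclassified samples, then renormalize.
        w = [wi * math.exp(-alpha * yi * p) for wi, yi, p in zip(w, y, pred)]
        total = sum(w)
        w = [wi / total for wi in w]
    return sorted(importance, key=importance.get, reverse=True)

# Hypothetical toy data: rows are samples, columns are genes (log-scale intensities).
X = [
    [9.0, 5.0, 7.1, 6.0],
    [8.5, 5.2, 6.9, 7.0],
    [9.2, 4.9, 7.0, 4.0],
    [3.0, 5.1, 7.2, 3.0],
    [2.5, 5.0, 6.8, 5.0],
    [3.2, 4.8, 7.0, 2.0],
]
y = [1, 1, 1, -1, -1, -1]  # 1 = resistant, -1 = susceptible

# Stage 1: keep genes whose fuzzy class difference exceeds an assumed cutoff.
selected = [g for g in range(len(X[0]))
            if fuzzy_score([row[g] for row in X], y, lo=2.0, hi=10.0) >= 0.1]

# Stage 2: rank the surviving genes with boosted decision stumps.
ranking = adaboost_gene_ranking(X, y, selected)
print(selected, ranking)
```

In this sketch the fuzzy stage only discards genes that behave alike in both classes, and the boosting stage orders the remaining genes by how much they contribute to separating resistant from susceptible samples. A gene whose column is constant yields no candidate thresholds and is simply skipped by `best_stump`.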
