Applying Linear Models to Learn Regulation Programs in a Transcription Regulatory Module Network

The module network method has been widely used to infer transcriptional regulatory network from gene expression data. A common strategy of module network learning algorithms is to apply regression trees to infer the regulation program of a module. In this work we propose to apply linear models to fulfill this task. The novelty of our method is to extract the contrast in which a module's genes are most significantly differentially expressed. Consequently, the process of learning the regulation program for the module becomes one of identifying transcription factors that are also differentially expressed in this contrast. The effectiveness of our algorithm is demonstrated by the experiments in a yeast benchmark dataset.

[1]  Jing Li,et al.  Regulatory module network of basic/helix-loop-helix transcription factors in mouse brain , 2007, Genome Biology.

[2]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[3]  Alexandre P. Francisco,et al.  YEASTRACT-DISCOVERER: new tools to improve the analysis of transcriptional regulatory associations in Saccharomyces cerevisiae , 2007, Nucleic Acids Res..

[4]  Martin A. Nowak,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004 .

[5]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[6]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[7]  T. Cooper,et al.  The Level of DAL80 Expression Down-Regulates GATA Factor-Mediated Transcription inSaccharomyces cerevisiae , 2000, Journal of bacteriology.

[8]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[9]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[10]  Nir Friedman,et al.  Learning Module Networks , 2002, J. Mach. Learn. Res..

[11]  Gregory Butler,et al.  A regression tree-based Gibbs sampler to learn the regulation programs in a transcription regulatory module network , 2010, 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  V. Barnett,et al.  Applied Linear Statistical Models , 1975 .

[14]  Kathleen Marchal,et al.  Module networks revisited: computational assessment and prioritization of model predictions , 2009, Bioinform..

[15]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[16]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[17]  Yves Van de Peer,et al.  Analysis of a Gibbs sampler method for model-based clustering of gene expression data , 2008, Bioinform..

[18]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[19]  C. Kaiser,et al.  Nitrogen regulation in Saccharomyces cerevisiae. , 2002, Gene.