Inferring weighted and directed gene interaction networks from gene expression data using the phi-mixing coefficient

In this paper, we present a new algorithm for reverse-engineering gene interaction networks (GINs) from expression data, using the so-called phi-mixing coefficient between two random variables. Unlike existing methods, the GINs constructed using the algorithm presented here have edges that are both directed and weighted. The GIN constructed is, in a very specific sense, a minimal network that is compatible with the data. Several GINs have been constructed for various data sets in lung cancer, ovarian cancer and melanoma. Lung cancer and melanoma networks have been validated by comparing their predictions against the output of ChIP-seq data. The neighbors of three transcription factors (ASCL1, PPARG and NKX2-1) in lung cancer, and one transcription factor SOX10 in melanoma, are significantly enriched with ChIP-seq genes compared to pure chance.

[1]  Diego di Bernardo,et al.  Reverse Engineering and Analysis of Genome-Wide Gene Regulatory Networks from Gene Expression Profiles Using High-Performance Computing , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Nir Friedman,et al.  Context-Specific Bayesian Clustering for Gene Expression Data , 2002, J. Comput. Biol..

[3]  Peng Qiu,et al.  Reducing the Computational Complexity of Information Theoretic Approaches for Reconstructing Gene Regulatory Networks , 2010, J. Comput. Biol..

[4]  Mathukumalli Vidyasagar,et al.  Mixing Coefficients Between Discrete and Real Random Variables: Computation and Properties , 2012, IEEE Transactions on Automatic Control.

[5]  E. Dougherty,et al.  Inferring Connectivity of Genetic Regulatory Networks Using Information-Theoretic Criteria , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[7]  Thomas M. Cover,et al.  Elements of Information Theory: Cover/Elements of Information Theory, Second Edition , 2005 .

[8]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[9]  Mariano J. Alvarez,et al.  Genome-wide Identification of Post-translational Modulators of Transcription Factor Activity in Human B-Cells , 2009, Nature Biotechnology.

[10]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[11]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[12]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[13]  I. Ibragimov,et al.  Some Limit Theorems for Stationary Processes , 1962 .

[14]  Atul J. Butte,et al.  Mutual information relevance networks , 2000 .

[15]  Martin A. Nowak,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004 .