Using directed information to build biologically relevant influence networks.

The systematic inference of biologically relevant influence networks remains a challenging problem in computational biology. Even though the availability of high-throughput data has enabled the use of probabilistic models to infer the plausible structure of such networks, their true interpretation of the biology of the process is questionable. In this work, we propose a network inference methodology, based on the directed information (DTI) criterion, which incorporates the biology of transcription within the framework, so as to enable experimentally verifiable inference. We use publicly available embryonic kidney and T-cell microarray datasets to demonstrate our results. We present two variants of network inference via DTI (supervised and unsupervised) and the inferred networks relevant to mammalian nephrogenesis as well as T-cell activation. We demonstrate the conformity of the obtained interactions with literature as well as comparison with the coefficient of determination (CoD) method. Apart from network inference, the proposed framework enables the exploration of specific interactions, not just those revealed by data.

[1]  H. Marko,et al.  The Bidirectional Communication Theory - A Generalization of Information Theory , 1973, IEEE Transactions on Communications.

[2]  J. Geweke,et al.  Measurement of Linear Dependence and Feedback between Multiple Time Series , 1982 .

[3]  H. Joe Relative Entropy Measures of Multivariate Dependence , 1989 .

[4]  J. Massey CAUSALITY, FEEDBACK AND DIRECTED INFORMATION , 1990 .

[5]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[6]  G. Dressler,et al.  Pax-2 is a DNA-binding protein expressed in embryonic kidney and Wilms tumor. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[7]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[8]  Gavin Kelsey,et al.  Characterization of the mouse HNF-4 gene and its expression during mouse embryogenesis , 1994, Mechanisms of Development.

[9]  J. Morris,et al.  Repression of Pax-2 by WT1 during normal kidney development. , 1995, Development.

[10]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[11]  J. O. Ramsay,et al.  Functional Data Analysis (Springer Series in Statistics) , 1997 .

[12]  V. Sukhatme,et al.  Sp1 Is a Critical Regulator of the Wilms' tumor-1 Gene* , 1997, The Journal of Biological Chemistry.

[13]  D. H. Zhang,et al.  Differential responsiveness of the IL-5 and IL-4 genes to transcription factor GATA-3. , 1998, Journal of immunology.

[14]  James Douglas Engel,et al.  Localization of Distant Urogenital System-, Central Nervous System-, and Endocardium-Specific Transcriptional Regulatory Elements in the GATA-3 Locus , 1999, Molecular and Cellular Biology.

[15]  Igor Vajda,et al.  Estimation of the Information by an Adaptive Partitioning of the Observation Space , 1999, IEEE Trans. Inf. Theory.

[16]  William Bialek,et al.  Entropy and Inference, Revisited , 2001, NIPS.

[17]  J Davies,et al.  Intracellular and extracellular regulation of ureteric bud morphogenesis , 2000, Journal of anatomy.

[18]  G. Dressler,et al.  Regulation of ureteric bud outgrowth by Pax2-dependent activation of the glial derived neurotrophic factor gene. , 2001, Development.

[19]  R. Blomhoff,et al.  Gene expression regulation by retinoic acid Published, JLR Papers in Press, August 16, 2002. DOI 10.1194/jlr.R100015-JLR200 , 2002, Journal of Lipid Research.

[20]  B. De Moor,et al.  Toucan: deciphering the cis-regulatory logic of coregulated genes. , 2003, Nucleic acids research.

[21]  Bruce J Aronow,et al.  A catalogue of gene expression in the developing kidney. , 2003, Kidney international.

[22]  Erik G. Miller A new class of entropy estimators for multi-dimensional densities , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[23]  Liam Paninski,et al.  Estimation of Entropy and Mutual Information , 2003, Neural Computation.

[24]  R. O. Stuart,et al.  Changes in gene expression patterns in the ureteric bud and metanephric mesenchyme in models of kidney development. , 2003, Kidney international.

[25]  Seppo Vainio,et al.  Wnt11 and Ret/Gdnf pathways cooperate in regulating ureteric branching during metanephric kidney development , 2003, Development.

[26]  Michael J. Berry,et al.  Network information and connected correlations. , 2003, Physical review letters.

[27]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[28]  H. A. Rogoff,et al.  Apoptosis Associated with Deregulated E2F Activity Is Dependent on E2F1 and Atm/Nbs1/Chk2 , 2004, Molecular and Cellular Biology.

[29]  Michael L. Bittner,et al.  Growing genetic regulatory networks from seed genes , 2004, Bioinform..

[30]  Gabriel Kreiman,et al.  Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes. , 2004, Nucleic acids research.

[31]  Ilya Nemenman Information theory, multivariate dependence, and genetic network inference , 2004, ArXiv.

[32]  Zoubin Ghahramani,et al.  Modeling T-cell activation using gene expression profiling and state-space models , 2004, Bioinform..

[33]  Melissa J Davis,et al.  J Am Soc Nephrol 15: 2344–2357, 2004 Identifying the Molecular Phenotype of Renal Progenitor Cells , 2022 .

[34]  J. D. Engel,et al.  Multiple, Distant Gata2 Enhancers Specify Temporally and Tissue-Specific Patterning in the Developing Urogenital System , 2004, Molecular and Cellular Biology.

[35]  E. Learned-Miller Hyperspacings and the estimation of information theoretic quantities , 2004 .

[36]  Robert D. Nowak,et al.  Complexity-regularized multiresolution density estimation , 2004, International Symposium onInformation Theory, 2004. ISIT 2004. Proceedings..

[37]  Ivan Ovcharenko,et al.  ECR Browser: a tool for visualizing and accessing data from comparisons of multiple vertebrate genomes , 2004, Nucleic Acids Res..

[38]  J. Ingelfinger,et al.  Angiotensin II increases Pax-2 expression in fetal kidney cells via the AT2 receptor. , 2004, Journal of the American Society of Nephrology : JASN.

[39]  Dmitri Papatsenko,et al.  Computational identification of regulatory DNAs underlying animal development , 2005, Nature Methods.

[40]  Shereen Ezzat,et al.  Ikaros integrates endocrine and immune system development. , 2005, The Journal of clinical investigation.

[41]  S. Grimmond,et al.  Temporal and spatial transcriptional programs in murine kidney development. , 2005, Physiological genomics.

[42]  Zoubin Ghahramani,et al.  A Bayesian approach to reconstructing genetic regulatory networks with hidden factors , 2005, Bioinform..

[43]  Korbinian Strimmer,et al.  An empirical Bayes approach to inferring large-scale gene association networks , 2005, Bioinform..

[44]  Peter J. Woolf,et al.  Bayesian analysis of signaling networks governing embryonic stem cell fate decisions , 2005, Bioinform..

[45]  Alfred O. Hero,et al.  Inference of Biologically Relevant Gene Influence Networks Using the Directed Information Criterion , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[46]  M. Busslinger,et al.  Pax2/8-regulated Gata3 expression is necessary for morphogenesis and guidance of the nephric duct in the developing kidney , 2006, Development.

[47]  John A. Gubner Probability and Random Processes for Electrical and Computer Engineers , 2006 .

[48]  Ernest Fraenkel,et al.  Practical Strategies for Discovering Regulatory DNA Sequence Motifs , 2006, PLoS Comput. Biol..

[49]  G. Dressler,et al.  Regulation of c-Ret in the developing kidney is responsive to Pax2 gene dosage. , 2006, Human molecular genetics.

[50]  Matthias Kretzler,et al.  Comparative promoter analysis allows de novo identification of specialized cell junction-associated proteins. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Korbinian Strimmer,et al.  USING REGULARIZED DYNAMIC CORRELATION TO INFER GENE DEPENDENCY NETWORKS FROM TIME-SERIES MICROARRAY DATA , 2006 .

[52]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[53]  J. E. Hudson,et al.  Signal Processing Using Mutual Information , 2006, IEEE Signal Processing Magazine.

[54]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[55]  Ramji Venkataramanan,et al.  Source Coding With Feed-Forward: Rate-Distortion Theorems and Error Exponents for a General Source , 2007, IEEE Transactions on Information Theory.

[56]  Huai Li,et al.  Analysis of Gene Coexpression by B-Spline Based CoD Estimation , 2007, EURASIP J. Bioinform. Syst. Biol..