Hunting drug targets by systems-level modeling of gene expression profiles

Structural learning of Bayesian networks applied to sets of genome-wide expression patterns has been recently discovered as a potentially useful tool for the systems-level statistical description of gene interactions. We train and analyze Bayesian networks with the goal of inferring biological aspects of gene function. Our two-component approach focuses on supporting the drug discovery process by identifying genes with central roles for the network operation, which could act as drug targets. The first component, referred to as scale-free analysis, uses topological measures of the network-related to a high-traffic load of genes-as estimators for their functional importance. The second component, referred to as generative inverse modeling, is a method of estimating the effect of a simulated drug treatment or mutation on the global state of the network, as measured in the expression profile. We show for a dataset from acute lymphoblastic leukemia patients that both approaches are suitable for finding genes with central cellular functions. In addition, generative inverse modeling correctly identifies a known oncogene in a purely data-driven way.

[1]  Satoru Miyano,et al.  Estimation of Genetic Networks and Functional Structures Between Genes by Using Bayesian Networks and Nonparametric Regression , 2001, Pacific Symposium on Biocomputing.

[2]  Adilson E Motter,et al.  Range-based attack on links in scale-free networks: are long-range links responsible for the small-world phenomenon? , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Werner Dubitzky,et al.  Multiclass Cancer Classification Using Gene Expression Profiling and Probabilistic Neural Networks , 2002, Pacific Symposium on Biocomputing.

[4]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.

[5]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[6]  Werner Dubitzky,et al.  A Practical Approach to Microarray Data Analysis , 2003, Springer US.

[7]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[8]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[9]  C. Murre,et al.  The role of E2A-PBX1 in leukemogenesis , 2001, Oncogene.

[10]  Mathäus Dejori,et al.  Identifying Interventional and Pathogenic Mechanisms by Generative Inverse Modeling of Gene Expression Profiles , 2004, J. Comput. Biol..

[11]  C. Murre,et al.  Pbx1 is converted into a transcriptional activator upon acquiring the N-terminal region of E2A in pre-B-cell acute lymphoblastoid leukemia. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Anton Schwaighofer,et al.  Mining functional modules in genetic networks with decomposable graphical models. , 2004, Omics : a journal of integrative biology.

[13]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Harald Steck,et al.  Constraint-based structural learning in Bayesian networks using finite data sets , 2001 .

[15]  R Dulbecco,et al.  A turning point in cancer research: sequencing the human genome. , 1986, Science.

[16]  S. P. Fodor,et al.  Multiplexed biochemical assays with biological chips , 1993, Nature.

[17]  Gregory F. Cooper,et al.  A Bayesian Method for Constructing Bayesian Belief Networks from Databases , 1991, UAI.

[18]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[19]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[20]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[21]  Abdelhak M. Zoubir,et al.  Bootstrap: theory and applications , 1993, Optics & Photonics.

[22]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[23]  Nir Friedman,et al.  Inferring subnetworks from perturbed expression profiles , 2001, ISMB.

[24]  R. Pomerantz,et al.  RNA interference: on the road to an alternate therapeutic strategy! , 2003, Reviews in medical virology.

[25]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[26]  Satoru Miyano,et al.  Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks , 2004, J. Bioinform. Comput. Biol..

[27]  Rina Dechter,et al.  Bucket elimination: A unifying framework for probabilistic inference , 1996, UAI.

[28]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[29]  Tommi S. Jaakkola,et al.  Using Graphical Models and Genomic Expression Data to Statistically Validate Models of Genetic Regulatory Networks , 2000, Pacific Symposium on Biocomputing.

[30]  Gustavo Deco,et al.  Large-Scale Computational Modeling of Genetic Regulatory Networks , 2003, Artificial Intelligence Review.

[31]  César A. Hidalgo,et al.  Scale-free networks , 2008, Scholarpedia.

[32]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[33]  Mathäus Dejori,et al.  Bayesian Inference of Genetic Networks from Gene-Expression-Data: Convergence and Reliability , 2003, IC-AI.

[34]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.