A Novel Knowledge-Driven Systems Biology Approach for Phenotype Prediction upon Genetic Intervention

Deciphering the biological networks underlying complex phenotypic traits, e.g., human disease is undoubtedly crucial to understand the underlying molecular mechanisms and to develop effective therapeutics. Due to the network complexity and the relatively small number of available experiments, data-driven modeling is a great challenge for deducing the functions of genes/proteins in the network and in phenotype formation. We propose a novel knowledge-driven systems biology method that utilizes qualitative knowledge to construct a Dynamic Bayesian network (DBN) to represent the biological network underlying a specific phenotype. Edges in this network depict physical interactions between genes and/or proteins. A qualitative knowledge model first translates typical molecular interactions into constraints when resolving the DBN structure and parameters. Therefore, the uncertainty of the network is restricted to a subset of models which are consistent with the qualitative knowledge. All models satisfying the constraints are considered as candidates for the underlying network. These consistent models are used to perform quantitative inference. By in silico inference, we can predict phenotypic traits upon genetic interventions and perturbing in the network. We applied our method to analyze the puzzling mechanism of breast cancer cell proliferation network and we accurately predicted cancer cell growth rate upon manipulating (anti)cancerous marker genes/proteins.

[1]  Tommi S. Jaakkola,et al.  Combining Location and Expression Data for Principled Discovery of Genetic Regulatory Network Models , 2001, Pacific Symposium on Biocomputing.

[2]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[3]  J. Massagué,et al.  Defective repression of c-myc in breast cancer cells: A loss at the core of the transforming growth factor beta growth arrest program. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Rui Chang,et al.  Consistent Modeling, Integration and Simulation of Molecular Interaction Networks in Space-Time Dimension , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[5]  David Heckerman,et al.  Causal independence for probability assessment and inference using Bayesian networks , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[6]  James M. Roberts,et al.  Cloning of p27 Kip1 , a cyclin-dependent kinase inhibitor and a potential mediator of extracellular antimitogenic signals , 1994, Cell.

[7]  Gregory J. Hannon,et al.  pl5INK4B is a potentia| effector of TGF-β-induced cell cycle arrest , 1994, Nature.

[8]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[9]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[10]  Joshua LaBaer,et al.  Cooperation of the ErbB2 receptor and transforming growth factor beta in induction of migration and invasion in mammary epithelial cells. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Rui Chang,et al.  Quantitative Inference by Qualitative Semantic Knowledge Mining with Bayesian Model Averaging , 2008, IEEE Transactions on Knowledge and Data Engineering.

[12]  Guy Karlebach,et al.  Modelling and analysis of gene regulatory networks , 2008, Nature Reviews Molecular Cell Biology.

[13]  Xiao-Fan Wang,et al.  Transforming growth factor beta induces the cyclin-dependent kinase inhibitor p21 through a p53-independent mechanism. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[14]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[15]  K. Keyomarsi,et al.  Redundant cyclin overexpression and gene amplification in breast cancer cells. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Keiji Kanazawa,et al.  A model for reasoning about persistence and causation , 1989 .

[17]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[18]  Q. Ouyang,et al.  The yeast cell-cycle network is robustly designed. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  A. Iavarone,et al.  Kip/Cip and Ink4 Cdk inhibitors cooperate to induce cell cycle arrest in response to TGF-beta. , 1995, Genes & development.

[20]  Carolyn Gibson,et al.  Genes and development. , 2011, European journal of oral sciences.

[21]  Robert L Sutherland,et al.  Cell cycle control in breast cancer cells , 2006, Journal of cellular biochemistry.

[22]  Zoubin Ghahramani,et al.  A Bayesian approach to reconstructing genetic regulatory networks with hidden factors , 2005, Bioinform..

[23]  Nir Friedman,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004, Science.

[24]  Dirk Husmeier,et al.  Gene Regulatory Network Reconstruction by Bayesian Integration of Prior Knowledge and/or Different Experimental Conditions , 2008, J. Bioinform. Comput. Biol..

[25]  H BRICAIRE,et al.  Journal of Endocrinology , 1939, Nature.

[26]  A. Owen,et al.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae) , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[27]  D. Neal,et al.  Transforming growth factor-beta1 up-regulates p15, p21 and p27 and blocks cell cycling in G1 in human prostate epithelium. , 1999, The Journal of endocrinology.

[28]  J. Baldassare,et al.  RhoA Stimulates p27Kip Degradation through Its Regulation of Cyclin E/CDK2 Activity* , 1999, The Journal of Biological Chemistry.

[29]  W. Marsden I and J , 2012 .

[30]  Satoru Miyano,et al.  Inferring gene networks from time series microarray data using dynamic Bayesian networks , 2003, Briefings Bioinform..

[31]  Haidong Wang,et al.  Discovering molecular pathways from protein interaction and gene expression data , 2003, ISMB.

[32]  Thomas Lengauer,et al.  Pathway analysis in metabolic databases via differetial metabolic display (DMD) , 2000, German Conference on Bioinformatics.

[33]  Monika Heiner,et al.  STEPP - Search Tool for Exploration of Petri net Paths: A new tool for Petri net-based path analysis in biochemical networks , 2004, Silico Biol..

[34]  Robert L Sutherland,et al.  Lack of relationship between CDK activity and G1 cyclin expression in breast cancer cells , 1998, Oncogene.

[35]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[36]  Rui Chang,et al.  Modeling semantics of inconsistent qualitative knowledge for quantitative Bayesian network inference , 2008, Neural Networks.

[37]  J. Silvio Gutkind,et al.  Signaling networks and cell cycle control : the molecular basis of cancer and other diseases , 2000 .

[38]  Albert Goldbeter,et al.  Circadian rhythms and molecular noise. , 2006, Chaos.

[39]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[40]  V. N. Reddy,et al.  Qualitative analysis of biochemical reaction systems , 1996, Comput. Biol. Medicine.

[41]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[42]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[43]  Denis Thieffry,et al.  Petri net modelling of biological regulatory networks , 2008, J. Discrete Algorithms.

[44]  Jérôme Eeckhoute,et al.  A cell-type-specific transcriptional network required for estrogen regulation of cyclin D1 and cell cycle progression in breast cancer. , 2006, Genes & development.

[45]  J. Massagué,et al.  Cytostatic and apoptotic actions of TGF-beta in homeostasis and cancer. , 2003, Nature reviews. Cancer.

[46]  Hong Zhou,et al.  Involvement of Ras Activation in Human Breast Cancer Cell Signaling, Invasion, and Anoikis , 2004, Cancer Research.

[47]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[48]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[49]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[50]  Christian Borgelt,et al.  Local Structure Learning in Graphical Models , 2003, Planning Based on Decision Theory.

[51]  J. Massagué,et al.  E2F4/5 and p107 as Smad Cofactors Linking the TGFβ Receptor to c-myc Repression , 2002, Cell.

[52]  Qun Zhou,et al.  Inhibition of cyclin D expression in human breast carcinoma cells by retinoids in vitro , 1997, Oncogene.

[53]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[54]  J. Massagué,et al.  Myc suppression of the p21Cip1 Cdk inhibitor influences the outcome of the p53 response to DNA damage , 2002, Nature.

[55]  Jens Timmer,et al.  Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge , 2007, BMC Systems Biology.

[56]  Mark Craven,et al.  Connecting quantitative regulatory-network models to the genome , 2007, ISMB/ECCB.

[57]  Carsten Peterson,et al.  Random Boolean network models and the yeast transcriptional network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[58]  J. Massagué,et al.  TGFβ influences Myc, Miz-1 and Smad to control the CDK inhibitor p15INK4b , 2001, Nature Cell Biology.

[59]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.