Mathematical and Statistical Modeling in Cancer Systems Biology

Cancer is a major health problem with high mortality rates. In the post-genome era, investigators have access to massive amounts of rapidly accumulating high-throughput data in publicly available databases, some of which are exclusively devoted to housing Cancer data. However, data interpretation efforts have not kept pace with data collection, and gained knowledge is not necessarily translating into better diagnoses and treatments. A fundamental problem is to integrate and interpret data to further our understanding in Cancer Systems Biology. Viewing cancer as a network provides insights into the complex mechanisms underlying the disease. Mathematical and statistical models provide an avenue for cancer network modeling. In this article, we review two widely used modeling paradigms: deterministic metabolic models and statistical graphical models. The strength of these approaches lies in their flexibility and predictive power. Once a model has been validated, it can be used to make predictions and generate hypotheses. We describe a number of diverse applications to Cancer Biology, including, the system-wide effects of drug-treatments, disease prognosis, tumor classification, forecasting treatment outcomes, and survival predictions.

[1]  Yang Li,et al.  Critical reasoning on causal inference in genome-wide linkage and association studies. , 2010, Trends in genetics : TIG.

[2]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[3]  Mike West,et al.  Bayesian Learning in Sparse Graphical Factor Models via Variational Mean-Field Annealing , 2010, J. Mach. Learn. Res..

[4]  David Edwards,et al.  Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests , 2010, BMC Bioinformatics.

[5]  D. Calvetti,et al.  Large-scale Bayesian parameter estimation for a three-compartment cardiac metabolism model during ischemia , 2006 .

[6]  Jinghui Zhang,et al.  Rrp1b, a New Candidate Susceptibility Gene for Breast Cancer Progression and Metastasis , 2007, PLoS genetics.

[7]  Francis S. Collins,et al.  Mapping the cancer genome , 2007 .

[8]  M. V. Heiden,et al.  Targeting cancer metabolism: a therapeutic window opens , 2011, Nature Reviews Drug Discovery.

[9]  A. Hoffmann,et al.  The I (cid:1) B –NF-(cid:1) B Signaling Module: Temporal Control and Selective Gene Activation , 2022 .

[10]  Jingyuan Fu,et al.  Defining gene and QTL networks. , 2009, Current opinion in plant biology.

[11]  Maulik Shukla,et al.  Data Integration for Dynamic and Sustainable Systems Biology Resources: Challenges and Lessons Learned , 2010, Chemistry & biodiversity.

[12]  Ilya Shmulevich,et al.  On Learning Gene Regulatory Networks Under the Boolean Network Model , 2003, Machine Learning.

[13]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[14]  O. Warburg [Origin of cancer cells]. , 1956, Oncologia.

[15]  P Haddawy,et al.  Preliminary investigation of a Bayesian network for mammographic diagnosis of breast cancer. , 1995, Proceedings. Symposium on Computer Applications in Medical Care.

[16]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[17]  Monica L. Mo,et al.  Global reconstruction of the human metabolic network based on genomic and bibliomic data , 2007, Proceedings of the National Academy of Sciences.

[18]  Roded Sharan,et al.  Genome-Scale Metabolic Modeling Elucidates the Role of Proliferative Adaptation in Causing the Warburg Effect , 2011, PLoS Comput. Biol..

[19]  G. Giaccone,et al.  Drug Development: Portals of Discovery , 2012, Clinical Cancer Research.

[20]  A. Jemal,et al.  Cancer statistics, 2011 , 2011, CA: a cancer journal for clinicians.

[21]  Bart De Moor,et al.  Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks , 2006, ISMB.

[22]  Satoru Miyano,et al.  Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[23]  Oliver Medvedik,et al.  Stimulus Specificity of Gene Expression Programs Determined by Temporal Control of IKK Activity , 2013 .

[24]  Daniela Calvetti,et al.  Large-Scale Statistical Parameter Estimation in Complex Systems with an Application to Metabolic Models , 2006, Multiscale Model. Simul..

[25]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[26]  M. West,et al.  Gene expression profiling and genetic markers in glioblastoma survival. , 2005, Cancer research.

[27]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[28]  Mauro Ferrari,et al.  Mathematical modeling of cancer progression and response to chemotherapy , 2006, Expert review of anticancer therapy.

[29]  Kamil Erguler,et al.  Practical limits for reverse engineering of dynamical systems: a statistical analysis of sensitivity and parameter inferability in systems biology models. , 2011, Molecular bioSystems.

[30]  G. Wagner,et al.  The road to modularity , 2007, Nature Reviews Genetics.

[31]  H. Pelicano,et al.  Cancer metabolism: is glutamine sweeter than glucose? , 2010, Cancer cell.

[32]  Alexander Schliep,et al.  Clustering cancer gene expression data: a comparative study , 2008, BMC Bioinformatics.

[33]  Peter K. Sorger,et al.  Logic-Based Models for the Analysis of Cell Signaling Networks† , 2010, Biochemistry.

[34]  D. Husmeier,et al.  Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge , 2007, Statistical applications in genetics and molecular biology.

[35]  B. Yandell,et al.  CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. , 2010, The annals of applied statistics.

[36]  Bernhard O. Palsson,et al.  Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions , 2000, BMC Bioinformatics.

[37]  C. E. Kahn,et al.  A Bayesian network for diagnosis of primary bone tumors , 2001, Journal of Digital Imaging.

[38]  Juan Eloy Ruiz-Castro,et al.  Non‐homogeneous Markov models in the analysis of survival after breast cancer , 2001 .

[39]  E. Ruppin,et al.  Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism , 2010, Molecular systems biology.

[40]  Lincoln Stein,et al.  Reactome knowledgebase of human biological pathways and processes , 2008, Nucleic Acids Res..

[41]  Sach Mukherjee,et al.  Network inference using informative priors , 2008, Proceedings of the National Academy of Sciences.

[42]  D. Kerr,et al.  Predictive biomarkers: a paradigm shift towards personalized cancer medicine , 2011, Nature Reviews Clinical Oncology.

[43]  Bruce Tidor,et al.  Sloppy models, parameter uncertainty, and the role of experimental design. , 2010, Molecular bioSystems.

[44]  Y. Leea,et al.  Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target , 2006 .

[45]  S. Weinhouse,et al.  Metabolism of neoplastic tissue. IV. A study of lipid synthesis in neoplastic tissue slices in vitro. , 1953, Cancer research.

[46]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[47]  Satoru Miyano,et al.  Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks , 2004, J. Bioinform. Comput. Biol..

[48]  Karthik Devarajan,et al.  Synthetic Lethal Screen of an EGFR-Centered Network to Improve Targeted Therapies , 2010, Science Signaling.

[49]  L. Goodman,et al.  NITROGEN MUSTARD THERAPY: Use of Methyl-Bis(Beta-Chloroethyl)amine Hydrochloride and Tris(Beta-Chloroethyl)amine Hydrochloride for Hodgkin's Disease, Lymphosarcoma, Leukemia and Certain Allied and Miscellaneous Disorders , 1946 .

[50]  M. Rockman,et al.  Reverse engineering the genotype–phenotype map with natural genetic variation , 2008, Nature.

[51]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[52]  Nicholas Eriksson,et al.  Conjunctive Bayesian networks , 2006, math/0608417.

[53]  M. Tomita,et al.  Enhancing apoptosis in TRAIL-resistant cancer cells using fundamental response rules , 2011, Scientific reports.

[54]  George E. P. Box,et al.  Empirical Model‐Building and Response Surfaces , 1988 .

[55]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[56]  M. West,et al.  High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics , 2008, Journal of the American Statistical Association.

[57]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[58]  R. Spiteri,et al.  A comparison of non-standard solvers for ODEs describing cellular reactions in the heart , 2007, Computer methods in biomechanics and biomedical engineering.

[59]  Eleni Bazigou,et al.  Cell signaling and cancer , 2007, Genome Biology.

[60]  Xingming Zhao,et al.  Computational Systems Biology , 2013, TheScientificWorldJournal.

[61]  Howard A. Levine,et al.  Partial differential equations of chemotaxis and angiogenesis , 2001 .

[62]  Scott W. Lowe,et al.  Apoptosis A Link between Cancer Genetics and Chemotherapy , 2002, Cell.

[63]  Ron Korstanje,et al.  A Bayesian Framework for Inference of the Genotype–Phenotype Map for Segregating Populations , 2011, Genetics.

[64]  Kumar Selvarajoo,et al.  Macroscopic law of conservation revealed in the population dynamics of Toll-like receptor signaling , 2011, Cell Communication and Signaling.

[65]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[66]  Ron Edgar,et al.  Gene Expression Omnibus ( GEO ) : Microarray data storage , submission , retrieval , and analysis , 2008 .

[67]  S. Weinhouse,et al.  Metabolism of neoplastic tissue. III. Diphosphopyridine nucleotide requirements for oxidations by mitochondria of neoplastic and non-neoplastic tissues. , 1953, Cancer research.

[68]  Oliver E. Sturm,et al.  Computational modelling of the receptor-tyrosine-kinase-activated MAPK pathway. , 2005, The Biochemical journal.

[69]  Erwin P. Gianchandani,et al.  Flux balance analysis in the era of metabolomics , 2006, Briefings Bioinform..

[70]  Allan Balmain,et al.  Network analysis of skin tumor progression identifies a rewired genetic architecture affecting inflammation and tumor susceptibility , 2011, Genome Biology.

[71]  M. Ookhtens,et al.  Liver and adipose tissue contributions to newly formed fatty acids in an ascites tumor. , 1984, The American journal of physiology.

[72]  Jason A. Papin,et al.  Applications of genome-scale metabolic reconstructions , 2009, Molecular systems biology.

[73]  R K Jain,et al.  Physiologically based pharmacokinetic modeling: principles and applications. , 1983, Journal of pharmaceutical sciences.

[74]  B Ribba,et al.  A multiscale mathematical model of avascular tumor growth to investigate the therapeutic benefit of anti-invasive agents. , 2006, Journal of theoretical biology.

[75]  B. Palsson,et al.  Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. , 2003, Genome research.

[76]  R. Milo,et al.  Oscillations and variability in the p53 system , 2006, Molecular systems biology.

[77]  Michael Baudis,et al.  Progenetix.net: an online repository for molecular cytogenetic aberration data , 2001, Bioinform..

[78]  D. Koller,et al.  From signatures to models: understanding cancer using microarrays , 2005, Nature Genetics.

[79]  Jinghui Zhang,et al.  Rrp 1 b , a New Candidate Susceptibility Gene for Breast Cancer Progression and Metastasis , 2007 .

[80]  Niko Beerenwinkel,et al.  Quantifying cancer progression with conjunctive Bayesian networks , 2009, Bioinform..

[81]  M E Andersen,et al.  Estimating the risk of liver cancer associated with human exposures to chloroform using physiologically based pharmacokinetic modeling. , 1990, Toxicology and applied pharmacology.

[82]  H. Varmus,et al.  Science funding: Provocative questions in cancer research , 2012, Nature.

[83]  A. Friedman MATHEMATICAL ANALYSIS AND CHALLENGES ARISING FROM MODELS OF TUMOR GROWTH , 2007 .

[84]  D. Remington,et al.  Effects of Genetic and Environmental Factors on Trait Network Predictions From Quantitative Trait Locus Data , 2009, Genetics.

[85]  D. Baltimore,et al.  Achieving Stability of Lipopolysaccharide-Induced NF-κB Activation , 2005, Science.

[86]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[87]  N. V. Zhukov,et al.  Targeted therapy in the treatment of solid tumors: Practice contradicts theory , 2008, Biochemistry (Moscow).

[88]  Lawrence F. Shampine,et al.  Solving ODEs with MATLAB , 2002 .

[89]  Dong-Yup Lee,et al.  Genome-scale modeling and in silico analysis of mouse cell metabolic network. , 2009, Molecular bioSystems.

[90]  K. Aldape,et al.  Integrated array-comparative genomic hybridization and expression array profiles identify clinically relevant molecular subtypes of glioblastoma. , 2005, Cancer research.