Computer-Aided Whole-Cell Design: Taking a Holistic Approach by Integrating Synthetic With Systems Biology

Computer-aided design (CAD) for synthetic biology promises to accelerate the rational and robust engineering of biological systems. It requires both detailed, quantitative mathematical and experimental models of the processes needed to (re)design biology, and software tools for genetic engineering and DNA assembly. Ultimately, increased precision in the design phase will have a dramatic impact on the production of designer cells and organisms with bespoke functions and increased modularity. CAD strategies require quantitative models of cells that can capture multiscale processes and link genotypes to phenotypes. Here, we present a perspective on how whole-cell, multiscale models could transform design-build-test-learn cycles in synthetic biology. Through case studies spanning from genome minimization to cell-free systems, we show how these models could significantly aid the design and learn phases while reducing experimental testing. We also discuss several challenges to the realization of our vision. The possibility of describing and building whole cells in silico offers an opportunity to develop increasingly automated, precise and accessible CAD tools and strategies.
