An Automated Phenotype-Driven Approach (GeneForce) for Refining Metabolic and Regulatory Models

Integrated constraint-based metabolic and regulatory models can accurately predict cellular growth phenotypes arising from genetic and environmental perturbations. Challenges in constructing such models involve the limited availability of information about transcription factor—gene target interactions and computational methods to quickly refine models based on additional datasets. In this study, we developed an algorithm, GeneForce, to identify incorrect regulatory rules and gene-protein-reaction associations in integrated metabolic and regulatory models. We applied the algorithm to refine integrated models of Escherichia coli and Salmonella typhimurium, and experimentally validated some of the algorithm's suggested refinements. The adjusted E. coli model showed improved accuracy (∼80.0%) for predicting growth phenotypes for 50,557 cases (knockout mutants tested for growth in different environmental conditions). In addition to identifying needed model corrections, the algorithm was used to identify native E. coli genes that, if over-expressed, would allow E. coli to grow in new environments. We envision that this approach will enable the rapid development and assessment of genome-scale metabolic and regulatory network models for less characterized organisms, as such models can be constructed from genome annotations and cis-regulatory network predictions.

[1]  B. Palsson,et al.  Genome-scale models of microbial cells: evaluating the consequences of constraints , 2004, Nature Reviews Microbiology.

[2]  Michael C. Jewett,et al.  Linking high-resolution metabolic flux phenotypes and transcriptional regulation in yeast modulated by the global regulator Gcn4p , 2009, Proceedings of the National Academy of Sciences.

[3]  E. Lin,et al.  Constitutive activation of the fucAO operon and silencing of the divergently transcribed fucPIK operon by an IS5 element in Escherichia coli mutants selected for growth on L-1,2-propanediol , 1989, Journal of bacteriology.

[4]  B. Palsson,et al.  Constraints-based models: regulation of gene expression reduces the steady-state solution space. , 2003, Journal of theoretical biology.

[5]  B. Hove-Jensen,et al.  d-Allose Catabolism ofEscherichia coli: Involvement of alsI and Regulation of als Regulon Expression by Allose and Ribose , 1999, Journal of bacteriology.

[6]  Hideyuki Suzuki,et al.  γ-Glutamylputrescine Synthetase in the Putrescine Utilization Pathway of Escherichia coli K-12* , 2008, Journal of Biological Chemistry.

[7]  R. Cooper,et al.  Two ribose-5-phosphate isomerases from Escherichia coli K12: partial characterisation of the enzymes and consideration of their possible physiological roles. , 1975, European journal of biochemistry.

[8]  Stefan Bornholdt,et al.  Boolean network models of cellular regulation: prospects and limitations , 2008, Journal of The Royal Society Interface.

[9]  Hirotada Mori,et al.  Functional analysis of 1440 Escherichia coli genes using the combination of knock-out library and phenotype microarrays. , 2005, Metabolic engineering.

[10]  Markus J. Herrgård,et al.  Network-based prediction of human tissue-specific metabolism , 2008, Nature Biotechnology.

[11]  B. Palsson,et al.  Metabolic modelling of microbes: the flux-balance approach. , 2002, Environmental microbiology.

[12]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[13]  Bernhard O. Palsson,et al.  Iterative Reconstruction of Transcriptional Regulatory Networks: An Algorithmic Approach , 2006, PLoS Comput. Biol..

[14]  P. Dimroth,et al.  The Escherichia coli Citrate Carrier CitT: a Member of a Novel Eubacterial Transporter Family Related to the 2-Oxoglutarate/Malate Translocator from Spinach Chloroplasts , 1998, Journal of bacteriology.

[15]  Christopher H. Bryant,et al.  Functional genomic hypothesis generation and experimentation by a robot scientist , 2004, Nature.

[16]  L. Aravind,et al.  Reconstructing prokaryotic transcriptional regulatory networks: lessons from actinobacteria , 2009, Journal of biology.

[17]  R. Sharan,et al.  A genome-scale computational study of the interplay between transcriptional regulation and metabolism , 2007, Molecular systems biology.

[18]  Vinay Satish Kumar,et al.  GrowMatch: An Automated Method for Reconciling In Silico/In Vivo Growth Predictions , 2009, PLoS Comput. Biol..

[19]  C. Park,et al.  The D-allose operon of Escherichia coli K-12 , 1997, Journal of bacteriology.

[20]  Michael K. Gilson,et al.  ASAP, a systematic annotation package for community analysis of genomes , 2003, Nucleic Acids Res..

[21]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[22]  B. Wanner,et al.  One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[23]  C. Sella,et al.  Properties of subcloned subunits of bacterial acetohydroxy acid synthases , 1992, Journal of bacteriology.

[24]  Bernhard O. Palsson,et al.  Constraint-based analysis of metabolic capacity of Salmonella typhimurium during host-pathogen interaction , 2009, BMC Systems Biology.

[25]  K. I. Sørensen,et al.  Ribose catabolism of Escherichia coli: characterization of the rpiB gene encoding ribose phosphate isomerase B and of the rpiR gene, which is involved in regulation of rpiB expression , 1996, Journal of bacteriology.

[26]  J. Heijenoort,et al.  Copurification of glucosamine-1-phosphate acetyltransferase and N-acetylglucosamine-1-phosphate uridyltransferase activities of Escherichia coli: characterization of the glmU gene product as a bifunctional enzyme catalyzing two subsequent steps in the pathway for UDP-N-acetylglucosamine synthesis , 1994, Journal of bacteriology.

[27]  E. Vimr,et al.  Convergent Pathways for Utilization of the Amino Sugars N-Acetylglucosamine,N-Acetylmannosamine, and N-Acetylneuraminic Acid by Escherichia coli , 1999, Journal of bacteriology.

[28]  Christian L. Barrett,et al.  Genome-scale reconstruction of the Lrp regulatory network in Escherichia coli , 2008, Proceedings of the National Academy of Sciences.

[29]  A. Horswill,et al.  Studies of Regulation of Expression of the Propionate (prpBCDE) Operon Provide Insights into How Salmonella typhimurium LT2 Integrates Its 1,2-Propanediol and Propionate Catabolic Pathways , 1998, Journal of bacteriology.

[30]  J. Plumbridge,et al.  Co‐ordinated regulation of amino sugar biosynthesis and degradation: the NagC repressor acts as both an activator and a repressor for the transcription of the glmUS operon and requires two separated NagC binding sites. , 1995, The EMBO journal.

[31]  E. Lin,et al.  Ferrous-activated nicotinamide adenine dinucleotide-linked dehydrogenase from a mutant of Escherichia coli capable of growth on 1, 2-propanediol. , 1969, Journal of bacteriology.

[32]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[33]  W. D. Nunn,et al.  Genetic and molecular characterization of the genes involved in short-chain fatty acid degradation in Escherichia coli: the ato system , 1987, Journal of bacteriology.

[34]  P. Overath,et al.  ato Operon: a highly inducible system for acetoacetate and butyrate degradation in Escherichia coli. , 1972, European journal of biochemistry.

[35]  R. Mahadevan,et al.  The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. , 2003, Metabolic engineering.

[36]  G. Unden,et al.  Regulation of tartrate metabolism by TtdR and relation to the DcuS-DcuR-regulated C4-dicarboxylate metabolism of Escherichia coli. , 2009, Microbiology.

[37]  B. Palsson,et al.  Systems approach to refining genome annotation , 2006, Proceedings of the National Academy of Sciences.

[38]  Markus J. Herrgård,et al.  Integrating high-throughput and computational data elucidates bacterial networks , 2004, Nature.

[39]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[40]  Adam M. Feist,et al.  Reconstruction of biochemical networks in microorganisms , 2009, Nature Reviews Microbiology.

[41]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[42]  Jennifer L. Reed,et al.  OptORF: Optimal metabolic and regulatory perturbations for metabolic engineering of microbial strains , 2010, BMC Systems Biology.

[43]  S. Teichmann,et al.  Evolutionary dynamics of prokaryotic transcriptional regulatory networks. , 2006, Journal of molecular biology.

[44]  N. Costantino,et al.  E. coli genome manipulation by P1 transduction. , 2007, Current protocols in molecular biology.

[45]  Vinay Satish Kumar,et al.  Optimization based automated curation of metabolic reconstructions , 2007, BMC Bioinformatics.

[46]  Guy Karlebach,et al.  Modelling and analysis of gene regulatory networks , 2008, Nature Reviews Molecular Cell Biology.

[47]  Markus J. Herrgård,et al.  Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. , 2006, Genome research.

[48]  N. Obradors,et al.  Evolution of an Escherichia coli Protein with Increased Resistance to Oxidative Stress* , 1998, The Journal of Biological Chemistry.

[49]  J. Escalante‐Semerena,et al.  prpR, ntrA, and ihf Functions Are Required for Expression of the prpBCDE Operon, Encoding Enzymes That Catabolize Propionate in Salmonella enterica Serovar Typhimurium LT2 , 2000, Journal of bacteriology.

[50]  Bernhard O. Palsson,et al.  Identification of Genome-Scale Metabolic Network Models Using Experimentally Measured Flux Profiles , 2006, PLoS Comput. Biol..

[51]  R. Welch,et al.  DsdX Is the Second d-Serine Transporter in Uropathogenic Escherichia coli Clinical Isolate CFT073 , 2006, Journal of bacteriology.

[52]  Jason A. Papin,et al.  Applications of genome-scale metabolic reconstructions , 2009, Molecular systems biology.

[53]  Markus J. Herrgård,et al.  Reconstruction of microbial transcriptional regulatory networks. , 2004, Current opinion in biotechnology.

[54]  Desmond S. Lun,et al.  Interpreting Expression Data with Metabolic Flux Models: Predicting Mycobacterium tuberculosis Mycolic Acid Production , 2009, PLoS Comput. Biol..

[55]  J. Escalante‐Semerena,et al.  2-Methylcitrate-dependent activation of the propionate catabolic operon (prpBCDE) of Salmonella enterica by the PrpR protein. , 2004, Microbiology.

[56]  Peter D. Karp,et al.  A Bayesian method for identifying missing enzymes in predicted metabolic pathway databases , 2004, BMC Bioinformatics.

[57]  B. Palsson,et al.  Regulation of gene expression in flux balance models of metabolism. , 2001, Journal of theoretical biology.

[58]  Eva Cusa,et al.  Regulation of the Escherichia coli allantoin regulon: coordinated function of the repressor AllR and the activator AllS. , 2002, Journal of molecular biology.