Computational predictions of the mutant behavior of AraC.

An algorithm implemented in Rosetta correctly predicts the folding capabilities of the 17-residue N-terminal arm of the AraC gene regulatory protein when arabinose is bound to the protein and the dramatically different structure of this arm when arabinose is absent. The transcriptional activity of 43 mutant AraC proteins with alterations in the arm sequences was measured in vivo and compared with their predicted folding properties. Seventeen of the mutants possessed regulatory properties that could be directly compared with folding predictions. Sixteen of the 17 mutants were correctly predicted. The algorithm predicts that the N-terminal arm sequences of AraC homologs fold to the Escherichia coli AraC arm structure. In contrast, it predicts that random sequences of the same length and many partially randomized E. coli arm sequences do not fold to the E. coli arm structure. The high level of success shows that relatively "simple" computational methods can in some cases predict the behavior of mutant proteins with good reliability.
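The counts reported above (43 mutants measured, 17 directly comparable, 16 correctly predicted) can be turned into a success rate with a rough uncertainty estimate. The sketch below is illustrative only: the Wilson score interval is an assumption introduced here to show how much uncertainty a sample of 17 carries, not a method taken from the study, and the function and constant names are hypothetical.

```python
from math import sqrt

# Counts taken from the abstract.
TOTAL_MUTANTS = 43   # mutant AraC proteins measured in vivo
COMPARABLE = 17      # mutants whose regulatory properties could be compared
CORRECT = 16         # comparable mutants that were correctly predicted


def wilson_interval(successes, trials, z=1.96):
    """Wilson score interval for a binomial proportion.

    z=1.96 corresponds to roughly 95% coverage. This interval is an
    assumption added for illustration; the abstract reports only counts.
    """
    p = successes / trials
    denom = 1 + z**2 / trials
    centre = (p + z**2 / (2 * trials)) / denom
    half = z * sqrt(p * (1 - p) / trials + z**2 / (4 * trials**2)) / denom
    return centre - half, centre + half


accuracy = CORRECT / COMPARABLE                  # fraction predicted correctly
comparable_share = COMPARABLE / TOTAL_MUTANTS    # fraction of mutants usable
low, high = wilson_interval(CORRECT, COMPARABLE)

print(f"accuracy on comparable mutants: {accuracy:.1%}")
print(f"share of mutants comparable:    {comparable_share:.1%}")
print(f"approx. 95% Wilson interval:    {low:.2f} to {high:.2f}")
```

Because only 17 of the 43 mutants could be compared, the success rate of about 94% applies to that subset, and the interval is correspondingly wide.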
