Correlated mutations select misfolded from properly folded proteins

Motivation: The recently developed direct coupling analysis (DCA) method has greatly improved the accuracy with which residue‐residue contacts can be predicted from multiple sequence alignments. Contact prediction accuracy, though, is still often not sufficient for complete ab initio protein structure prediction. DCA can, however, support protein structure studies in several ways. Results: We show that DCA can select the better structure from among properly folded and misfolded variants. This idea was tested by comparing obsolete PDB files with their more correctly folded successors and by the comparison of structures with deliberately misfolded decoy models from the Decoys ′R′ Us database. The DCA method systematically predicts more contacts for properly folded structures than for misfolded ones. The method works much better for X‐ray structures than for NMR structures. Availability and Implementation: All data are available from http://comprec‐lin.iiar.pwr.edu.pl/dcaVSmisfolds/ and http://swift.cmbi.ru.nl/dcaVSmisfolds/. Contact: malgorzata.kotulska@pwr.edu.pl. Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  C. B. Roth,et al.  Structure of MsbA from E. coli: a homolog of the multidrug resistance ATP binding cassette (ABC) transporters. , 2001, Science.

[2]  G Vriend,et al.  The interaction of class B G protein-coupled receptors with their hormones. , 1998, Receptors & channels.

[3]  M. Boissinot,et al.  Complete chemical structure of photoactive yellow protein: novel thioester-linked 4-hydroxycinnamyl chromophore and photocycle chemistry. , 1994, Biochemistry.

[4]  P. Barth,et al.  Evolutionary-guided de novo structure prediction of self-associated transmembrane helical proteins with near-atomic accuracy , 2015, Nature Communications.

[5]  Marcin J. Skwark,et al.  Improving Contact Prediction along Three Dimensions , 2014, PLoS Comput. Biol..

[6]  William R. Taylor,et al.  Direct correlation analysis improves fold recognition , 2011, Comput. Biol. Chem..

[7]  Thomas A. Hopf,et al.  Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing , 2012, Cell.

[8]  A. McPherson,et al.  Refined structure of the gene 5 DNA binding protein from bacteriophage fd. , 1983, Journal of molecular biology.

[9]  Anna Tramontano,et al.  Evaluation of residue–residue contact prediction in CASP10 , 2014, Proteins.

[10]  Malgorzata Kotulska,et al.  Automated Procedure for Contact-Map-Based Protein Structure Reconstruction , 2014, The Journal of Membrane Biology.

[11]  C. Sander,et al.  Evaluation of protein models by atomic solvation preference. , 1992, Journal of molecular biology.

[12]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[13]  A. Tramontano,et al.  Evaluation of residue–residue contact predictions in CASP9 , 2011, Proteins.

[14]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[15]  Geoffrey Chang,et al.  X-ray Structure of the EmrE Multidrug Transporter in Complex with a Substrate , 2005, Science.

[16]  A. Biegert,et al.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.

[17]  David A. Agard,et al.  α-Bungaratoxin structure revealed by a rapid method for averaging electron density of non-crystallographically translationally related molecules , 1980 .

[18]  Cristina Marino Buslje,et al.  I-COMS: Interprotein-COrrelated Mutations Server , 2015, Nucleic Acids Res..

[19]  R M Stroud,et al.  The crystal structure of alpha-bungarotoxin at 2.5 A resolution: relation to solution structure and binding to acetylcholine receptor. , 1986, Protein engineering.

[20]  G. Chang,et al.  Structure of the multidrug resistance efflux transporter EmrE from Escherichia coli. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Marcin J. Skwark,et al.  PconsC: combination of direct information methods and alignments improves contact prediction , 2013, Bioinform..

[22]  S. Kim,et al.  Crystal structures of two intensely sweet proteins. , 1989, Trends in biochemical sciences.

[23]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[24]  G W Vuister,et al.  Solution structure and backbone dynamics of the photoactive yellow protein. , 1998, Biochemistry.

[25]  S Brunak,et al.  Protein structures from distance inequalities. , 1993, Journal of molecular biology.

[26]  Gert Vriend,et al.  New ways to boost molecular dynamics simulations , 2015, J. Comput. Chem..

[27]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[28]  K. Hellingwerf,et al.  Primary structure of a photoactive yellow protein from the phototrophic bacterium Ectothiorhodospira halophila, with evidence for the mass and the binding site of the chromophore , 1993, Protein science : a publication of the Protein Society.

[29]  Devon A. Thompson,et al.  Crystal structure of Focal Adhesion Kinase (FAK) , 2003 .

[30]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[31]  Gilles Labesse,et al.  Molecular recognition of paxillin LD motifs by the focal adhesion targeting domain. , 2003, Structure.

[32]  E. Aurell,et al.  Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models. , 2012, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Karen N. Allen,et al.  X-ray crystallographic structures of D-xylose isomerase-substrate complexes position the substrate and provide evidence for metal movement during catalysis. , 1994, Biochemistry.

[34]  K. Hellingwerf,et al.  Thiol ester-linked p-coumaric acid as a new photoactive prosthetic group in a protein with rhodopsin-like photochemistry. , 1994, Biochemistry.

[35]  Dagmar Ringe,et al.  X-ray crystallographic structures of D-xylose isomerase-substrate complexes position the substrate and provide evidence for metal movement during catalysis. , 1994, Biochemistry.

[36]  Geoffrey Chang,et al.  Flexibility in the ABC transporter MsbA: Alternating access with a twist , 2007, Proceedings of the National Academy of Sciences.

[37]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[38]  G. Borgstahl,et al.  1.4 A structure of photoactive yellow protein, a cytosolic photoreceptor: unusual fold, active site, and chromophore. , 1995, Biochemistry.

[39]  B. Rost,et al.  Effective use of sequence correlation and conservation in fold recognition. , 1999, Journal of molecular biology.

[40]  Michael Nilges,et al.  Refined Solution Structure of the Tyr41-]His Mutant of the M13 Gene-V Protein - a Comparison with the Crystal-Structure , 1995 .

[41]  G A Petsko,et al.  Crystallographic studies of the mechanism of xylose isomerase. , 1989, Biochemistry.

[42]  Geoffrey Chang,et al.  Structure of MsbA from Vibrio cholera: a multidrug resistance ABC transporter homolog in a closed conformation. , 2003, Journal of molecular biology.

[43]  William R Taylor,et al.  Prediction of protein structure from ideal forms , 2008, Proteins.

[44]  Simona Cocco,et al.  From Principal Component to Direct Coupling Analysis of Coevolution in Proteins: Low-Eigenvalue Modes are Needed for Structure Prediction , 2012, PLoS Comput. Biol..

[45]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[46]  S. Pietrokovski,et al.  A pair‐to‐pair amino acids substitution matrix and its applications for protein structure prediction , 2007, Proteins.

[47]  K Nishikawa,et al.  A geometrical constraint approach for reproducing the native backbone conformation of a protein , 1993, Proteins.

[48]  G. Chang,et al.  Structure of the ABC Transporter MsbA in Complex with ADP·Vanadate and Lipopolysaccharide , 2005, Science.

[49]  W R Taylor,et al.  Protein fold refinement: building models from idealized folds using motif constraints and multiple sequence data. , 1993, Protein engineering.

[50]  D E McRee,et al.  Crystallographic structure of a photoreceptor protein at 2.4 A resolution. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Brian W Matthews,et al.  Five retracted structure reports: Inverted or incorrect? , 2007, Protein science : a publication of the Protein Society.

[52]  Zhao Li,et al.  Identification of Protein-Protein Interactions by Detecting Correlated Mutation at the Interface , 2015, J. Chem. Inf. Model..

[53]  Michel Frey,et al.  Crystal structure and electron transfer properties of cytochrome c3. , 1985, The Journal of biological chemistry.

[54]  Simona Cocco,et al.  Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction , 2015, Nucleic acids research.

[55]  R Samudrala,et al.  Decoys ‘R’ Us: A database of incorrect conformations to improve protein structure prediction , 2000, Protein science : a publication of the Protein Society.

[56]  M Czjzek,et al.  Crystal structure of cytochrome c3 from Desulfovibrio desulfuricans Norway at 1.7 A resolution. , 1994, Journal of molecular biology.

[57]  Geoffrey Chang,et al.  X-ray structure of EmrE supports dual topology model , 2007, Proceedings of the National Academy of Sciences.

[58]  David A. Lee,et al.  CATH: comprehensive structural and functional annotations for genome sequences , 2014, Nucleic Acids Res..

[59]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[60]  Michael Lappe,et al.  Defining an Essence of Structure Determining Residue Contacts in Proteins , 2009, PLoS Comput. Biol..

[61]  R. Dawson,et al.  Structure of a bacterial multidrug ABC transporter , 2006, Nature.

[62]  Frank Schluenzen,et al.  High Resolution Structure of the Large Ribosomal Subunit from a Mesophilic Eubacterium , 2001, Cell.

[63]  Tim J. P. Hubbard,et al.  SCOP: a Structural Classification of Proteins database , 1999, Nucleic Acids Res..

[64]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[65]  Gert Vriend,et al.  Traditional Biomolecular Structure Determination by NMR Spectroscopy Allows for Major Errors , 2005, PLoS Comput. Biol..