ProQM-resample: improved model quality assessment for membrane proteins by limited conformational sampling

Summary: Model Quality Assessment Programs (MQAPs) are used to predict the quality of modeled protein structures. They generally fall into two categories: consensus methods, which compare many alternative models, and single-model methods, which require only a single model to make their prediction. Consensus methods are useful for improving overall accuracy; however, they frequently fail to pick out the best possible model and cannot be used to generate and score new structures. Single-model methods, on the other hand, do not have these inherent shortcomings and can be used both to sample new structures and to improve existing consensus methods. Here, we present ProQM-resample, a membrane protein-specific single-model MQAP that couples side-chain resampling with MQAP rescoring by ProQM to improve model selection. The side-chain resampling improves side-chain packing for 96% of all models and improves model selection by 24%, as measured by the sum of the Z-scores for the first-ranked model (from 25.0 to 31.1), which is even better than the state-of-the-art consensus method Pcons. The improved model selection can be attributed to the improved side-chain quality, which enables the MQAP to rescue good backbone models with poor side-chain packing.

Availability and implementation: http://proqm.wallnerlab.org/download/

Contact: bjornw@ifm.liu.se

Supplementary information: Supplementary data are available at Bioinformatics online.
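The selection metric quoted in the summary (the sum of Z-scores for the first-ranked model) can be illustrated with a minimal sketch. The summary does not spell out the exact definition, so the code below makes assumptions: for each target, the model with the highest predicted score is picked, and its true quality is converted to a Z-score against all models of that target using the population standard deviation. The function names and the toy numbers are hypothetical and are not taken from ProQM or ProQM-resample.

```python
from statistics import mean, pstdev


def first_ranked_z(predicted, true_quality):
    """Z-score of the true quality of the model ranked first by predicted score.

    Assumption: the Z-score is taken over all models of one target,
    using the population standard deviation.
    """
    if not predicted or len(predicted) != len(true_quality):
        raise ValueError("need two non-empty lists of equal length")
    # Index of the model the MQAP would select (highest predicted score).
    top = max(range(len(predicted)), key=predicted.__getitem__)
    sigma = pstdev(true_quality)
    if sigma == 0:
        return 0.0  # all models equally good; selection cannot matter
    return (true_quality[top] - mean(true_quality)) / sigma


def sum_first_ranked_z(targets):
    """Sum the first-ranked Z-score over (predicted, true_quality) pairs."""
    return sum(first_ranked_z(p, q) for p, q in targets)


def relative_improvement(before, after):
    """Fractional change from `before` to `after`."""
    return (after - before) / before


# The figures reported in the summary: 25.0 before and 31.1 after resampling.
print(f"{relative_improvement(25.0, 31.1):.1%}")
```

With the reported sums, `relative_improvement(25.0, 31.1)` is about 0.244, which matches the stated 24% improvement in model selection.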
