The challenge of modeling protein assemblies: the CASP12‐CAPRI experiment

We present the quality assessment of 5613 models submitted by predictor groups from both CAPRI and CASP for the total of 15 most tractable targets from the second joint CASP‐CAPRI protein assembly prediction experiment. These targets comprised 12 homo‐oligomers and 3 hetero‐complexes. The bulk of the analysis focuses on 10 targets (of CAPRI Round 37), which included all 3 hetero‐complexes, and whose protein chains or the full assembly could be readily modeled from structural templates in the PDB. On average, 28 CAPRI groups and 10 CASP groups (including automatic servers), submitted models for each of these 10 targets. Additionally, about 16 groups participated in the CAPRI scoring experiments. A range of acceptable to high quality models were obtained for 6 of the 10 Round 37 targets, for which templates were available for the full assembly. Poorer results were achieved for the remaining targets due to the lower quality of the templates available for the full complex or the individual protein chains, highlighting the unmet challenge of modeling the structural adjustments of the protein components that occur upon binding or which must be accounted for in template‐based modeling. On the other hand, our analysis indicated that residues in binding interfaces were correctly predicted in a sizable fraction of otherwise poorly modeled assemblies and this with higher accuracy than published methods that do not use information on the binding partner. Lastly, the strengths and weaknesses of the assessment methods are evaluated and improvements suggested.

[1]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[2]  Thomas A. Hopf,et al.  Protein structure prediction from sequence variation , 2012, Nature Biotechnology.

[3]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[4]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[5]  Denise Gorse,et al.  Morphological aspects of oligomeric protein structures. , 2005, Progress in biophysics and molecular biology.

[6]  Andrej Sali,et al.  Integrative Structural Biology , 2013, Science.

[7]  Pinak Chakrabarti,et al.  The subunit interfaces of weakly associated homodimeric proteins. , 2010, Journal of molecular biology.

[8]  Alexandre M J J Bonvin,et al.  How proteins get in touch: interface prediction in the study of biomolecular complexes. , 2008, Current protein & peptide science.

[9]  Dima Kozakov,et al.  Convergence and combination of methods in protein-protein docking. , 2009, Current opinion in structural biology.

[10]  Kengo Kinoshita,et al.  Community-wide assessment of protein-interface modeling suggests improvements to design methodology. , 2011, Journal of molecular biology.

[11]  Kengo Kinoshita,et al.  Blind prediction of interfacial water positions in CAPRI , 2014, Proteins.

[12]  Johannes Söding,et al.  Fast and accurate automatic structure prediction with HHpred , 2009, Proteins.

[13]  J. Thornton,et al.  Diversity of protein–protein interactions , 2003, The EMBO journal.

[14]  J. Thornton,et al.  Structural characterisation and functional significance of transient protein-protein interactions. , 2003, Journal of molecular biology.

[15]  David T. Jones,et al.  MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins , 2014, Bioinform..

[16]  Marc F Lensink,et al.  Docking, scoring, and affinity prediction in CAPRI , 2013, Proteins.

[17]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[18]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[19]  Zoran Obradovic,et al.  Statistical analysis of interface similarity in crystals of homologous proteins. , 2008, Journal of molecular biology.

[20]  Shoshana J. Wodak,et al.  Landscape of intertwined associations in multi-domain homo-oligomeric proteins. , 2015, Journal of molecular biology.

[21]  David E. Kim,et al.  Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. , 2016, Journal of chemical theory and computation.

[22]  Timothy A. Whitehead,et al.  Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing , 2012, Nature Biotechnology.

[23]  Marc F Lensink,et al.  Blind predictions of protein interfaces by docking calculations in CAPRI , 2010, Proteins.

[24]  Marc F Lensink,et al.  Docking and scoring protein interactions: CAPRI 2009 , 2010, Proteins.

[25]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[26]  Lisa N Kinch,et al.  Evaluation of free modeling targets in CASP11 and ROLL , 2016, Proteins.

[27]  Yang Zhang,et al.  Template-based structure modeling of protein-protein interactions. , 2014, Current opinion in structural biology.

[28]  Daisuke Kihara,et al.  Prediction of homoprotein and heteroprotein complexes by protein docking and template‐based modeling: A CASP‐CAPRI experiment , 2016, Proteins.

[29]  S. Wodak,et al.  Docking and scoring protein complexes: CAPRI 3rd Edition , 2007, Proteins.

[30]  Kliment Olechnovič,et al.  VoroMQA: Assessment of protein structure quality using interatomic contact areas , 2017, Proteins.

[31]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[32]  Georgios A. Pavlopoulos,et al.  Protein structure determination using metagenome sequence data , 2017, Science.

[33]  Julie C. Mitchell,et al.  Community‐wide evaluation of methods for predicting the effect of mutations on protein–protein interactions , 2013, Proteins.

[34]  Terence Hwa,et al.  Coevolutionary signals across protein lineages help capture multiple protein conformations , 2013, Proceedings of the National Academy of Sciences.

[35]  David Baker,et al.  Computational design of novel protein binders and experimental affinity maturation. , 2013, Methods in enzymology.

[36]  Hilla Peretz,et al.  The , 1966 .

[37]  Chenghua Shao,et al.  Trendspotting in the Protein Data Bank , 2013, FEBS letters.

[38]  Thomas A. Hopf,et al.  Structured States of Disordered Proteins from Genomic Sequences , 2016, Cell.

[39]  Anna Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP) — round x , 2014, Proteins.

[40]  Zhengwei Zhu,et al.  Templates are available to model nearly all complexes of structurally characterized proteins , 2012, Proceedings of the National Academy of Sciences.

[41]  Emmanuel D Levy,et al.  Structural, evolutionary, and assembly principles of protein oligomerization. , 2013, Progress in molecular biology and translational science.

[42]  B. Alberts The Cell as a Collection of Protein Machines: Preparing the Next Generation of Molecular Biologists , 1998, Cell.

[43]  Xiaoqin Zou,et al.  Statistical mechanics‐based method to extract atomic distance‐dependent potentials from protein structures , 2011, Proteins.

[44]  David E. Kim,et al.  Improved de novo structure prediction in CASP11 by incorporating coevolution information into Rosetta , 2016, Proteins.

[45]  Yutaka Akiyama,et al.  FORTE: a profile-profile comparison tool for protein fold recognition , 2004, Bioinform..

[46]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[47]  Kliment Olechnovic,et al.  The PPI3D web server for searching, analyzing and modeling protein‐protein interactions in the context of 3D structures , 2016, Bioinform..

[48]  K. Henrick,et al.  Inference of macromolecular assemblies from crystalline state. , 2007, Journal of molecular biology.

[49]  J. Janin,et al.  Structural basis of macromolecular recognition. , 2002, Advances in protein chemistry.

[50]  Vasant G Honavar,et al.  Computational prediction of protein interfaces: A review of data driven methods , 2015, FEBS letters.

[51]  Patrick Aloy,et al.  Assessing the applicability of template-based protein docking in the twilight zone. , 2014, Structure.

[52]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[53]  Mayya Sedova,et al.  PDBFlex: exploring flexibility in protein structures , 2015, Nucleic Acids Res..

[54]  Jose M. Duarte,et al.  A PDB-wide, evolution-based assessment of protein–protein interfaces , 2014, BMC Structural Biology.