Using Sequence-Predicted Contacts to Guide Template-free Protein Structure Prediction

Template-free protein structure prediction seeks three-dimensional structures that organize the serially-bonded amino acids in ways that lower interatomic energy. It is now well understood that energy functions are unreliable guides towards biologically-active structures. This realization raises questions on the proper role and utilization of energy functions. Recent work suggests employing complementary information in the form of amino-acid contacts. Here, we advance this line of work and leverage multi-objective optimization to investigate a variety of combinations of interatomic energy and contact-based scoring. Evaluation on diverse datasets demonstrates the superiority of combining contact information with energy functions in a multi-objective optimization setting for template-free protein structure prediction.

[1]  Xiaogen Zhou,et al.  Secondary Structure and Contact Guided Differential Evolution for Protein Structure Prediction , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Michael Levitt,et al.  Generalized ensemble methods for de novo structure prediction , 2009, Proceedings of the National Academy of Sciences.

[3]  Mario Garza-Fabre,et al.  Improved fragment-based protein structure prediction by redesign of search heuristics , 2018, Scientific Reports.

[4]  Amarda Shehu,et al.  Balancing multiple objectives in conformation sampling to control decoy diversity in template-free protein structure prediction , 2019, BMC Bioinformatics.

[5]  A. D. McLachlan,et al.  A mathematical procedure for superimposing atomic coordinates of proteins , 1972 .

[6]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[7]  D. Baker,et al.  Coupled prediction of protein secondary and tertiary structure , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Andriy Kryshtafovych,et al.  Assessment of contact predictions in CASP12: Co‐evolution and deep learning coming of age , 2017, Proteins.

[9]  Colin R. Reeves,et al.  Evolutionary computation: a unified approach , 2007, Genetic Programming and Evolvable Machines.

[10]  Joshua D. Knowles,et al.  Multiobjective Optimization in Bioinformatics and Computational Biology , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Jinbo Xu,et al.  Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model , 2016 .

[12]  Jie Hou,et al.  DNCON2: improved protein contact prediction using two-level deep convolutional neural networks , 2017, bioRxiv.

[13]  Yang Zhang,et al.  Template‐based and free modeling of I‐TASSER and QUARK pipelines using predicted contact maps in CASP12 , 2018, Proteins.

[14]  L. Sifre,et al.  A 7 D De novo structure prediction with deep learning based scoring , 2022 .

[15]  Ruth Nussinov,et al.  A second molecular biology revolution? The energy landscapes of biomolecular function. , 2014, Physical chemistry chemical physics : PCCP.

[16]  Brian S. Olson,et al.  Multi-Objective Optimization Techniques for Conformational Sampling in Template-Free Protein Structure Prediction , 2014 .

[17]  A. Liwo,et al.  Protein structure prediction by global optimization of a potential energy function. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Amarda Shehu,et al.  Multi-Objective Stochastic Search for Sampling Local Minima in the Protein Energy Surface , 2013, BCB.

[19]  R. Fisher On the Interpretation of χ 2 from Contingency Tables , and the Calculation of P Author , 2022 .

[20]  Kenneth A. De Jong,et al.  Off-lattice protein structure prediction with homologous crossover , 2013, GECCO '13.

[21]  Hongyi Zhou,et al.  DESTINI: A deep-learning approach to contact-driven protein structure prediction , 2019, Scientific Reports.

[22]  Ngaam J Cheung,et al.  De novo protein structure prediction using ultra-fast molecular dynamics simulation , 2018, PloS one.

[23]  G. Barnard,et al.  A New Test for 2 × 2 Tables , 1945 .

[24]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[25]  Krzysztof Fidelis,et al.  Processing and evaluation of predictions in CASP4 , 2001, Proteins.