OPUS-Fold: An Open-Source Protein Folding Framework Based on Torsion-Angle Sampling.

In this paper, we propose a protein folding framework, named OPUS-Fold, which can integrate various methods for subproblems in protein structure prediction to contribute to folding. OPUS-Fold is based on torsion-angle sampling. After each sampling step, it reconstructs the structure and estimates the model quality with an energy function that is formed by combining many different constraining terms designed either by ourselves or by others in literature. OPUS-Fold balances the accuracy and the efficiency, delivers good results in a short time, and leaves more space for including the results of other subproblem methods. Moreover, OPUS-Fold also contains a fast side-chain modeling method OPUS-Rota2 (J. Chem. Theory Comput. 2019, 15 (9), 5154-5160), which enables a speedy construction of all-atom atomic models during the folding process that allows the usage of all-atom-required subproblem methods. In summary, OPUS-Fold provides a protein folding platform for incorporating the results from various subproblem methods, including those containing non-differentiable information such as partial experimental data. The source code of OPUS-Fold can be downloaded from https://github.com/thuxugang/opus_fold.

[1]  Yaoqi Zhou,et al.  Improved fragment sampling for ab initio protein structure prediction using deep neural networks , 2019, Nature Machine Intelligence.

[2]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[3]  Jianpeng Ma,et al.  OPUS-DOSP: A Distance- and Orientation-Dependent All-Atom Potential Derived from Side-Chain Packing. , 2017, Journal of molecular biology.

[4]  Jianpeng Ma,et al.  OPUS-Rota2: An Improved Fast and Accurate Side-chain Modeling Method. , 2019, Journal of chemical theory and computation.

[5]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[6]  Wayne A Hendrickson,et al.  A force field for virtual atom molecular mechanics of proteins , 2009, Proceedings of the National Academy of Sciences.

[7]  Zhen Li,et al.  Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model , 2016, bioRxiv.

[8]  Jianpeng Ma,et al.  OPUS‐SSF: A side‐chain‐inclusive scoring function for ranking protein structural models , 2019, Protein science : a publication of the Protein Society.

[9]  Kuldip K. Paliwal,et al.  Capturing non‐local interactions by long short‐term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility , 2017, Bioinform..

[10]  Jie Hou,et al.  DNCON2: improved protein contact prediction using two-level deep convolutional neural networks , 2017, bioRxiv.

[11]  Roland L. Dunbrack,et al.  A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. , 2011, Structure.

[12]  Sheng-You Huang,et al.  New Knowledge-Based Scoring Function with Inclusion of Backbone Conformational Entropies from Protein Structures , 2018, J. Chem. Inf. Model..

[13]  Jianpeng Ma,et al.  OPUS-Refine: A Fast Sampling-based Framework for Refining Protein Backbone Torsion Angles and Global Conformation. , 2020, Journal of chemical theory and computation.

[14]  Yaoqi Zhou,et al.  Protein side chain modeling with orientation‐dependent atomic force fields derived by series expansions , 2011, J. Comput. Chem..

[15]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[16]  Yang Zhang,et al.  A Novel Side-Chain Orientation Dependent Potential Derived from Random-Walk Reference State for Protein Fold Selection and Structure Prediction , 2010, PloS one.

[17]  Yang Zhang,et al.  3DRobot: automated generation of diverse and well-packed protein structure decoys , 2016, Bioinform..

[18]  Ole Winther,et al.  NetSurfP‐2.0: Improved prediction of protein structural features by integrated deep learning , 2019, Proteins.

[19]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[20]  Richard Bonneau,et al.  An improved protein decoy set for testing energy functions for protein structure prediction , 2003, Proteins.

[21]  Yuedong Yang,et al.  SPOT‐Fold: Fragment‐Free Protein Structure Prediction Guided by Predicted Backbone Structure and Contact Map , 2020, J. Comput. Chem..

[22]  Chi Zhang,et al.  Fast and accurate prediction of protein side-chain conformations , 2011, Bioinform..

[23]  Wei Li,et al.  CoinFold: a web server for protein contact prediction and contact-assisted protein folding , 2016, Nucleic Acids Res..

[24]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[25]  Wei Li,et al.  RaptorX-Property: a web server for protein structure property prediction , 2016, Nucleic Acids Res..

[26]  Kuldip K. Paliwal,et al.  Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks , 2018, Bioinform..

[27]  Sheng Wang,et al.  RaptorX-Angle: real-value prediction of protein backbone dihedral angles through a hybrid method of clustering and deep learning , 2018, BMC Bioinformatics.

[28]  Ngaam J Cheung,et al.  De novo protein structure prediction using ultra-fast molecular dynamics simulation , 2018, PloS one.

[29]  Yang Zhang,et al.  Ensembling multiple raw coevolutionary features with deep residual neural networks for contact‐map prediction in CASP13 , 2019, Proteins.

[30]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[31]  Jianpeng Ma,et al.  OPUS‐CSF: A C‐atom‐based scoring function for ranking protein structural models , 2017, bioRxiv.

[32]  David T Jones,et al.  Prediction of interresidue contacts with DeepMetaPSICOV in CASP13 , 2019, Proteins.

[33]  Demis Hassabis,et al.  Improved protein structure prediction using potentials from deep learning , 2020, Nature.

[34]  Yaoqi Zhou,et al.  Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks , 2018, Bioinform..

[35]  Jinbo Xu Distance-based protein folding powered by deep learning , 2019, Proceedings of the National Academy of Sciences.

[36]  Yuangen Yao,et al.  ANDIS: an atomic angle- and distance-dependent statistical potential for protein structure quality assessment , 2019, BMC Bioinformatics.

[37]  Sergei Grudinin,et al.  Protein model quality assessment using 3D oriented convolutional neural networks , 2018, bioRxiv.

[38]  Janet E. Jones On the determination of molecular fields. —II. From the equation of state of a gas , 1924 .

[39]  Sheng-You Huang,et al.  Integrating Bonded and Nonbonded Potentials in the Knowledge-Based Scoring Function for Protein Structure Prediction , 2019, J. Chem. Inf. Model..

[40]  Mohammed AlQuraishi End-to-end differentiable learning of protein structure , 2018, bioRxiv.

[41]  Pablo Chacón,et al.  KORP: knowledge-based 6D potential for fast protein and loop modeling , 2019, Bioinform..