AlphaFold predicts novel human proteins with knots

That a protein chain can form a knot has been known for almost 30 years. However, because such proteins are not common, only a fraction of them is available in the Protein Data Bank. Until now it was not possible to assess their importance and versatility, because we did not have access to the whole proteome of any organism, let alone the human one. The arrival of efficient machine learning methods for protein structure prediction, such as AlphaFold and RoseTTAFold, changed that. We analyzed all proteins from the human proteome (over 20,000) predicted with AlphaFold in search of knots and found them in less than 2% of the structures. Using a variety of methods, including homolog search, clustering, quality assessment, and visual inspection, we determined the nature of each knotted structure, classified it as knotted, potentially knotted, or an artifact, and deposited all of them in a database available at: https://knotprot.cent.uw.edu.pl/alphafold. Overall, we found 51 credible knotted proteins (0.2% of the human proteome). The set of potentially knotted structures includes a new, complex knot type not yet reported in proteins. That knot type, denoted 6_3 in mathematical notation, would necessitate a more complex folding path than any knotted protein characterized to date.
