Predicting secondary structure propensities in IDPs using simple statistics from three-residue fragments.

Intrinsically Disordered Proteins (IDPs) play key functional roles facilitated by their inherent plasticity. In most of the cases, IDPs recognize their partners through partially-structured elements inserted in fully-disordered chains. The identification and characterization of these elements is fundamental to understand the functional mechanisms of IDPs. Although several computational methods have been developed to identify order within disordered chains, most of the current secondary structure predictors are focused on globular proteins and are not necessarily appropriate for IDPs. Here, we present a comprehensible method, called Local Structural Propensity Predictor (LS2P), to predict secondary structure elements from IDP sequences. LS2P performs statistical analyses from a database of three-residue fragments extracted from coil regions of high-resolution protein structures. In addition to identifying scarcely populated helical and extended regions, the method pinpoints short stretches triggering β-turn formation or promoting α-helices. The simplicity of the method enables a direct connection between experimental observations and structural features encoded in IDP sequences.

[1]  Martin Blackledge,et al.  Characterization of intrinsically disordered proteins and their dynamic complexes: From in vitro to cell-like environments. , 2018, Progress in nuclear magnetic resonance spectroscopy.

[2]  Vladimir N. Uversky,et al.  Order, Disorder, and Everything in Between , 2016, Molecules.

[3]  Alessandro Barducci,et al.  Interplay of Protein Disorder in Retinoic Acid Receptor Heterodimer and Its Corepressor Regulates Gene Expression. , 2019, Structure.

[4]  Shaowen Yao,et al.  Protein secondary structure prediction: A survey of the state of the art. , 2017, Journal of molecular graphics & modelling.

[5]  Sonia Longhi,et al.  Simultaneous quantification of protein order and disorder. , 2017, Nature chemical biology.

[6]  Martin Blackledge,et al.  Structure and dynamics of the MKK7–JNK signaling complex , 2015, Proceedings of the National Academy of Sciences.

[7]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[8]  Martin Blackledge,et al.  Intrinsic disorder within the erythrocyte binding-like proteins from Plasmodium falciparum. , 2014, Biochimica et biophysica acta.

[9]  Monika Fuxreiter,et al.  Fuzziness: linking regulation to protein dynamics. , 2012, Molecular bioSystems.

[10]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[11]  Christian Griesinger,et al.  Quantitative determination of the conformational properties of partially folded and intrinsically disordered proteins using NMR dipolar couplings. , 2009, Structure.

[12]  Mariusz Jaremko,et al.  Predictive atomic resolution descriptions of intrinsically disordered hTau40 and α-synuclein in solution from NMR and small angle scattering. , 2014, Structure.

[13]  Martin Blackledge,et al.  Mapping the potential energy landscape of intrinsically disordered proteins at amino acid resolution. , 2012, Journal of the American Chemical Society.

[14]  Jakob T Nielsen,et al.  Quality and bias of protein disorder predictors , 2019, Scientific Reports.

[15]  A. Solis,et al.  Deriving High-Resolution Protein Backbone Structure Propensities from All Crystal Data Using the Information Maximization Device , 2014, PloS one.

[16]  Pau Bernadó,et al.  Structural disorder within sendai virus nucleoprotein and phosphoprotein: insight into the structural basis of molecular recognition. , 2010, Protein and peptide letters.

[17]  Zsuzsanna Dosztányi,et al.  IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding , 2018, Nucleic Acids Res..

[18]  Zoran Obradovic,et al.  Predicting intrinsic disorder from amino acid sequence , 2003, Proteins.

[19]  Martin von Bergen,et al.  Highly populated turn conformations in natively unfolded tau protein identified from residual dipolar couplings and molecular simulation. , 2007, Journal of the American Chemical Society.

[20]  Marcos R Betancourt Knowledge-based potential for the polypeptide backbone. , 2008, The journal of physical chemistry. B.

[21]  Yaoqi Zhou,et al.  SPOT-Disorder2: Improved Protein Intrinsic Disorder Prediction by Ensembled Deep Learning , 2019, Genom. Proteom. Bioinform..

[22]  M. Tyers,et al.  Structure/function implications in a dynamic complex of the intrinsically disordered Sic1 with the Cdc4 subunit of an SCF ubiquitin ligase. , 2010, Structure.

[23]  H. Dyson,et al.  Intrinsically disordered proteins in cellular signalling and regulation , 2014, Nature Reviews Molecular Cell Biology.

[24]  Julie D. Forman-Kay,et al.  Dynamic Protein Interaction Networks and New Structural Paradigms in Signaling. , 2016, Chemical reviews.

[25]  Sonia Longhi,et al.  Intrinsic disorder in measles virus nucleocapsids , 2011, Proceedings of the National Academy of Sciences.

[26]  Thierry Siméon,et al.  A reinforcement-learning-based approach to enhance exhaustive protein loop sampling , 2019, Bioinform..

[27]  Martin Blackledge,et al.  Amino acid bulkiness defines the local conformations and dynamics of natively unfolded alpha-synuclein and tau. , 2007, Journal of the American Chemical Society.

[28]  Aidan Budd,et al.  Short linear motifs: ubiquitous and functionally diverse protein interaction modules directing cell regulation. , 2014, Chemical reviews.

[29]  A. Fersht,et al.  Structure of tumor suppressor p53 and its intrinsically disordered N-terminal transactivation domain , 2008, Proceedings of the National Academy of Sciences.

[30]  Julie D Forman-Kay,et al.  Modulation of Intrinsically Disordered Protein Function by Post-translational Modifications , 2016, The Journal of Biological Chemistry.

[31]  Monika Fuxreiter,et al.  Interactions via intrinsically disordered regions: What kind of motifs? , 2012, IUBMB life.

[32]  Kang Chen,et al.  Conformation of the backbone in unfolded proteins. , 2006, Chemical reviews.

[33]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[34]  Peter Tompa,et al.  Intrinsically disordered proteins: emerging interaction specialists. , 2015, Current opinion in structural biology.

[35]  Martin Blackledge,et al.  Quantitative conformational analysis of partially folded proteins from residual dipolar couplings: application to the molecular recognition element of Sendai virus nucleoprotein. , 2008, Journal of the American Chemical Society.

[36]  Steven E. Brenner,et al.  SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures , 2013, Nucleic Acids Res..

[37]  Jörg Gsponer,et al.  Intrinsically disordered proteins: regulation and disease. , 2011, Current opinion in structural biology.

[38]  Abhishek K. Jha,et al.  Statistical coil model of the unfolded state: resolving the reconciliation problem. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[39]  H. Dyson,et al.  Mechanism of coupled folding and binding of an intrinsically disordered protein , 2007, Nature.

[40]  Nathalie Sibille,et al.  p15PAF is an intrinsically disordered protein with nonrandom structural preferences at sites of interaction with other proteins. , 2014, Biophysical journal.

[41]  T. Gibson,et al.  Protein disorder prediction: implications for structural proteomics. , 2003, Structure.

[42]  David T. Jones,et al.  DISOPRED3: precise disordered region predictions with annotated protein-binding activity , 2014, Bioinform..

[43]  H. Dyson,et al.  Unfolded proteins and protein folding studied by NMR. , 2004, Chemical reviews.

[44]  Norman E. Davey,et al.  The functional importance of structure in unstructured protein regions. , 2019, Current opinion in structural biology.

[45]  Piero Fariselli,et al.  The s2D method: simultaneous sequence-based prediction of the statistical populations of ordered and disordered regions in proteins. , 2015, Journal of molecular biology.

[46]  Mike Tyers,et al.  Dynamic equilibrium engagement of a polyvalent ligand with a single-site receptor , 2008, Proceedings of the National Academy of Sciences.

[47]  Alexandre G. de Brevern,et al.  Extension of the classical classification of β-turns. , 2016 .

[48]  Miquel Pons,et al.  Structural characterization of the natively unfolded N-terminal domain of human c-Src kinase: insights into the role of phosphorylation of the unique domain. , 2009, Journal of molecular biology.

[49]  Xiaolong Wang,et al.  A comprehensive review and comparison of existing computational methods for intrinsically disordered protein and region prediction , 2019, Briefings Bioinform..

[50]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[51]  J. M. Zimmerman,et al.  The characterization of amino acid sequences in proteins by statistical methods. , 1968, Journal of theoretical biology.

[52]  Martin Blackledge,et al.  Direct prediction of NMR residual dipolar couplings from the primary sequence of unfolded proteins. , 2013, Angewandte Chemie.

[53]  Juan Cortés,et al.  Realistic Ensemble Models of Intrinsically Disordered Proteins Using a Structure-Encoding Coil Database. , 2019, Structure.