Driving Forces for Nonnative Protein Aggregation and Approaches to Predict Aggregation-Prone Regions.

Nonnative protein aggregation is the process by which otherwise folded, monomeric proteins are converted to stable aggregates composed of protein chains that have undergone some degree of unfolding. Often, a conformational change is needed to allow certain sequences of amino acids-so-called aggregation-prone regions (APRs)-to form stable interprotein contacts such as β-sheet structures. In addition to APRs that are needed to stabilize aggregates, other factors or driving forces are also important in inducing aggregation in practice. This review focuses first on the overall process and mechanistic drivers for nonnative aggregation, followed by a more detailed summary of the factors currently thought to be important for determining which amino acid sequences most greatly stabilize nonnative protein aggregates, as well as a survey of many of the existing algorithms that are publicly available to attempt to predict APRs. Challenges with experimental validation of predicted APRs for proteins are briefly discussed.

[1]  Silvio C. E. Tosatto,et al.  The PASTA server for protein aggregation prediction. , 2007, Protein engineering, design & selection : PEDS.

[2]  H. Stanley,et al.  Molecular Dynamics Simulation of Amyloid β Dimer Formation , 2004, physics/0403040.

[3]  David Eisenberg,et al.  An amyloid-forming segment of beta2-microglobulin suggests a molecular model for the fibril. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Shuangye Yin,et al.  Eris: an automated estimator of protein stability , 2007, Nature Methods.

[5]  Arieh Warshel,et al.  Electrostatic contributions to protein stability and folding energy , 2007, FEBS letters.

[6]  Ian Kimber,et al.  Immunogenicity of therapeutic proteins: Influence of aggregation , 2013, Journal of immunotoxicology.

[7]  A. Fink Protein aggregation: folding aggregates, inclusion bodies and amyloid. , 1998, Folding & design.

[8]  U. Baxa,et al.  Amyloid structure and assembly: insights from scanning transmission electron microscopy. , 2011, Journal of structural biology.

[9]  Brian Kuhlman,et al.  Structure-based design of supercharged, highly thermoresistant antibodies. , 2012, Chemistry & biology.

[10]  L. Serrano,et al.  Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins , 2004, Nature Biotechnology.

[11]  Bernhardt L Trout,et al.  Developability index: a rapid in silico tool for the screening of antibody aggregation propensity. , 2012, Journal of pharmaceutical sciences.

[12]  D. Baker,et al.  The 3D profile method for identifying fibril-forming segments of proteins. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[13]  A. Cavalli,et al.  The role of aromaticity, exposed surface, and dipole moment in determining protein aggregation rates , 2004, Protein science : a publication of the Protein Society.

[14]  C. Roberts Kinetics of Irreversible Protein Aggregation: Analysis of Extended Lumry−Eyring Models and Implications for Predicting Protein Shelf Life , 2003 .

[15]  L. Croner,et al.  Stability engineering of scFvs for the development of bispecific and multivalent antibodies. , 2010, Protein engineering, design & selection : PEDS.

[16]  Srinivas Devadas,et al.  A method for probing the mutational landscape of amyloid structure , 2011, Bioinform..

[17]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[18]  Christopher J Roberts,et al.  Coarse-grained model for colloidal protein interactions, B(22), and protein cluster formation. , 2013, The journal of physical chemistry. B.

[19]  Samuel L. DeLuca,et al.  Practically Useful: What the Rosetta Protein Modeling Suite Can Do for You , 2010, Biochemistry.

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Jiwon Choi,et al.  NetCSSP: web application for predicting chameleon sequences and amyloid fibril formation , 2009, Nucleic Acids Res..

[22]  C. Dobson,et al.  Protein misfolding, functional amyloid, and human disease. , 2006, Annual review of biochemistry.

[23]  A. Giuliani,et al.  Early events in protein aggregation: Molecular flexibility and hydrophobicity/charge interaction in amyloid peptides as studied by molecular dynamics simulations , 2004, Proteins.

[24]  Erinc Sahin,et al.  Predicting solution aggregation rates for therapeutic proteins: approaches and challenges. , 2011, International journal of pharmaceutics.

[25]  Christopher J Roberts,et al.  Therapeutic protein aggregation: mechanisms, design, and control. , 2014, Trends in biotechnology.

[26]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[27]  Salvador Ventura,et al.  Prediction of "hot spots" of aggregation in disease-linked polypeptides , 2005, BMC Structural Biology.

[28]  Heeyoung Jung,et al.  CSSP2: An improved method for predicting contact-dependent secondary structure propensity , 2007, Comput. Biol. Chem..

[29]  Anthony Talvas,et al.  MetAmyl: A METa-Predictor for AMYLoid Proteins , 2013, PloS one.

[30]  A. Rosenberg,et al.  Effects of protein aggregates: An immunologic perspective , 2006, The AAPS Journal.

[31]  Hiroyuki Ogata,et al.  AAindex: Amino Acid Index Database , 1999, Nucleic Acids Res..

[32]  Bernhardt L Trout,et al.  Prediction of aggregation prone regions of therapeutic proteins. , 2010, The journal of physical chemistry. B.

[33]  Sergey Lyskov,et al.  PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta , 2010, Bioinform..

[34]  L. Serrano,et al.  A comparative study of the relationship between protein structure and beta-aggregation in globular and intrinsically disordered proteins. , 2004, Journal of molecular biology.

[35]  John F. Carpenter,et al.  Physical Stability of Proteins in Aqueous Solution: Mechanism and Driving Forces in Nonnative Protein Aggregation , 2003, Pharmaceutical Research.

[36]  Silvio C. E. Tosatto,et al.  PASTA 2.0: an improved server for protein aggregation prediction , 2014, Nucleic Acids Res..

[37]  Piero Fariselli,et al.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure , 2005, Nucleic Acids Res..

[38]  Raimon Sabate,et al.  Prediction of the aggregation propensity of proteins from the primary sequence: Aggregation properties of proteomes , 2011, Biotechnology journal.

[39]  G. Makhatadze,et al.  Thermal versus guanidine-induced unfolding of ubiquitin. An analysis in terms of the contributions from charge-charge interactions to protein stability. , 1999, Biochemistry.

[40]  Bernhardt L. Trout,et al.  Design of therapeutic proteins with enhanced stability , 2009, Proceedings of the National Academy of Sciences.

[41]  Lenore Cowen,et al.  BETASCAN: Probable β-amyloids Identified by Pairwise Probabilistic Analysis , 2009, PLoS Comput. Biol..

[42]  Wei Wang,et al.  Non-Arrhenius Protein Aggregation , 2013, The AAPS Journal.

[43]  Sandeep Kumar,et al.  Identification and Impact of Aggregation‐Prone Regions in Proteins and Therapeutic Monoclonal Antibodies , 2010 .

[44]  E. Mandelkow,et al.  Structure, Stability, and Aggregation of Paired Helical Filaments from Tau Protein and FTDP-17 Mutants Probed by Tryptophan Scanning Mutagenesis* , 2002, The Journal of Biological Chemistry.

[45]  Alan R Davidson,et al.  Multiple sequence alignment as a guideline for protein engineering strategies. , 2006, Methods in molecular biology.

[46]  L. Serrano,et al.  Protein aggregation and amyloidosis: confusion of the kinds? , 2006, Current opinion in structural biology.

[47]  Salvador Ventura,et al.  Mutagenesis of the central hydrophobic cluster in Aβ42 Alzheimer's peptide , 2006 .

[48]  P. Hammarström,et al.  Is the unfolded state the Rosetta Stone of the protein folding problem? , 2000, Biochemical and biophysical research communications.

[49]  N. Dovidchenko,et al.  Computational Approaches to Identification of Aggregation Sites and the Mechanism of Amyloid Growth. , 2015, Advances in experimental medicine and biology.

[50]  L. Croner,et al.  Conserved amino acid networks involved in antibody variable domain interactions , 2009, Proteins.

[51]  Stavros J Hamodrakas,et al.  Consensus prediction of amyloidogenic determinants in amyloid fibril-forming proteins. , 2007, International journal of biological macromolecules.

[52]  Wei Wang,et al.  Protein aggregation and its inhibition in biopharmaceutics. , 2005, International journal of pharmaceutics.

[53]  Rafael Zambrano,et al.  AGGRESCAN3D (A3D): server for prediction of aggregation properties of protein structures , 2015, Nucleic Acids Res..

[54]  Michele Vendruscolo,et al.  The CamSol method of rational design of protein mutants with enhanced solubility. , 2015, Journal of molecular biology.

[55]  Sandeep Kumar,et al.  Distinct position-specific sequence features of hexa-peptides that form amyloid-fibrils: application to discriminate between amyloid fibril and amorphous β-aggregate forming peptide sequences , 2013, BMC Bioinformatics.

[56]  Louise C. Serpell,et al.  A simple algorithm locates β‐strands in the amyloid fibril core of α‐synuclein, Aβ, and tau using the amino acid sequence alone , 2007 .

[57]  Pawel Gasior,et al.  FISH Amyloid – a new method for finding amyloidogenic segments in proteins based on site specific co-occurence of aminoacids , 2014, BMC Bioinformatics.

[58]  Christopher J Roberts,et al.  A Lumry-Eyring nucleated polymerization model of protein aggregation kinetics: 1. Aggregation with pre-equilibrated unfolding. , 2007, The journal of physical chemistry. B.

[59]  Arieh Warshel,et al.  Effective approach for calculations of absolute stability of proteins using focused dielectric constants , 2009, Proteins.

[60]  Christopher J Roberts,et al.  Conformational stability as a design target to control protein aggregation. , 2014, Protein engineering, design & selection : PEDS.

[61]  A. Plückthun,et al.  Stability engineering of antibody single-chain Fv fragments. , 2001, Journal of molecular biology.

[62]  David R. Liu,et al.  Supercharging proteins can impart unusual resilience. , 2007, Journal of the American Chemical Society.

[63]  Flavio Seno,et al.  Insight into the Structure of Amyloid Fibrils from the Analysis of Globular Proteins , 2006, PLoS Comput. Biol..

[64]  Christopher M. Dobson,et al.  Kinetic partitioning of protein folding and aggregation , 2002, Nature Structural Biology.

[65]  M. Oliveberg Waltz, an exciting new move in amyloid prediction , 2010, Nature Methods.

[66]  Bernhardt L Trout,et al.  Aggregation in protein-based biotherapeutics: computational studies and tools to identify aggregation-prone regions. , 2011, Journal of pharmaceutical sciences.

[67]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[68]  Virgil L. Woods,et al.  Structure and properties of α-synuclein and other amyloids determined at the amino acid level , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[69]  William J Welsh,et al.  Detecting hidden sequence propensity for amyloid fibril formation , 2004, Protein science : a publication of the Protein Society.

[70]  Thomas J Magliery,et al.  Protein stability: computation, sequence statistics, and new experimental methods. , 2015, Current opinion in structural biology.

[71]  Fabrizio Chiti,et al.  Sequence and structural determinants of amyloid fibril formation. , 2006, Accounts of chemical research.

[72]  Andrey V. Kajava,et al.  A structure-based approach to predict predisposition to amyloidosis , 2015, Alzheimer's & Dementia.

[73]  M. Blanco,et al.  Modulating non-native aggregation and electrostatic protein-protein interactions with computationally designed single-point mutations. , 2016, Protein engineering, design & selection : PEDS.

[74]  O. Gursky,et al.  Amyloid-Forming Properties of Human Apolipoproteins: Sequence Analyses and Structural Insights. , 2015, Advances in experimental medicine and biology.

[75]  Mauno Vihinen,et al.  Performance of protein stability predictors , 2010, Human mutation.

[76]  Pradeep Kota,et al.  Computational approaches to understanding protein aggregation in neurodegeneration. , 2014, Journal of molecular cell biology.

[77]  Paul Labute,et al.  Calibrative approaches to protein solubility modeling of a mutant series using physicochemical descriptors , 2010, J. Comput. Aided Mol. Des..

[78]  Michele Vendruscolo,et al.  Prediction of the absolute aggregation rates of amyloidogenic polypeptide chains. , 2004, Journal of molecular biology.

[79]  Michail Yu. Lobanov,et al.  FoldAmyloid: a method of prediction of amyloidogenic regions from protein sequence , 2010, Bioinform..

[80]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[81]  Yi Liu,et al.  RosettaDesign server for protein design , 2006, Nucleic Acids Res..

[82]  Michele Vendruscolo,et al.  Prediction of "aggregation-prone" and "aggregation-susceptible" regions in proteins associated with neurodegenerative diseases. , 2005, Journal of molecular biology.

[83]  Salvador Ventura,et al.  Short amino acid stretches can mediate amyloid formation in globular proteins: the Src homology 3 (SH3) case. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[84]  Sandeep Kumar,et al.  GAP: towards almost 100 percent prediction for β-strand-mediated aggregating peptides with distinct morphologies , 2014, Bioinform..

[85]  J. Yon Protein folding in the post‐genomic era , 2002, Journal of cellular and molecular medicine.

[86]  G. Gilliland,et al.  Structure-based engineering of a monoclonal antibody for improved solubility. , 2010, Protein engineering, design & selection : PEDS.

[87]  Komal Sharma,et al.  A PRACTICAL OVERVIEW OF QUANTITATIVE STRUCTURE- ACTIVITY RELATIONSHIP , 2016 .

[88]  Vladimir I Razinkov,et al.  Native-state solubility and transfer free energy as predictive tools for selecting excipients to include in protein formulation development studies. , 2012, Journal of pharmaceutical sciences.

[89]  Joost J. J. van Durme,et al.  Solubis: optimize your protein , 2015, Bioinform..

[90]  Christopher J Roberts,et al.  Non‐native protein aggregation kinetics , 2007, Biotechnology and bioengineering.

[91]  C. Dobson,et al.  Rationalization of the effects of mutations on peptide andprotein aggregation rates , 2003, Nature.

[92]  William F Weiss,et al.  Principles, approaches, and challenges for predicting protein aggregation rates and shelf life. , 2009, Journal of pharmaceutical sciences.

[93]  Amedeo Caflisch,et al.  Prediction of aggregation rate and aggregation‐prone segments in polypeptide sequences , 2005, Protein science : a publication of the Protein Society.

[94]  David A. Phoenix,et al.  Prediction of Peptide and Protein Propensity for Amyloid Formation , 2014, PloS one.

[95]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[96]  Sukjoon Yoon,et al.  Analysis of Chameleon Sequences by Energy Decomposition on a Pairwise Per-residue Basis , 2006, The protein journal.

[97]  M. Michael Gromiha,et al.  CUPSAT: prediction of protein stability upon point mutations , 2006, Nucleic Acids Res..

[98]  H. Stanley,et al.  Molecular dynamics simulation of amyloid beta dimer formation. , 2004, Biophysical journal.

[99]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[100]  A. Robinson,et al.  Competing aggregation pathways for monoclonal antibodies , 2014, FEBS letters.

[101]  Alexander Tropsha,et al.  QSAR modeling of human serum protein binding with several modeling techniques utilizing structure-information representation. , 2006, Journal of medicinal chemistry.

[102]  Matteo Ramazzotti,et al.  Prediction of amyloid aggregation in vivo , 2011, EMBO reports.

[103]  Shinn-Ying Ho,et al.  Prediction and Analysis of Antibody Amyloidogenesis from Sequences , 2013, PloS one.

[104]  Stavros J Hamodrakas,et al.  Exploring the 'aggregation-prone' core of human Cystatin C: A structural study. , 2015, Journal of structural biology.

[105]  M. Vendruscolo,et al.  The Zyggregator method for predicting protein aggregation propensities. , 2008, Chemical Society reviews.

[106]  Francesc X. Avilés,et al.  AGGRESCAN: a server for the prediction and evaluation of "hot spots" of aggregation in polypeptides , 2007, BMC Bioinform..

[107]  V. Uversky,et al.  SS-Stabilizing Proteins Rationally: Intrinsic Disorder-Based Design of Stabilizing Disulphide Bridges in GFP , 2012, Journal of biomolecular structure & dynamics.

[108]  Robert H. Brown,et al.  An intersubunit disulfide bond prevents in vitro aggregation of a superoxide dismutase-1 mutant linked to familial amytrophic lateral sclerosis. , 2004, Biochemistry.

[109]  William F. Weiss,et al.  Computational design and biophysical characterization of aggregation-resistant point mutations for γD crystallin illustrate a balance of conformational stability and intrinsic aggregation propensity. , 2011, Biochemistry.

[110]  Peter M Tessier,et al.  Mutational analysis of domain antibodies reveals aggregation hotspots within and near the complementarity determining regions , 2011, Proteins.

[111]  S. Radford,et al.  Systematic examination of polymorphism in amyloid fibrils by molecular-dynamics simulation. , 2011, Biophysical journal.

[112]  Sukjoon Yoon,et al.  Rapid assessment of contact‐dependent secondary structure propensity: Relevance to amyloidogenic sequences , 2005, Proteins.

[113]  J. Skolnick,et al.  A distance‐dependent atomic knowledge‐based potential for improved protein structure selection , 2001, Proteins.

[114]  A. Lenhoff,et al.  Light-scattering studies of protein solutions: role of hydration in weak protein-protein interactions. , 2005, Biophysical journal.

[115]  Hao Chen,et al.  Identification of amyloid fibril-forming segments based on structure and residue-based statistical potential , 2007, Bioinform..

[116]  Jun Guo,et al.  Prediction of amyloid fibril-forming segments based on a support vector machine , 2009, BMC Bioinformatics.

[117]  Bernhardt L Trout,et al.  Computational methods to predict therapeutic protein aggregation. , 2012, Methods in molecular biology.

[118]  Adrian H Elcock,et al.  Atomically detailed simulations of concentrated protein solutions: the effects of salt, pH, point mutations, and protein concentration in simulations of 1000-molecule systems. , 2006, Journal of the American Chemical Society.

[119]  Zhide Hu,et al.  QSAR method for prediction of protein-peptide binding affinity: application to MHC class I molecule HLA-A*0201. , 2007, Journal of molecular graphics & modelling.

[120]  A. Lenhoff,et al.  A consistent experimental and modeling approach to light-scattering studies of protein-protein interactions in solution. , 2005, Biophysical journal.

[121]  Steven J Shire,et al.  Challenges in the development of high protein concentration formulations. , 2004, Journal of pharmaceutical sciences.

[122]  M. Gruebele Downhill protein folding: evolution meets physics. , 2005, Comptes rendus biologies.

[123]  Stavros J. Hamodrakas,et al.  A Consensus Method for the Prediction of ‘Aggregation-Prone’ Peptides in Globular Proteins , 2013, PloS one.

[124]  William F. Weiss,et al.  Molecular level insights into thermally induced α-chymotrypsinogen A amyloid aggregation mechanism and semiflexible protofibril morphology. , 2010, Biochemistry.

[125]  Andrzej Kolinski,et al.  CABS-flex: server for fast simulation of protein structure fluctuations , 2013, Nucleic Acids Res..

[126]  V. Uversky,et al.  Conformational constraints for amyloid fibrillation: the importance of being unfolded. , 2004, Biochimica et biophysica acta.

[127]  Bonnie Berger,et al.  STITCHER: Dynamic assembly of likely amyloid and prion β-structures from secondary structure predictions , 2011, Proteins.

[128]  G. Schreiber,et al.  Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details. , 2009, Protein engineering, design & selection : PEDS.