Aggregation in protein-based biotherapeutics: computational studies and tools to identify aggregation-prone regions.

Because of their large, complex, and conformationally heterogeneous structures, biotherapeutics are vulnerable to several physicochemical stresses encountered during the processing steps from production to administration. In particular, the formation of protein aggregates is a major concern. The greatest risk posed by aggregates is their potential to trigger immunogenic reactions. Hence, it is desirable to bring forward biotherapeutic drug candidates that show a low propensity for aggregation and, thus, improved developability. Here, we present a comprehensive review of computational studies of the sequence and structural factors that underpin protein and peptide aggregation. A number of computational approaches have been applied, including coarse-grained models, atomistic molecular simulations, and bioinformatic approaches. These studies have focused on both the mechanism of aggregation and the identification of potential aggregation-prone sequence and structural motifs. We also survey the computational tools available to predict aggregation in therapeutic proteins. The findings communicated here provide insights that could be useful in the rational design of therapeutic candidates with not only high potency and specificity but also improved stability and solubility. These sequence- and structure-based approaches can be applied to both novel and follow-on biotherapeutics.
