Computational methods to predict therapeutic protein aggregation.

Protein based biotherapeutics have emerged as a successful class of pharmaceuticals. However, these macromolecules endure a variety of physicochemical degradations during manufacturing, shipping, and storage, which may adversely impact the drug product quality. Of these degradations, the irreversible self-association of therapeutic proteins to form aggregates is a major challenge in the formulation of these molecules. Tools to predict and mitigate protein aggregation are, therefore, of great interest to biopharmaceutical research and development. In this chapter, a number of such computational tools developed to understand and predict the various steps involved in protein aggregation are described. These tools can be grouped into three general classes: unfolding kinetics and native state thermal stability, colloidal stability, and sequence/structure based aggregation liabilities. Chapter sections introduce each class by discussing how these predictive tools provide insight into the molecular events leading to protein aggregation. The computational methods are then explained in detail along with their advantages and limitations.

[1]  Jan H. Jensen,et al.  Very fast empirical prediction and rationalization of protein pKa values , 2005, Proteins.

[2]  William F. Weiss,et al.  Computational design and biophysical characterization of aggregation-resistant point mutations for γD crystallin illustrate a balance of conformational stability and intrinsic aggregation propensity. , 2011, Biochemistry.

[3]  Babatunde A. Ogunnaike,et al.  Multi-variate approach to global protein aggregation behavior and kinetics: effects of pH, NaCl, and temperature for alpha-chymotrypsinogen A. , 2010, Journal of pharmaceutical sciences.

[4]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[5]  Haipeng Gong,et al.  Local secondary structure content predicts folding rates for simple, two-state proteins. , 2003, Journal of molecular biology.

[6]  G. Gilliland,et al.  Structure-based engineering of a monoclonal antibody for improved solubility. , 2010, Protein engineering, design & selection : PEDS.

[7]  J M Ribeiro,et al.  Isoelectric points of proteins: theoretical determination. , 1989, Analytical biochemistry.

[8]  Theodore W Randolph,et al.  Roles of conformational stability and colloidal stability in the aggregation of recombinant human granulocyte colony‐stimulating factor , 2003, Protein science : a publication of the Protein Society.

[9]  Christopher J Roberts,et al.  A Lumry-Eyring nucleated polymerization model of protein aggregation kinetics: 1. Aggregation with pre-equilibrated unfolding. , 2007, The journal of physical chemistry. B.

[10]  T. Yeates,et al.  Arrangement of subunits and ordering of β-strands in an amyloid sheet , 2002, Nature Structural Biology.

[11]  Henry Eyring,et al.  Conformation Changes of Proteins , 1954 .

[12]  Christopher M. Dobson,et al.  Kinetic partitioning of protein folding and aggregation , 2002, Nature Structural Biology.

[13]  L. Wyns,et al.  Atomic structure of a nanobody-trapped domain-swapped dimer of an amyloidogenic β2-microglobulin variant , 2011, Proceedings of the National Academy of Sciences of the United States of America.

[14]  S. Radford,et al.  Systematic examination of polymorphism in amyloid fibrils by molecular-dynamics simulation. , 2011, Biophysical journal.

[15]  Heather T. McFarlane,et al.  Atomic structures of amyloid cross-β spines reveal varied steric zippers , 2007, Nature.

[16]  Amedeo Caflisch,et al.  Computational models for the prediction of polypeptide aggregation propensity. , 2006, Current opinion in chemical biology.

[17]  D. Mould,et al.  Development of hydrophobicity parameters to analyze proteins which bear post- or cotranslational modifications. , 1991, Analytical biochemistry.

[18]  Michail Yu. Lobanov,et al.  Prediction of Amyloidogenic and Disordered Regions in Protein Chains , 2006, PLoS Comput. Biol..

[19]  L. Serrano,et al.  Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins , 2004, Nature Biotechnology.

[20]  Mireille Dumoulin,et al.  Identification of the core structure of lysozyme amyloid fibrils by proteolysis. , 2006, Journal of molecular biology.

[21]  Ali Akbar Saboury,et al.  Large Proteins Have a Great Tendency to Aggregate but a Low Propensity to Form Amyloid Fibrils , 2011, PloS one.

[22]  Naresh Chennamsetty,et al.  Prediction of protein binding regions , 2011, Proteins.

[23]  L. Serpell,et al.  The molecular basis of amyloidosis , 1997, Cellular and Molecular Life Sciences CMLS.

[24]  A. Arabshahi,et al.  [6] Second virial coefficient as predictor in protein crystal growth. , 1997, Methods in enzymology.

[25]  Glyn L. Devlin,et al.  Protein particulates: another generic form of protein aggregation? , 2007, Biophysical journal.

[26]  J. Hansen,et al.  Nonmonotonic variation with salt concentration of the second virial coefficient in protein solutions. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Robert A. Grothe,et al.  Structure of the cross-β spine of amyloid-like fibrils , 2005, Nature.

[28]  Christopher J Roberts,et al.  Non‐native protein aggregation kinetics , 2007, Biotechnology and bioengineering.

[29]  M. Kanehisa,et al.  Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. , 1996, Protein engineering.

[30]  Paul Labute,et al.  Calibrative approaches to protein solubility modeling of a mutant series using physicochemical descriptors , 2010, J. Comput. Aided Mol. Des..

[31]  R Nussinov,et al.  A proposed structural model for amyloid fibril elongation: domain swapping forms an interdigitating beta-structure polymer. , 2001, Protein engineering.

[32]  Bernhardt L Trout,et al.  Aggregation in protein-based biotherapeutics: computational studies and tools to identify aggregation-prone regions. , 2011, Journal of pharmaceutical sciences.

[33]  A George,et al.  Predicting protein crystallization from a dilute solution property. , 1994, Acta crystallographica. Section D, Biological crystallography.

[34]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[35]  Hui Chen,et al.  Secondary structure length as a determinant of folding rate of proteins with two‐ and three‐state kinetics , 2007, Proteins.

[36]  William J Welsh,et al.  Detecting hidden sequence propensity for amyloid fibril formation , 2004, Protein science : a publication of the Protein Society.

[37]  M. Michael Gromiha,et al.  A Statistical Method for Predicting Protein Unfolding Rates from Amino Acid Sequence , 2006, J. Chem. Inf. Model..

[38]  Brian M. Murphy,et al.  Stability of Protein Pharmaceuticals: An Update , 2010, Pharmaceutical Research.

[39]  Susan Lindquist,et al.  Hsp90 and Environmental Stress Transform the Adaptive Value of Natural Genetic Variation , 2010, Science.

[40]  B Harihar,et al.  Refinement of the long-range order parameter in predicting folding rates of two-state proteins. , 2009, Biopolymers.

[41]  A. Miranker,et al.  A native to amyloidogenic transition regulated by a backbone trigger , 2006, Nature Structural &Molecular Biology.

[42]  L. Serpell,et al.  Sequence Determinants for Amyloid Fibrillogenesis of Human α-Synuclein , 2007 .

[43]  A. Doig,et al.  Amyloidogenic sequences in native protein structures , 2010, Protein science : a publication of the Protein Society.

[44]  S. Radford,et al.  Competition between Intramolecular and Intermolecular Interactions in an Amyloid-Forming Protein , 2009, Journal of molecular biology.

[45]  Ruth Nussinov,et al.  Simulations as analytical tools to understand protein aggregation and predict amyloid conformation. , 2006, Current opinion in chemical biology.

[46]  C. Dobson,et al.  Rationalization of the effects of mutations on peptide andprotein aggregation rates , 2003, Nature.

[47]  William F Weiss,et al.  Principles, approaches, and challenges for predicting protein aggregation rates and shelf life. , 2009, Journal of pharmaceutical sciences.

[48]  Amedeo Caflisch,et al.  Prediction of aggregation rate and aggregation‐prone segments in polypeptide sequences , 2005, Protein science : a publication of the Protein Society.

[49]  Bernhardt L Trout,et al.  Aggregation-prone motifs in human immunoglobulin G. , 2009, Journal of molecular biology.

[50]  S. Selvaraj,et al.  Application of long‐range order to predict unfolding rates of two‐state proteins , 2011, Proteins.

[51]  M. Ramirez-Alvarado,et al.  Structural Insights into the Role of Mutations in Amyloidogenesis* , 2008, Journal of Biological Chemistry.

[52]  L Serrano,et al.  Elucidating the folding problem of alpha-helices: local motifs, long-range electrostatics, ionic-strength dependence and prediction of NMR parameters. , 1998, Journal of molecular biology.

[53]  Fabrizio Chiti,et al.  A causative link between the structure of aberrant protein oligomers and their toxicity. , 2010, Nature chemical biology.

[54]  Shuangye Yin,et al.  Eris: an automated estimator of protein stability , 2007, Nature Methods.

[55]  Michele Vendruscolo,et al.  Prediction of "aggregation-prone" and "aggregation-susceptible" regions in proteins associated with neurodegenerative diseases. , 2005, Journal of molecular biology.

[56]  Michele Vendruscolo,et al.  Physicochemical principles that regulate the competition between functional and dysfunctional association of proteins , 2009, Proceedings of the National Academy of Sciences.

[57]  W. William Wilson,et al.  Relation between the solubility of proteins in aqueous solutions and the second virial coefficient of the solution , 1999 .

[58]  Louise C. Serpell,et al.  A simple algorithm locates β‐strands in the amyloid fibril core of α‐synuclein, Aβ, and tau using the amino acid sequence alone , 2007 .

[59]  Fabrizio Chiti,et al.  Studies of the aggregation of mutant proteins in vitro provide insights into the genetics of amyloid diseases , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Michele Vendruscolo,et al.  Prediction of the absolute aggregation rates of amyloidogenic polypeptide chains. , 2004, Journal of molecular biology.

[61]  M A Roseman,et al.  Hydrophilicity of polar amino acid side-chains is markedly reduced by flanking peptide bonds. , 1988, Journal of molecular biology.

[62]  L. Serpell,et al.  Common core structure of amyloid fibrils by synchrotron X-ray diffraction. , 1997, Journal of molecular biology.

[63]  Feng Ding,et al.  Emergence of Protein Fold Families through Rational Design , 2006, PLoS Comput. Biol..

[64]  A. Rosenberg,et al.  Effects of protein aggregates: An immunologic perspective , 2006, The AAPS Journal.

[65]  Thomas M Truskett,et al.  Coarse-grained strategy for modeling protein stability in concentrated solutions. , 2005, Biophysical journal.

[66]  Francesc X. Avilés,et al.  AGGRESCAN: a server for the prediction and evaluation of "hot spots" of aggregation in polypeptides , 2007, BMC Bioinform..

[67]  A. Rathore,et al.  Quality by design for biopharmaceuticals , 2009, Nature Biotechnology.

[68]  Sandeep Kumar,et al.  Coupling of Aggregation and Immunogenicity in Biotherapeutics: T- and B-Cell Immune Epitopes May Contain Aggregation-Prone Regions , 2011, Pharmaceutical Research.

[69]  D. Baker,et al.  The 3D profile method for identifying fibril-forming segments of proteins. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[70]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[71]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[72]  M. Gromiha,et al.  Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins. , 1999, Biophysical chemistry.

[73]  M. Manning,et al.  Aggregation of recombinant human interferon gamma: kinetics and structural transitions. , 1998, Journal of pharmaceutical sciences.

[74]  Jaewoon Jung,et al.  Topological determinants of protein unfolding rates , 2005, Proteins.

[75]  L. Serrano,et al.  Protein aggregation and amyloidosis: confusion of the kinds? , 2006, Current opinion in structural biology.

[76]  Silvio C. E. Tosatto,et al.  The PASTA server for protein aggregation prediction. , 2007, Protein engineering, design & selection : PEDS.

[77]  Michele Vendruscolo,et al.  Prediction of aggregation-prone regions in structured proteins. , 2008, Journal of molecular biology.

[78]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.

[79]  L. Serrano,et al.  A comparative study of the relationship between protein structure and beta-aggregation in globular and intrinsically disordered proteins. , 2004, Journal of molecular biology.

[80]  D. Tobias,et al.  Separating instability from aggregation propensity in γS-crystallin variants. , 2011, Biophysical journal.

[81]  Sandeep Nema,et al.  Protein Structural Conformation and Not Second Virial Coefficient Relates to Long-Term Irreversible Aggregation of a Monoclonal Antibody and Ovalbumin in Solution , 2006, Pharmaceutical Research.

[82]  C. C. Tomich,et al.  Site-directed mutagenesis to probe protein folding: evidence that the formation and aggregation of a bovine growth hormone folding intermediate are dissociable processes. , 1991, Biochemistry.

[83]  M. Karplus,et al.  Simulation of activation free energies in molecular systems , 1996 .

[84]  Wei Wang,et al.  Protein aggregation--pathways and influencing factors. , 2010, International journal of pharmaceutics.

[85]  John F. Carpenter,et al.  Physical Stability of Proteins in Aqueous Solution: Mechanism and Driving Forces in Nonnative Protein Aggregation , 2003, Pharmaceutical Research.

[86]  Piero Fariselli,et al.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure , 2005, Nucleic Acids Res..

[87]  Christopher J Roberts,et al.  Comparative effects of pH and ionic strength on protein-protein interactions, unfolding, and aggregation for IgG1 antibodies. , 2010, Journal of pharmaceutical sciences.

[88]  Bernhardt L. Trout,et al.  Design of therapeutic proteins with enhanced stability , 2009, Proceedings of the National Academy of Sciences.

[89]  Sandeep Kumar,et al.  Identification and Impact of Aggregation‐Prone Regions in Proteins and Therapeutic Monoclonal Antibodies , 2010 .

[90]  S L Mayo,et al.  Intrinsic beta-sheet propensities result from van der Waals interactions between side chains and the local backbone. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[91]  M. Gromiha,et al.  Comparison between long-range interactions and contact order in determining the folding rate of two-state proteins: application of long-range order to folding rate prediction. , 2001, Journal of molecular biology.

[92]  Bernhardt L Trout,et al.  Prediction of aggregation prone regions of therapeutic proteins. , 2010, The journal of physical chemistry. B.

[93]  David Eisenberg,et al.  β2-microglobulin forms 3D domain-swapped amyloid fibrils with disulfide linkages , 2010, Nature Structural &Molecular Biology.

[94]  Dmitry N Ivankov,et al.  Chain length is the main determinant of the folding rate for proteins with three‐state folding kinetics , 2003, Proteins.

[95]  Hao Chen,et al.  Identification of amyloid fibril-forming segments based on structure and residue-based statistical potential , 2007, Bioinform..

[96]  C. Roberts,et al.  Lumry-Eyring nucleated-polymerization model of protein aggregation kinetics. 2. Competing growth via condensation and chain polymerization. , 2009, The journal of physical chemistry. B.

[97]  C. Dobson,et al.  Competition between folding, native-state dimerisation and amyloid aggregation in beta-lactoglobulin. , 2009, Journal of molecular biology.

[98]  Sandeep Kumar,et al.  Potential aggregation prone regions in biotherapeutics , 2009, mAbs.

[99]  Sandeep Kumar,et al.  Potential Aggregation-Prone Regions in Complementarity-Determining Regions of Antibodies and Their Contribution Towards Antigen Recognition: A Computational Analysis , 2010, Pharmaceutical Research.

[100]  D. Thirumalai,et al.  Emerging ideas on the molecular basis of protein and peptide aggregation. , 2003, Current opinion in structural biology.

[101]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[102]  Fabrizio Chiti,et al.  Amyloid formation by globular proteins under native conditions. , 2009, Nature chemical biology.

[103]  Joan-Emma Shea,et al.  Coarse-grained models for protein aggregation. , 2011, Current opinion in structural biology.