CPAD, Curated Protein Aggregation Database: A Repository of Manually Curated Experimental Data on Protein and Peptide Aggregation

Accurate distinction between peptide sequences that can form amyloid-fibrils or amorphous β-aggregates, identification of potential aggregation prone regions in proteins, and prediction of change in aggregation rate of a protein upon mutation(s) are critical to research on protein misfolding diseases, such as Alzheimer’s and Parkinson’s, as well as biotechnological production of protein based therapeutics. We have developed a Curated Protein Aggregation Database (CPAD), which has collected results from experimental studies performed by scientific community aimed at understanding protein/peptide aggregation. CPAD contains more than 2300 experimentally observed aggregation rates upon mutations in known amyloidogenic proteins. Each entry includes numerical values for the following parameters: change in rate of aggregation as measured by fluorescence intensity or turbidity, name and source of the protein, Uniprot and Protein Data Bank codes, single point as well as multiple mutations, and literature citation. The data in CPAD has been supplemented with five different types of additional information: (i) Amyloid fibril forming hexa-peptides, (ii) Amorphous β-aggregating hexa-peptides, (iii) Amyloid fibril forming peptides of different lengths, (iv) Amyloid fibril forming hexa-peptides whose crystal structures are available in the Protein Data Bank (PDB) and (v) Experimentally validated aggregation prone regions found in amyloidogenic proteins. Furthermore, CPAD is linked to other related databases and resources, such as Uniprot, Protein Data Bank, PUBMED, GAP, TANGO, WALTZ etc. We have set up a web interface with different search and display options so that users have the ability to get the data in multiple ways. CPAD is freely available at http://www.iitm.ac.in/bioinfo/CPAD/. The potential applications of CPAD have also been discussed.

[1]  Joost J. J. van Durme,et al.  WALTZ-DB: a benchmark database of amyloidogenic hexapeptides , 2015, Bioinform..

[2]  S. Bardien,et al.  Neurodegenerative disorders: Dysregulation of a carefully maintained balance? , 2014, Experimental Gerontology.

[3]  Amedeo Caflisch,et al.  Computational models for the prediction of polypeptide aggregation propensity. , 2006, Current opinion in chemical biology.

[4]  Fabrizio Chiti,et al.  Studies of the aggregation of mutant proteins in vitro provide insights into the genetics of amyloid diseases , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Markus J Buehler,et al.  Nanomechanics of functional and pathological amyloid materials. , 2011, Nature nanotechnology.

[6]  Michael H Hecht,et al.  Mutations enhance the aggregation propensity of the Alzheimer's A beta peptide. , 2008, Journal of molecular biology.

[7]  Sandeep Kumar,et al.  GAP: towards almost 100 percent prediction for β-strand-mediated aggregating peptides with distinct morphologies , 2014, Bioinform..

[8]  D. Sundar,et al.  Protein Aggregation in Neurodegenerative Diseases: Insights from Computational Analyses , 2012 .

[9]  Z. Yue,et al.  Neuronal aggregates: formation, clearance, and spreading. , 2015, Developmental cell.

[10]  A. Cavalli,et al.  The role of aromaticity, exposed surface, and dipole moment in determining protein aggregation rates , 2004, Protein science : a publication of the Protein Society.

[11]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[12]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[13]  David Eisenberg,et al.  Atomic structures of amyloid cross-beta spines reveal varied steric zippers. , 2007, Nature.

[14]  Ehud Gazit,et al.  Amyloids: not only pathological agents but also ordered nanomaterials. , 2008, Angewandte Chemie.

[15]  Sandeep Kumar,et al.  Developability of Biotherapeutics: Computational Approaches , 2015 .

[16]  Bernhardt L Trout,et al.  Aggregation in protein-based biotherapeutics: computational studies and tools to identify aggregation-prone regions. , 2011, Journal of pharmaceutical sciences.

[17]  C. Dobson,et al.  Supersaturation is a major driving force for protein aggregation in neurodegenerative diseases. , 2015, Trends in pharmacological sciences.

[18]  David S. Goodsell,et al.  The RCSB Protein Data Bank: views of structural biology for basic and applied research and education , 2014, Nucleic Acids Res..

[19]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[20]  Heather T. McFarlane,et al.  Atomic structures of amyloid cross-β spines reveal varied steric zippers , 2007, Nature.

[21]  Devadasan Velmurugan,et al.  How do thermophilic proteins resist aggregation? , 2012, Proteins.

[22]  Sandeep Kumar,et al.  On the Role of Aggregation Prone Regions in Protein Evolution, Stability, and Enzymatic Catalysis: Insights from Diverse Analyses , 2013, PLoS Comput. Biol..

[23]  E. Cohen Aging, Protein Aggregation, Chaperones, and Neurodegenerative Disorders: Mechanisms of Coupling and Therapeutic Opportunities , 2012, Rambam Maimonides medical journal.

[24]  Sandeep Kumar,et al.  Autoimmune Responses to Soluble Aggregates of Amyloidogenic Proteins Involved in Neurodegenerative Diseases: Overlapping Aggregation Prone and Autoimmunogenic regions , 2016, Scientific Reports.

[25]  Stavros J. Hamodrakas,et al.  A Consensus Method for the Prediction of ‘Aggregation-Prone’ Peptides in Globular Proteins , 2013, PloS one.

[26]  Sandeep Kumar,et al.  Distinct position-specific sequence features of hexa-peptides that form amyloid-fibrils: application to discriminate between amyloid fibril and amorphous β-aggregate forming peptide sequences , 2013, BMC Bioinformatics.

[27]  Lisa D. Cabrita,et al.  The H50Q Mutation Induces a 10-fold Decrease in the Solubility of α-Synuclein* , 2014, The Journal of Biological Chemistry.