Prediction of Collagen Stability from Amino Acid Sequence*

An algorithm was derived to relate the amino acid sequence of a collagen triple helix to its thermal stability. This calculation is based on the triple helical stabilization propensities of individual residues and their intermolecular and intramolecular interactions, as quantitated by melting temperature values of host-guest peptides. Experimental melting temperature values of a number of triple helical peptides of varying length and sequence were successfully predicted by this algorithm. However, predicted Tm values are significantly higher than experimental values when there are strings of oppositely charged residues or concentrations of like charges near the terminus. Application of the algorithm to collagen sequences highlights regions of unusually high or low stability, and these regions often correlate with biologically significant features. The prediction of stability from sequence indicates an understanding of the major forces maintaining this protein motif. The use of highly favorable KGE and KGD sequences is seen to complement the stabilizing effects of imino acids in modulating stability and may become dominant in the collagenous domains of bacterial proteins that lack hydroxyproline. The effect of single amino acid mutations in the X and Y positions can be evaluated with this algorithm. An interactive collagen stability calculator based on this algorithm is available online.

[1]  P. Byers,et al.  Stability related bias in residues replacing glycines within the collagen triple helix (Gly‐Xaa‐Yaa) in inherited connective tissue disorders , 2004, Human mutation.

[2]  E. Leikina,et al.  Type I collagen is thermally unstable at body temperature , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  H. Berman,et al.  Staggered molecular packing in crystals of a collagen-like peptide with a single charged pair. , 2000, Journal of molecular biology.

[4]  J. Ramshaw,et al.  Gly-X-Y tripeptide frequencies in collagen: a context for host-guest triple-helical peptides. , 1998, Journal of structural biology.

[5]  J. Moult,et al.  SNPs, protein structure, and disease , 2001, Human mutation.

[6]  J. Bujnicki,et al.  Streptococcal Scl1 and Scl2 Proteins Form Collagen-like Triple Helices* , 2002, The Journal of Biological Chemistry.

[7]  Helen M. Berman,et al.  Sequence dependent conformational variations of collagen triple-helical structure , 1999, Nature Structural Biology.

[8]  D. Dinakarpandian,et al.  Collagenase unwinds triple‐helical collagen prior to peptide bond hydrolysis , 2004, The EMBO journal.

[9]  C. Turnbough,et al.  Identification of the Immunodominant Protein and Other Proteins of the Bacillus anthracis Exosporium , 2003, Journal of bacteriology.

[10]  K. Piez,et al.  Equilibrium and kinetic studies of the helix-coil transition in alpha 1-CB2, a small peptide from collagen. , 1970, Biochemistry.

[11]  N. Morris,et al.  Thermal stability and folding of the collagen triple helix and the effects of mutations in osteogenesis imperfecta on the triple helix of type I collagen. , 1993, American journal of medical genetics.

[12]  H. Bächinger,et al.  Glycosylation/Hydroxylation-induced Stabilization of the Collagen Triple Helix , 2000, The Journal of Biological Chemistry.

[13]  Yujia Xu,et al.  Equilibrium thermal transitions of collagen model peptides , 2004, Protein science : a publication of the Protein Society.

[14]  J. Ramshaw,et al.  Stepwise construction of triple-helical heparin binding sites using peptide models. , 2004, Biochimica et biophysica acta.

[15]  J. Baum,et al.  Acid destabilization of a triple‐helical peptide model of the macrophage scavenger receptor , 1995, FEBS letters.

[16]  Pierre Lavigne,et al.  The role of position a in determining the stability and oligomerization state of α‐helical coiled coils: 20 amino acid stability coefficients in the hydrophobic core of proteins , 2008, Protein science : a publication of the Protein Society.

[17]  J. Baum,et al.  Two-dimensional NMR assignments and conformation of (Pro-Hyp-Gly)10 and a designed collagen triple-helical peptide. , 1993, Biochemistry.

[18]  J. Ramshaw,et al.  Amino acid propensities for the collagen triple-helix. , 2000, Biochemistry.

[19]  N. Inestrosa,et al.  Interaction of collagen-like peptide models of asymmetric acetylcholinesterase with glycosaminoglycans: spectroscopic studies of conformational changes and stability. , 2000, Biochemistry.

[20]  H. Berman,et al.  The crystal and molecular structure of a collagen-like peptide with a biologically relevant sequence. , 2001, Journal of molecular biology.

[21]  L. Fugger,et al.  Definition of MHC and T cell receptor contacts in the HLA-DR4restricted immunodominant epitope in type II collagen and characterization of collagen-induced arthritis in HLA-DR4 and human CD4 transgenic mice. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[22]  F. Crick,et al.  The molecular structure of collagen. , 1961, Journal of molecular biology.

[23]  K. Kadler,et al.  Assembly of type I collagen fibrils de novo. Between 37 and 41 degrees C the process is limited by micro-unfolding of monomers. , 1988, The Journal of biological chemistry.

[24]  L. Björck,et al.  Genome-based Identification and Analysis of Collagen-related Structural Motifs in Bacterial and Viral Proteins* , 2003, Journal of Biological Chemistry.

[25]  W. Cole,et al.  Characterization of an arginine 789 to cysteine substitution in alpha 1 (II) collagen chains of a patient with spondyloepiphyseal dysplasia. , 1993, The Journal of biological chemistry.

[26]  Shawn M. Sweeney,et al.  Mapping the Ligand-binding Sites and Disease-associated Mutations on the Most Abundant Protein in the Human, Type I Collagen* , 2002, The Journal of Biological Chemistry.

[27]  J. Ramshaw,et al.  Electrostatic interactions involving lysine make major contributions to collagen triple-helix stability. , 2005, Biochemistry.

[28]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[29]  S. Jimenez,et al.  Position of single amino acid substitutions in the collagen triple helix determines their effect on structure of collagen fibrils. , 2004, Journal of structural biology.

[30]  J. Ramshaw,et al.  Electrostatic interactions in collagen-like triple-helical peptides. , 1994, Biochemistry.

[31]  J. Baum,et al.  Transformation of the Mechanism of Triple-helix Peptide Folding in the Absence of a C-terminal Nucleation Domain and Its Implications for Mutations in Collagen Disorders* , 2004, Journal of Biological Chemistry.

[32]  M. Swindells,et al.  Intrinsic φ,ψ propensities of amino acids, derived from the coil regions of known structures , 1995, Nature Structural Biology.

[33]  B. Brodsky,et al.  Amino acid sequence environment modulates the disruption by osteogenesis imperfecta glycine substitutions in collagen-like peptides. , 1997, Biochemistry.

[34]  J. Baum,et al.  Stability junction at a common mutation site in the collagenous domain of the mannose binding lectin. , 2005, Biochemistry.

[35]  P. Beighton,et al.  Stickler-like syndrome due to a dominant negative mutation in the COL2A1 gene. , 1998, American journal of medical genetics.

[36]  Richard W. Farndale,et al.  Structure of the Integrin α2β1-binding Collagen Peptide , 2004 .

[37]  H. Bächinger,et al.  Sequence specific thermal stability of the collagen triple helix. , 1991, International journal of biological macromolecules.

[38]  K. Kivirikko,et al.  Collagens, modifying enzymes and their mutations in humans, flies and worms. , 2004, Trends in genetics : TIG.

[39]  L Regan,et al.  A thermodynamic scale for the beta-sheet forming tendencies of the amino acids. , 1994, Biochemistry.

[40]  J. Baum,et al.  NMR and CD spectroscopy show that imino acid restriction of the unfolded state leads to efficient folding. , 2003, Biochemistry.

[41]  W. DeGrado,et al.  A thermodynamic scale for the helix-forming tendencies of the commonly occurring amino acids. , 1990, Science.

[42]  K. Sutoh,et al.  Conformational change of the triple‐helical structure. II. Conformation of (Pro‐Pro‐Gly)n and (Pro‐Pro‐Gly)n(Ala‐Pro‐Gly)m(Pro‐Pro‐Gly)n in an aqueous solution , 1974 .

[43]  P. Privalov Stability of proteins. Proteins which do not present a single cooperative system. , 1982, Advances in protein chemistry.

[44]  J. Ramshaw,et al.  Peptide investigations of pairwise interactions in the collagen triple-helix. , 2002, Journal of molecular biology.

[45]  A. Persikov,et al.  Molecular structure of the collagen triple helix. , 2005, Advances in protein chemistry.

[46]  J. Ramshaw,et al.  Gly-Gly-containing triplets of low stability adjacent to a type III collagen epitope. , 1997, Biochemistry.

[47]  J. Ramshaw,et al.  Destabilization of osteogenesis imperfecta collagen-like model peptides correlates with the identity of the residue replacing glycine. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[48]  H M Berman,et al.  Crystal and molecular structure of a collagen-like peptide at 1.9 A resolution. , 1994, Science.