A statistically derived parameterization for the collagen triple‐helix

The triple‐helix is a unique secondary structural motif found primarily within the collagens. In collagen, it is a homo‐ or hetero‐tripeptide with a repeating primary sequence of (Gly‐X‐Y)n, displaying characteristic peptide backbone dihedral angles. Studies of bulk collagen fibrils indicate that the triple‐helix must be a highly repetitive secondary structure, with very specific constraints. Primary sequence analysis shows that most collagen molecules are primarily triple‐helical; however, no high‐resolution structure of any entire protein is yet available. Given the drastic morphological differences in self‐assembled collagen structures with subtle changes in assembly conditions, a detailed knowledge of the relative locations of charged and sterically bulky residues in collagen is desirable. Its repetitive primary sequence and highly conserved secondary structure make collagen, and the triple‐helix in general, an ideal candidate for a general parameterization for prediction of residue locations and for the use of a helical wheel in the prediction of residue orientation. Herein, a statistical analysis of the currently available high‐resolution X‐ray crystal structures of model triple‐helical peptides is performed to produce an experimentally based parameter set for predicting peptide backbone and Cβ atom locations for the triple‐helix. Unlike existing homology models, this allows easy prediction of an entire triple‐helix structure based on all existing high‐resolution triple‐helix structures, rather than only on a single structure or on idealized parameters. Furthermore, regional differences based on the helical propensity of residues may be readily incorporated. The parameter set is validated in terms of the predicted bond lengths, backbone dihedral angles, and interchain hydrogen bonding.

[1]  H. Berman,et al.  Staggered molecular packing in crystals of a collagen-like peptide with a single charged pair. , 2000, Journal of molecular biology.

[2]  A. D. McLachlan,et al.  Rapid comparison of protein structures , 1982 .

[3]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[4]  J. Ramshaw,et al.  Peptide investigations of pairwise interactions in the collagen triple-helix. , 2002, Journal of molecular biology.

[5]  R. Berisio,et al.  Crystal structure of the collagen triple helix model [(Pro‐Pro‐Gly)10]3 , 2002, Protein science : a publication of the Protein Society.

[6]  H M Berman,et al.  Crystal and molecular structure of a collagen-like peptide at 1.9 A resolution. , 1994, Science.

[7]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[8]  J. Skolnick,et al.  De Novo Predictions of the Quaternary Structure of Leucine Zippers and Other Coiled Coils , 1999 .

[9]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[10]  B. Brodsky,et al.  The triple‐lielix motif in proteins , 1995, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[11]  Conrad C. Huang,et al.  Computational investigations of structural changes resulting from point mutations in a collagen-like peptide. , 1999, Biopolymers.

[12]  K. Beck,et al.  Supercoiled Protein Motifs: The Collagen Triple-Helix and the α-Helical Coiled Coil , 1998 .

[13]  Helen M. Berman,et al.  Sequence dependent conformational variations of collagen triple-helical structure , 1999, Nature Structural Biology.

[14]  J. Ramshaw,et al.  Amino acid propensities for the collagen triple-helix. , 2000, Biochemistry.

[15]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[16]  H M Berman,et al.  Hydration structure of a collagen peptide. , 1995, Structure.

[17]  Shizuhiko Nishisato,et al.  Elements of Dual Scaling: An Introduction To Practical Data Analysis , 1993 .

[18]  A. Kajava,et al.  Review: proteins with repeated sequence--structural prediction and modeling. , 2001, Journal of structural biology.

[19]  M. Schiffer,et al.  Use of helical wheels to represent the structures of proteins and to identify segments with helical potential. , 1967, Biophysical journal.

[20]  K. Okuyama,et al.  Crystal structure analysis of collagen model peptide (Pro-pro-Gly)10. , 1998, Journal of biochemistry.

[21]  P S Kim,et al.  Repacking protein cores with backbone freedom: structure prediction for coiled coils. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[22]  W. Tzou,et al.  Restraint-driven formation of alpha-helical coiled coils in molecular dynamics simulations. , 1999, Biopolymers.

[23]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[24]  K. Mayo,et al.  NMR and x-ray studies of collagen model peptides. , 1996, Biopolymers.

[25]  Ultrastructure and assembly of segmental long spacing collagen studied by atomic force microscopy. , 2001, Micron.

[26]  K. Okuyama,et al.  Structure analysis of a collagen-model peptide with a (Pro-Hyp-Gly) sequence repeat. , 1999, Journal of biochemistry.

[27]  M F Paige,et al.  A study of fibrous long spacing collagen ultrastructure and assembly by atomic force microscopy. , 2001, Micron.

[28]  Josef Schmee,et al.  Outliers in Statistical Data (2nd ed.) , 1986 .

[29]  F. Crick,et al.  The molecular structure of collagen. , 1961, Journal of molecular biology.

[30]  R. Berisio,et al.  Structural bases of collagen stabilization induced by proline hydroxylation. , 2001, Biopolymers.

[31]  G. N. Ramachandran,et al.  MOLECULAR STRUCTURE OF COLLAGEN. , 1963, International review of connective tissue research.

[32]  K. Okuyama,et al.  A New Structural Model for Collagen , 1977 .

[33]  R. Berisio,et al.  X-ray crystallographic determination of a collagen-like peptide with the repeating sequence (Pro-Pro-Gly). , 1998, Journal of molecular biology.