A reference database for circular dichroism spectroscopy covering fold and secondary structure space

MOTIVATION Circular Dichroism (CD) spectroscopy is a long-established technique for studying protein secondary structures in solution. Empirical analyses of CD data rely on the availability of reference datasets comprised of far-UV CD spectra of proteins whose crystal structures have been determined. This article reports on the creation of a new reference dataset which effectively covers both secondary structure and fold space, and uses the higher information content available in synchrotron radiation circular dichroism (SRCD) spectra to more accurately predict secondary structure than has been possible with existing reference datasets. It also examines the effects of wavelength range, structural redundancy and different means of categorizing secondary structures on the accuracy of the analyses. In addition, it describes a novel use of hierarchical cluster analyses to identify protein relatedness based on spectral properties alone. The databases are shown to be applicable in both conventional CD and SRCD spectroscopic analyses of proteins. Hence, by combining new bioinformatics and biophysical methods, a database has been produced that should have wide applicability as a tool for structural molecular biology.

[1]  N. Sreerama,et al.  Estimation of the number of α‐helical and β‐strand segments in proteins using circular dichroism spectroscopy , 2008, Protein science : a publication of the Protein Society.

[2]  J. Brahms,et al.  Determination of protein secondary structure in solution by vacuum ultraviolet circular dichroism. , 1980, Journal of molecular biology.

[3]  Frances M. G. Pearl,et al.  The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis , 2004, Nucleic Acids Res..

[4]  B. Wallace,et al.  Synchrotron radiation circular dichroism spectroscopy of proteins and applications in structural and functional genomics. , 2006, Chemical Society reviews.

[5]  R. Woody,et al.  Theory of Circular Dichroism of Proteins , 1996 .

[6]  T. Keiderling,et al.  Systematic comparison of statistical analyses of electronic and vibrational circular dichroism for secondary structure prediction of selected proteins. , 1991, Biochemistry.

[7]  Lee Whitmore,et al.  The Protein Circular Dichroism Data Bank (PCDDB): A bioinformatics and spectroscopic resource , 2005, Proteins.

[8]  N. Sreerama,et al.  Estimation of protein secondary structure from circular dichroism spectra: comparison of CONTIN, SELCON, and CDSSTR methods with an expanded reference set. , 2000, Analytical biochemistry.

[9]  Nichole Reisdorph,et al.  Protein identification and profiling with mass spectrometry , 2003 .

[10]  Erik Goormaghtigh,et al.  Rationally selected basis proteins: A new approach to selecting proteins for spectroscopic secondary structure analysis , 2003, Protein science : a publication of the Protein Society.

[11]  W. C. Johnson,et al.  Analyzing protein circular dichroism spectra for accurate secondary structures , 1999, Proteins.

[12]  G. Fasman Circular Dichroism and the Conformational Analysis of Biomolecules , 1996, Springer US.

[13]  N. C. Price,et al.  How to study proteins by circular dichroism. , 2005, Biochimica et biophysica acta.

[14]  Andrew J. Miles,et al.  Calibration and standardisation of synchrotron radiation and conventional circular dichroism spectrometers. Part 2: Factors affecting magnitude and wavelength , 2005 .

[15]  Frank Wien,et al.  Redetermination of the extinction coefficient of camphor-10-sulfonic acid, a calibration standard for circular dichroism spectroscopy. , 2004, Analytical biochemistry.

[16]  Jonathan G. Lees,et al.  CDtool-an integrated software package for circular dichroism spectroscopic data processing, analysis, and archiving. , 2004, Analytical biochemistry.

[17]  M C Manning,et al.  Tyrosine, phenylalanine, and disulfide contributions to the circular dichroism of proteins: circular dichroism spectra of wild-type and mutant bovine pancreatic trypsin inhibitor. , 1999, Biochemistry.

[18]  M. Sternberg,et al.  Left-handed polyproline II helices commonly occur in globular proteins. , 1993, Journal of molecular biology.

[19]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[20]  E. Goormaghtigh,et al.  The optimization of protein secondary structure determination with infrared and circular dichroism spectra. , 2004, European journal of biochemistry.

[21]  B. Wallace,et al.  Differential absorption flattening optical effects are significant in the circular dichroism spectra of large membrane fragments. , 1987, Biochemistry.

[22]  David T. Jones,et al.  Protein superfamilles and domain superfolds , 1994, Nature.

[23]  Jonathan D. Hirst,et al.  Charge-transfer transitions in protein circular dichroism spectra , 2004 .

[24]  W C Johnson,et al.  Extending CD spectra of proteins to 168 nm improves the analysis for secondary structures. , 1992, Analytical biochemistry.

[25]  Frank Wien,et al.  Biomedical applications of synchrotron radiation circular dichroism spectroscopy: identification of mutant proteins associated with disease and development of a reference database for fold motifs. , 2004, Faraday discussions.

[26]  A. Brunger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. , 1992 .

[27]  Johnson Wc,et al.  Information content in the circular dichroism of proteins. , 1981 .

[28]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[29]  Lee Whitmore,et al.  DICHROWEB: an interactive website for the analysis of protein secondary structure from circular dichroism spectra , 2002, Bioinform..

[30]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[31]  Narasimha Sreerama,et al.  Structural composition of βI‐ and βII‐proteins , 2003 .

[32]  R. W. Janes,et al.  Synchrotron radiation circular dichroism spectroscopy of proteins: secondary structure, fold recognition and structural genomics. , 2001, Current opinion in chemical biology.

[33]  W C Johnson,et al.  Information content in the circular dichroism of proteins. , 1981, Biochemistry.

[34]  B A Wallace,et al.  Calcium Fluoride Micro Cells for Synchrotron Radiation Circular Dichroism Spectroscopy , 2005, Applied spectroscopy.

[35]  Markus P. Fülscher,et al.  Theoretical Study of the Electronic Spectroscopy of Peptides. III. Charge-Transfer Transitions in Polypeptides , 1998 .

[36]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[37]  B A Wallace,et al.  Synchrotron radiation circular dichroism spectroscopy: vacuum ultraviolet irradiation does not damage protein integrity. , 2001, Journal of synchrotron radiation.

[38]  K. Gekko,et al.  Secondary-structure analysis of proteins by vacuum-ultraviolet circular dichroism spectroscopy. , 2004, Journal of biochemistry.

[39]  A. Brünger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures , 1992, Nature.

[40]  Robert W. Janes,et al.  Bioinformatics analyses of circular dichroism protein reference databases , 2005, Bioinform..

[41]  Y H Chen,et al.  A new approach to the calculation of secondary structures of globular proteins by optical rotatory dispersion and circular dichroism. , 1971, Biochemical and biophysical research communications.

[42]  P. V. von Hippel,et al.  Calculation of protein extinction coefficients from amino acid sequence data. , 1989, Analytical biochemistry.

[43]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[44]  Robert W. Janes,et al.  Calibration and Standardisation of Synchrotron Radiation Circular Dichroism and Conventional Circular Dichroism Spectrophotometers , 2003 .

[45]  S. Provencher,et al.  Estimation of globular protein secondary structure from circular dichroism. , 1981, Biochemistry.

[46]  Lee Whitmore,et al.  DICHROWEB, an online server for protein secondary structure analyses from circular dichroism spectroscopic data , 2004, Nucleic Acids Res..

[47]  B. Wallace,et al.  Synchrotron radiation circular-dichroism spectroscopy as a tool for investigating protein structures. , 2000, Journal of synchrotron radiation.

[48]  Frances M. G. Pearl,et al.  The CATH domain structure database. , 2005, Methods of biochemical analysis.