The optimization of protein secondary structure determination with infrared and circular dichroism spectra.

We have used the circular dichroism and infrared spectra of a specially designed 50 protein database [Oberg, K.A., Ruysschaert, J.M. & Goormaghtigh, E. (2003) Protein Sci. 12, 2015-2031] in order to optimize the accuracy of spectroscopic protein secondary structure determination using multivariate statistical analysis methods. The results demonstrate that when the proteins are carefully selected for the diversity in their structure, no smaller subset of the database contains the necessary information to describe the entire set. One conclusion of the paper is therefore that large protein databases, observing stringent selection criteria, are necessary for the prediction of unknown proteins. A second important conclusion is that only the comparison of analyses run on circular dichroism and infrared spectra independently is able to identify failed solutions in the absence of known structure. Interestingly, it was also found in the course of this study that the amide II band has high information content and could be used alone for secondary structure prediction in place of amide I.

[1]  N. Clark,et al.  Anomalous amide I infrared absorption of purple membrane. , 1979, Science.

[2]  Johnson Wc,et al.  Information content in the circular dichroism of proteins. , 1981 .

[3]  S. Provencher,et al.  Estimation of globular protein secondary structure from circular dichroism. , 1981, Biochemistry.

[4]  W C Johnson,et al.  Information content in the circular dichroism of proteins. , 1981, Biochemistry.

[5]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[6]  R. Glaeser,et al.  Peptide-chain secondary structure of bacteriorhodopsin. , 1983, Biophysical journal.

[7]  R. Jakobsen,et al.  An Algorithm for the Reproducible Spectral Subtraction of Water from the FT-IR Spectra of Proteins in Dilute Solutions and Adsorbed Monolayers , 1986 .

[8]  M. Cascio,et al.  Evaluation of methods for the prediction of membrane protein secondary structures. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[9]  H. Susi,et al.  Examination of the secondary structure of proteins by deconvolved FTIR spectra , 1986, Biopolymers.

[10]  W C Johnson,et al.  Variable selection method improves the prediction of protein secondary structure from circular dichroism spectra. , 1987, Analytical biochemistry.

[11]  H. Mantsch,et al.  New insight into protein secondary structure from resolution-enhanced infrared spectra. , 1988, Biochimica et biophysica acta.

[12]  S. Venyaminov,et al.  Quantitative IR spectrophotometry of peptide compounds in water (H2O) solutions. III. Estimation of the protein secondary structure , 1990, Biopolymers.

[13]  M. Pézolet,et al.  Determination of the secondary structure content of proteins in aqueous solutions from their amide I and amide II infrared bands. Comparison between classical and partial least-squares methods. , 1990, Biochemistry.

[14]  E Goormaghtigh,et al.  Secondary structure and dosage of soluble and membrane proteins by attenuated total reflection Fourier-transform infrared spectroscopy on hydrated films. , 1990, European journal of biochemistry.

[15]  R. Mitchell,et al.  Determination of protein secondary structure using factor analysis of infrared spectra. , 1990, Biochemistry.

[16]  S. Venyaminov,et al.  Quantitative IR spectrophotometry of peptide compounds in water (H2O) solutions. I. Spectral parameters of amino acid residue absorption bands , 1990, Biopolymers.

[17]  W. Krueger,et al.  An infrared and circular dichroism combined approach to the analysis of protein secondary structure. , 1991, Analytical biochemistry.

[18]  G. Fasman,et al.  Convex constraint analysis: a natural deconvolution of circular dichroism curves of proteins. , 1991, Protein engineering.

[19]  G. Böhm,et al.  Quantitative analysis of protein far UV circular dichroism spectra by neural networks. , 1992, Protein engineering.

[20]  A. Bairoch,et al.  The SWISS-PROT protein sequence data bank. , 1991, Nucleic acids research.

[21]  Differentiation between transmembrane helices and peripheral helices by the deconvolution of circular dichroism spectra of membrane proteins , 1992, Protein science : a publication of the Protein Society.

[22]  P. Haris,et al.  Protein secondary structure from Fourier transform infrared and/or circular dichroism spectra. , 1993, Analytical biochemistry.

[23]  L. Miercke,et al.  Secondary structure analysis of purified functional CHIP28 water channels by CD and FTIR spectroscopy. , 1993, Biochemistry.

[24]  N. Sreerama,et al.  A self-consistent method for the analysis of protein secondary structure from circular dichroism. , 1993, Analytical biochemistry.

[25]  M. A. Andrade,et al.  Evaluation of secondary structure of proteins from UV circular dichroism spectra using an unsupervised learning neural network. , 1993, Protein engineering.

[26]  Venyaminov SYu,et al.  Determination of protein tertiary structure class from circular dichroism spectra. , 1994, Analytical biochemistry.

[27]  Principal component analysis of Fourier transform infrared and/or circular dichroism spectra of proteins applied in a calibration of protein secondary structure. , 1994, Analytical biochemistry.

[28]  E Goormaghtigh,et al.  Determination of soluble and membrane protein structure by Fourier transform infrared spectroscopy. I. Assignments and model compounds. , 1994, Sub-cellular biochemistry.

[29]  W. Bannister,et al.  Prediction of protein secondary structure from circular dichroism spectra: an attempt to solve the problem of the best-fitting reference protein subsets. , 1995, Analytical biochemistry.

[30]  H. Mantsch,et al.  The use and misuse of FTIR spectroscopy in the determination of protein structure. , 1995, Critical reviews in biochemistry and molecular biology.

[31]  T. Keiderling,et al.  Comparison of and limits of accuracy for statistical analyses of vibrational and electronic circular dichroism spectra in terms of correlations to and predictions of protein secondary structure , 1995, Protein science : a publication of the Protein Society.

[32]  P. Haris,et al.  A Fourier-transform infrared spectroscopic investigation of the hydrogen-deuterium exchange and secondary structure of the 28-kDa channel-forming integral membrane protein (CHIP28). , 1995, European journal of biochemistry.

[33]  T. Keiderling,et al.  Predictions of secondary structure using statistical analyses of electronic and vibrational circular dichroism and Fourier transform infrared spectra of proteins in H2O. , 1996, Journal of molecular biology.

[34]  Protein structural segments and their interconnections derived from optical spectra. Thermal unfolding of ribonuclease T1 as an example. , 1996, Biochemistry.

[35]  A. Dunker,et al.  Aromatic and Cystine Side-Chain Circular Dichroism in Proteins , 1996 .

[36]  S. Venyaminov,et al.  Determination of Protein Secondary Structure , 1996 .

[37]  W. Hübner,et al.  Secondary structure determination of proteins in aqueous solution by infrared spectroscopy: a comparison of multivariate data analysis methods. , 1996, Analytical biochemistry.

[38]  E. Goormaghtigh,et al.  Relevance of Protein Thin Films Prepared for Attenuated Total Reflection Fourier Transform Infrared Spectroscopy: Significance of the pH , 1996 .

[39]  T. Walz,et al.  Secondary structures comparison of aquaporin-1 and bacteriorhodopsin: a Fourier transform infrared spectroscopy study of two-dimensional membrane crystals. , 1997, Biophysical journal.

[40]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[41]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[42]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998 , 1998, Nucleic Acids Res..

[43]  A. Fink,et al.  A new attenuated total reflectance Fourier transform infrared spectroscopy method for the study of proteins in solution. , 1998, Analytical biochemistry.

[44]  F. Goñi,et al.  Structure and dynamics of membrane proteins as studied by infrared spectroscopy. , 1999, Progress in biophysics and molecular biology.

[45]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[46]  A. Barth,et al.  The infrared absorption of amino acid side chains. , 2000, Progress in biophysics and molecular biology.

[47]  N. Sreerama,et al.  Estimation of protein secondary structure from circular dichroism spectra: comparison of CONTIN, SELCON, and CDSSTR methods with an expanded reference set. , 2000, Analytical biochemistry.

[48]  Jonathan G. Lees,et al.  Analyses of circular dichroism spectra of membrane proteins , 2003, Protein science : a publication of the Protein Society.

[49]  Erik Goormaghtigh,et al.  Rationally selected basis proteins: A new approach to selecting proteins for spectroscopic secondary structure analysis , 2003, Protein science : a publication of the Protein Society.

[50]  R. Woody Contributions of tryptophan side chains to the far-ultraviolet circular dichroism of proteins , 2004, European Biophysics Journal.