PDB2CD: a web-based application for the generation of circular dichroism spectra from protein atomic coordinates

Motivation: Circular dichroism (CD) spectroscopy is extensively utilized for determining the percentages of secondary structure content present in proteins. However, although a large contributor, secondary structure is not the only factor that influences the shape and magnitude of the CD spectrum produced. Other structural features can make contributions so an entire protein structural conformation can give rise to a CD spectrum. There is a need for an application capable of generating protein CD spectra from atomic coordinates. However, no empirically derived method to do this currently exists. Results: PDB2CD has been created as an empirical-based approach to the generation of protein CD spectra from atomic coordinates. The method utilizes a combination of structural features within the conformation of a protein; not only its percentage secondary structure content, but also the juxtaposition of these structural components relative to one another, and the overall structure similarity of the query protein to proteins in our dataset, the SP175 dataset, the ‘gold standard’ set obtained from the Protein Circular Dichroism Data Bank (PCDDB). A significant number of the CD spectra associated with the 71 proteins in this dataset have been produced with excellent accuracy using a leave-one-out cross-validation process. The method also creates spectra in good agreement with those of a test set of 14 proteins from the PCDDB. The PDB2CD package provides a web-based, user friendly approach to enable researchers to produce CD spectra from protein atomic coordinates. Availability and implementation: http://pdb2cd.cryst.bbk.ac.uk Contact: r.w.janes@qmul.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[2]  B. Wallace,et al.  Synchrotron radiation circular dichroism spectroscopy of proteins and applications in structural and functional genomics. , 2006, Chemical Society reviews.

[3]  Andreas Prlic,et al.  Sequence analysis , 2003 .

[4]  N. Sreerama,et al.  Estimation of protein secondary structure from circular dichroism spectra: inclusion of denatured proteins with native proteins in the analysis. , 2000, Analytical biochemistry.

[5]  E. Ohmae,et al.  Vacuum-Ultraviolet Circular Dichroism Spectra of Escherichia coli Dihydrofolate Reductase and Its Mutants: Contributions of Phenylalanine and Tyrosine Side Chains and Exciton Coupling of Two Tryptophan Side Chains. , 2015, The journal of physical chemistry. B.

[6]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[7]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[8]  Andreas Prlic,et al.  BioJava: an open-source framework for bioinformatics in 2012 , 2012, Bioinform..

[9]  Lee Whitmore,et al.  ValiDichro: a website for validating and quality control of protein circular dichroism spectra , 2013, Nucleic Acids Res..

[10]  W C Johnson,et al.  Information content in the circular dichroism of proteins. , 1981, Biochemistry.

[11]  Andrew J. Miles,et al.  A reference database for circular dichroism spectroscopy covering fold and secondary structure space , 2006, Bioinform..

[12]  Ian Sillitoe,et al.  The CATH classification revisited—architectures reviewed and new ways to characterize structural divergence in superfamilies , 2008, Nucleic Acids Res..

[13]  S. Provencher,et al.  Estimation of globular protein secondary structure from circular dichroism. , 1981, Biochemistry.

[14]  Jonathan D. Hirst,et al.  DichroCalc - circular and linear dichroism online , 2009, Bioinform..

[15]  Christoph Wiedemann,et al.  CAPITO - a web server-based analysis and plotting tool for circular dichroism data , 2013, Bioinform..

[16]  Y H Chen,et al.  A new approach to the calculation of secondary structures of globular proteins by optical rotatory dispersion and circular dichroism. , 1971, Biochemical and biophysical research communications.

[17]  Lee Whitmore,et al.  PCDDB: the protein circular dichroism data bank, a repository for circular dichroism spectral and metadata , 2010, Nucleic Acids Res..

[18]  J. Brahms,et al.  Determination of protein secondary structure in solution by vacuum ultraviolet circular dichroism. , 1980, Journal of molecular biology.

[19]  Frank Wien,et al.  Accurate secondary structure prediction and fold recognition for circular dichroism spectroscopy , 2015, Proceedings of the National Academy of Sciences.

[20]  Andrew J. Miles,et al.  A reference dataset for the analyses of membrane protein secondary structures and transmembrane residues using circular dichroism spectroscopy , 2011, Bioinform..

[21]  B. Wallace,et al.  Protein secondary structure analyses from circular dichroism spectroscopy: methods and reference databases. , 2008, Biopolymers.

[22]  R. Woody,et al.  Theoretical study of the contribution of aromatic side chains to the circular dichroism of basic bovine pancreatic trypsin inhibitor. , 1989, Biochemistry.

[23]  Johnson Wc,et al.  Information content in the circular dichroism of proteins. , 1981 .

[24]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.