Cys.sqlite: A Structured-Information Approach to the Comprehensive Analysis of Cysteine Disulfide Bonds in the Protein Databank

Cysteine is a multifaceted amino acid that is central to the structure and function of many proteins. A disulfide bond formed between two cysteines restrains protein conformations through the strong covalent bond and torsions about the bond that prefer, energetically, ±90°. In this study, we transform over 30 000 Protein Databank files (PDBx/mmCIFs) into a single file, the SQLite database (Cys.sqlite). The database schema is designed to accommodate the structural information on both oxidized and reduced cysteines and to retain essential protein metadata to establish informational and biological provenance. Cys.sqlite contains over 95 000 peptide chains and 500 000 cysteines (700 000 structural conformers); there are over 265 000 cysteine disulfide bond conformations from structures solved with all available experimental methods. The structural information is analyzed with respect to sequence identity cutoff, the experimental method, and energetics of the disulfide. We find that as the experimental information becomes limiting and the influence of modeling becomes more pronounced, the observed average strain increases artificially. The database and analyses presented here can be used to improve the refinement of biological structures from experiments that are known to contain one or more disulfide bonds.

[1]  Hector H. Huang,et al.  Force-dependent chemical kinetics of disulfide bond reduction observed with single-molecule techniques. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Xiao Zhu,et al.  Polarizable empirical force field for sulfur‐containing compounds based on the classical Drude oscillator model , 2010, J. Comput. Chem..

[3]  H. Hennecke,et al.  Protein disulphide oxidoreductases in bacteria. , 1994, Trends in biochemical sciences.

[4]  G. N. Ramachandran,et al.  Conformation of polypeptides and proteins. , 1968, Advances in protein chemistry.

[5]  J. Richardson,et al.  The penultimate rotamer library , 2000, Proteins.

[6]  M. A. Wouters,et al.  Between-strand disulfides: forbidden disulfides linking adjacent β-strands , 2013 .

[7]  Matthew L. Leininger,et al.  Psi4: an open‐source ab initio electronic structure program , 2012 .

[8]  Jeremy C. Smith,et al.  HackaMol: An Object-Oriented Modern Perl Library for Molecular Hacking on Multiple Scales , 2015, J. Chem. Inf. Model..

[9]  Miriam Góngora-Benítez,et al.  Multifaceted roles of disulfide bonds. Peptides as therapeutics. , 2014, Chemical reviews.

[10]  P. Hogg,et al.  Disulfide bonds as switches for protein function. , 2003, Trends in biochemical sciences.

[11]  Mapping the cysteine proteome: analysis of redox-sensing thiols. , 2011, Current opinion in chemical biology.

[12]  J. W. Wong,et al.  Identification of allosteric disulfides from labile bonds in X-ray structures , 2018, Royal Society Open Science.

[13]  A. Joachimiak,et al.  X-ray-induced deterioration of disulfide bridges at atomic resolution. , 2010, Acta crystallographica. Section D, Biological crystallography.

[14]  H. Scheraga,et al.  Disulfide bonds and protein folding. , 2000, Biochemistry.

[15]  M. A. Wouters,et al.  Disulfides as redox switches: from molecular mechanisms to functional significance. , 2010, Antioxidants & redox signaling.

[16]  C. Deane,et al.  Identifying and quantifying radiation damage at the atomic level , 2015, Journal of synchrotron radiation.

[17]  J. Thornton Disulphide bridges in globular proteins. , 1981, Journal of molecular biology.

[18]  F. Gräter,et al.  Mechanical force can fine-tune redox potentials of disulfide bonds. , 2012, Biophysical journal.

[19]  J. Winther,et al.  Shedding light on disulfide bond formation: engineering a redox switch in green fluorescent protein , 2001, The EMBO journal.

[20]  Cole H. Christie,et al.  Protein Data Bank: the single global archive for 3D macromolecular structure data , 2018, Nucleic Acids Res..

[21]  Steven M Lewis,et al.  Molprobity's ultimate rotamer‐library distributions for model validation , 2016, Proteins.

[22]  Angela K. Wilson,et al.  Gaussian basis sets for use in correlated molecular calculations. X. The atoms aluminum through argon revisited , 2001 .

[23]  Roland L. Dunbrack Rotamer libraries in the 21st century. , 2002, Current opinion in structural biology.

[24]  B. Dijkstra,et al.  Model building of disulfide bonds in proteins with known three-dimensional structure. , 1988, Protein engineering.

[25]  Sjors H. W. Scheres A Bayesian view on cryo-EM structure determination , 2012, ISBI.

[26]  N Srinivasan,et al.  Stereochemical modeling of disulfide bridges. Criteria for introduction into proteins by site-directed mutagenesis. , 1989, Protein engineering.

[27]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.

[28]  H. Scheraga,et al.  Equilibrium and kinetic constants for the thiol-disulfide interchange reaction between glutathione and dithiothreitol. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[29]  R. Ravelli,et al.  The 'fingerprint' that X-rays can leave on structures. , 2000, Structure.

[30]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[31]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[32]  B. Matthews,et al.  Structure of a thermostable disulfide-bridge mutant of phage T4 lysozyme shows that an engineered cross-link in a flexible region does not increase the rigidity of the folded protein. , 1990, Biochemistry.

[33]  Pekka Pyykkö,et al.  Molecular single-bond covalent radii for elements 1-118. , 2009, Chemistry.

[34]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[35]  T. H. Dunning Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen , 1989 .

[36]  B. Schmidt,et al.  Search for allosteric disulfide bonds in NMR structures , 2007, BMC Structural Biology.

[37]  P. Karplus,et al.  A fresh look at the Ramachandran plot and the occurrence of standard structures in proteins , 2010, Biomolecular concepts.

[38]  Klaus Schulten,et al.  Mature HIV-1 capsid structure by cryo-electron microscopy and all-atom molecular dynamics , 2013, Nature.

[39]  Xiao Wang,et al.  Psi4 1.1: An Open-Source Electronic Structure Program Emphasizing Automation, Advanced Libraries, and Interoperability. , 2017, Journal of chemical theory and computation.

[40]  Nathaniel Echols,et al.  Accessing protein conformational ensembles using room-temperature X-ray crystallography , 2011, Proceedings of the National Academy of Sciences.

[41]  Dianne Ford,et al.  Metalloproteins and metal sensing , 2009, Nature.

[42]  T. Dunning,et al.  Electron affinities of the first‐row atoms revisited. Systematic basis sets and wave functions , 1992 .

[43]  R. Wetzel Harnessing disulfide bonds using protein engineering , 1987 .

[44]  Torsten Herrmann,et al.  Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. , 2002, Journal of molecular biology.

[45]  M. A. Wouters,et al.  Cross-strand disulfides in the non-hydrogen bonding site of antiparallel β-sheet (aCSDns): poised for biological switching , 2015 .

[46]  D. Missiakas,et al.  Making and breaking disulfide bonds. , 1997, Annual review of microbiology.

[47]  E. Snell,et al.  Insights into the mechanism of X-ray-induced disulfide-bond cleavage in lysozyme crystals based on EPR, optical absorption and X-ray diffraction studies , 2013, Acta crystallographica. Section D, Biological crystallography.

[48]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[49]  D. Belin,et al.  Mutations that allow disulfide bond formation in the cytoplasm of Escherichia coli. , 1993, Science.

[50]  George M. Whitesides,et al.  Rate constants and equilibrium constants for thiol-disulfide interchange reactions involving oxidized glutathione , 1980 .

[51]  Christopher J. Williams,et al.  Broad Analysis of Vicinal Disulfides: Occurrences, Conformations with Cis or with Trans Peptides, and Functional Roles Including Sugar Binding. , 2017, Journal of molecular biology.

[52]  M. A. Wouters,et al.  Cross-strand disulphides in cell entry proteins: poised to act. , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[53]  J. W. Wong,et al.  Allosteric Disulfide Bonds , 2006 .

[54]  Roland L. Dunbrack,et al.  A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. , 2011, Structure.

[55]  Gerhard Klebe,et al.  High-Throughput Crystallography: Reliable and Efficient Identification of Fragment Hits. , 2016, Structure.

[56]  J. Beckwith,et al.  Disulfide bond formation in the Escherichia coli cytoplasm: an in vivo role reversal for the thioredoxins , 1998, The EMBO journal.

[57]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[58]  Haruki Nakamura,et al.  Intra- and intermolecular interaction inducing pyramidalization on both sides of a proline dipeptide during isomerization: an ab initio QM/MM molecular dynamics simulation study in explicit water. , 2009, Journal of the American Chemical Society.

[59]  P. Fernandes,et al.  Theoretical insights into the mechanism for thiol/disulfide exchange. , 2004, Chemistry.

[60]  A. Kossiakoff,et al.  The crystallographically determined structures of atypical strained disulfides engineered into subtilisin. , 1986, The Journal of biological chemistry.

[61]  G. Storz,et al.  Activation of the OxyR transcription factor by reversible disulfide bond formation. , 1998, Science.

[62]  Leonardo G. Trabuco,et al.  Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics. , 2008, Structure.

[63]  S. Gellman,et al.  Parallel β-sheet secondary structure is stabilized and terminated by interstrand disulfide cross-linking. , 2012, Journal of the American Chemical Society.

[64]  F. Gräter,et al.  Identification of allosteric disulfides from prestress analysis. , 2014, Biophysical journal.

[65]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[66]  J L Sussman,et al.  Specific chemical and structural damage to proteins produced by synchrotron radiation. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[67]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[68]  L. J. Perry,et al.  Disulfide bond engineered into T4 lysozyme: stabilization of the protein toward thermal inactivation. , 1984, Science.