DNAproDB: an interactive tool for structural analysis of DNA–protein complexes

Abstract Many biological processes are mediated by complex interactions between DNA and proteins. Transcription factors, various polymerases, nucleases and histones recognize and bind DNA with different levels of binding specificity. To understand the physical mechanisms that allow proteins to recognize DNA and achieve their biological functions, it is important to analyze structures of DNA–protein complexes in detail. DNAproDB is a web-based interactive tool designed to help researchers study these complexes. DNAproDB provides an automated structure-processing pipeline that extracts structural features from DNA–protein complexes. The extracted features are organized in structured data files, which are easily parsed with any programming language or viewed in a browser. We processed a large number of DNA–protein complexes retrieved from the Protein Data Bank and created the DNAproDB database to store this data. Users can search the database by combining features of the DNA, protein or DNA–protein interactions at the interface. Additionally, users can upload their own structures for processing privately and securely. DNAproDB provides several interactive and customizable tools for creating visualizations of the DNA–protein interface at different levels of abstraction that can be exported as high quality figures. All functionality is documented and freely accessible at http://dnaprodb.usc.edu.

[1]  Pinak Chakrabarti,et al.  Dissection, residue conservation, and structural classification of protein‐DNA interfaces , 2009, Proteins.

[2]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[3]  Changsheng Zhang,et al.  Structural insight into the self-sacrifice mechanism of enediyne resistance. , 2006, ACS chemical biology.

[4]  Nathan A. Baker,et al.  PDB2PQR: an automated pipeline for the setup of Poisson-Boltzmann electrostatics calculations , 2004, Nucleic Acids Res..

[5]  S. Harrison,et al.  A structural taxonomy of DNA-binding domains , 1991, Nature.

[6]  Jun-tao Guo,et al.  Statistical analysis of structural determinants for protein–DNA‐binding specificity , 2016, Proteins.

[7]  John D. Westbrook,et al.  The PDB Format, mmCIF Formats, and Other Data Formats , 2005 .

[8]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[9]  J. Thornton,et al.  Satisfying hydrogen bonding potential in proteins. , 1994, Journal of molecular biology.

[10]  Gerhard Klebe,et al.  PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations , 2007, Nucleic Acids Res..

[11]  H M Berman,et al.  Protein-DNA interactions: A structural analysis. , 1999, Journal of molecular biology.

[12]  G M Clore,et al.  Structural basis for SRY-dependent 46-X,Y sex reversal: modulation of DNA bending by a naturally occurring point mutation. , 2001, Journal of molecular biology.

[13]  Francisco Melo,et al.  The Protein-DNA Interface database , 2010, BMC Bioinformatics.

[14]  Ponraj Prabakaran,et al.  Classification of protein-DNA complexes based on structural descriptors. , 2006, Structure.

[15]  Remo Rohs,et al.  Control of DNA minor groove width and Fis protein binding by the purine 2-amino group , 2013, Nucleic acids research.

[16]  RyangGuk Kim,et al.  PDA: an automatic and comprehensive analysis program for protein-DNA complex structures , 2009, BMC Genomics.

[17]  Chi-Ren Shyu,et al.  DOMMINO 2.0: integrating structurally resolved protein-, RNA-, and DNA-mediated macromolecular interactions , 2016, Database J. Biol. Databases Curation.

[18]  R. Mann,et al.  Origins of specificity in protein-DNA recognition. , 2010, Annual review of biochemistry.

[19]  Remo Rohs,et al.  Covariation between homeodomain transcription factors and the shape of their DNA binding sites , 2013, Nucleic acids research.

[20]  Markus Fischer,et al.  MarkUs: a server to navigate sequence–structure–function space , 2011, Nucleic Acids Res..

[21]  Bernhardt L Trout,et al.  Prediction of aggregation prone regions of therapeutic proteins. , 2010, The journal of physical chemistry. B.

[22]  David S. Goodsell,et al.  The RCSB protein data bank: integrative view of protein, gene and 3D structural information , 2016, Nucleic Acids Res..

[23]  Sohail Malik,et al.  Crystal Structure of Negative Cofactor 2 Recognizing the TBP-DNA Transcription Complex , 2001, Cell.

[24]  Ramanathan Sowdhamini,et al.  Re-visiting protein-centric two-tier classification of existing DNA-protein complexes , 2012, BMC Bioinformatics.

[25]  Xiang-Jun Lu,et al.  3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures , 2008, Nature Protocols.

[26]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[27]  Daniel Svozil,et al.  Bioinformatic analysis of the protein/DNA interface , 2011, Nucleic acids research.

[28]  Simon Mitternacht,et al.  FreeSASA: An open source C library for solvent accessible surface area calculations , 2016, F1000Research.

[29]  Michael A. Crickmore,et al.  Functional Specificity of a Hox Protein Mediated by the Recognition of Minor Groove Structure , 2007, Cell.

[30]  J. Thornton,et al.  An overview of the structures of protein-DNA complexes , 2000, Genome Biology.

[31]  J. Richardson,et al.  Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. , 1999, Journal of molecular biology.

[32]  Remo Rohs,et al.  Mechanistic insights into metal ion activation and operator recognition by the ferric uptake regulator , 2015, Nature Communications.

[33]  J. Thornton,et al.  NUCPLOT: a program to generate schematic diagrams of protein-nucleic acid interactions. , 1997, Nucleic acids research.

[34]  Remo Rohs,et al.  Exposing the secrets of sex determination , 2015, Nature Structural &Molecular Biology.

[35]  R. Lavery,et al.  Defining the structure of irregular nucleic acids: conventions and principles. , 1989, Journal of biomolecular structure & dynamics.

[36]  Hideki Aihara,et al.  An ancient protein-DNA interaction underlying metazoan sex determination , 2015, Nature Structural &Molecular Biology.

[37]  Helen M Berman,et al.  Signatures of protein-DNA recognition in free DNA binding sites. , 2009, Journal of molecular biology.

[38]  T. Richmond,et al.  Solvent mediated interactions in the structure of the nucleosome core particle at 1.9 a resolution. , 2002, Journal of molecular biology.

[39]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[40]  Stephen K. Burley,et al.  1.9 Å resolution refined structure of TBP recognizing the minor groove of TATAAAAG , 1994, Nature Structural Biology.

[41]  Christina Freytag,et al.  The Definitive Guide To Mongodb The Nosql Database For Cloud And Desktop Computing , 2016 .

[42]  Remo Rohs,et al.  Conformations of p53 response elements in solution deduced using site-directed spin labeling and Monte Carlo sampling , 2013, Nucleic acids research.

[43]  C. Garvie,et al.  Recognition of specific DNA sequences. , 2001, Molecular cell.

[44]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[45]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[46]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[47]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[48]  David A. Lee,et al.  CATH: comprehensive structural and functional annotations for genome sequences , 2014, Nucleic Acids Res..

[49]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[50]  Shula Shazman,et al.  OnTheFly: a database of Drosophila melanogaster transcription factors and their binding sites , 2013, Nucleic Acids Res..

[51]  Prasanna R Kolatkar,et al.  The crystal structure of the Sox4 HMG domain-DNA complex suggests a mechanism for positional interdependence in DNA recognition. , 2012, The Biochemical journal.

[52]  Bruno Contreras-Moreira,et al.  3D-footprint: a database for the structural analysis of protein–DNA complexes , 2009, Nucleic Acids Res..