A discriminatory function for prediction of protein-DNA interactions based on alpha shape modeling

MOTIVATION: Protein-DNA interactions are central to many biological processes, yet the principles underlying the molecular recognition process remain largely unknown. As more high-resolution 3D structures of protein-DNA complexes become available, the surface characteristics of these complexes have become an important research topic.

RESULTS: We apply an alpha shape model to represent the surface structure of the protein-DNA complex and develop an interface-atom curvature-dependent conditional probability discriminatory function for predicting protein-DNA interactions. The interface-atom curvature-dependent formalism captures atomic interaction details better than the atomic distance-based method. The proposed method performs well in discriminating native structures from docking decoy sets and outperforms the distance-dependent formalism in terms of the z-score. In computational experiments, the curvature-dependent formalism with optimal parameters achieves a native z-score of -8.17 when discriminating the native structure from the decoy set with the highest surface-complementarity scores, and a native z-score of -7.38 when discriminating it from the lowest-RMSD decoy set. The formalism can also be used to predict the apo versions of DNA-binding proteins. These results suggest that the interface-atom curvature-dependent formalism has good predictive capability for protein-DNA interactions.

AVAILABILITY: The code and data sets are available for download at http://www.hy8.com/bioinformatics.htm

CONTACT: kenandzhou@hotmail.com
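The native z-scores quoted above measure how far the native structure's score lies from the score distribution of a decoy set. The abstract does not spell out the formula, so the following is only a minimal sketch under the common convention z = (native score - mean decoy score) / standard deviation of decoy scores, where a more negative value means better separation. The function name `native_z_score`, the use of the population standard deviation, and the example numbers are illustrative assumptions, not taken from the paper.

```python
from statistics import mean, pstdev


def native_z_score(native_score, decoy_scores):
    """Return the z-score of the native structure relative to a decoy set.

    Assumed convention (not confirmed by the source): lower scores are
    better, and the decoy spread is the population standard deviation.
    """
    if len(decoy_scores) < 2:
        raise ValueError("need at least two decoy scores")
    mu = mean(decoy_scores)
    sigma = pstdev(decoy_scores)
    if sigma == 0:
        raise ValueError("decoy scores have zero variance")
    return (native_score - mu) / sigma


if __name__ == "__main__":
    # Hypothetical scores for illustration only; not data from the paper.
    decoys = [-2.0, 0.0, 2.0]
    print(native_z_score(-10.0, decoys))
```

Under this convention, a discriminatory function that ranks the native complex well below every decoy yields a strongly negative z-score, which is the sense in which -8.17 and -7.38 indicate good discrimination.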
