Discrimination of Native Folds Using Network Properties of Protein Structures

Graph theoretic properties of proteins can be used to perceive the differences between correctly folded proteins and well designed decoy sets. 3D protein structures of proteins are represented with graphs. We used two different graph representations: Delaunay tessellations of proteins and contact map graphs. Graph theoretic properties for both graph types showed high classification accuracy for protein discrimination. Fisher, linear, quadratic, neural network, and support vector classifiers were used for the classification of the protein structures. The best classifier accuracy was over 98%. Results showed that characteristic features of graph theoretic properties can be used in the detection of native folds.

[1]  S Vajda,et al.  Discrimination of near‐native protein structures from misfolded models by empirical free energy functions , 2000, Proteins.

[2]  R. Casadio,et al.  A neural network based predictor of residue contacts in proteins. , 1999, Protein engineering.

[3]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[4]  J Chomilier,et al.  Voronoï tessellation reveals the condensed matter character of folded proteins. , 2000, Physical review letters.

[5]  C. DeLisi,et al.  Necessary conditions for avoiding incorrect polypeptide folds in conformational search by energy minimization , 1993, Biopolymers.

[6]  Richard Bonneau,et al.  Ab initio protein structure prediction: progress and prospects. , 2001, Annual review of biophysics and biomolecular structure.

[7]  M Karplus,et al.  Small-world view of the amino acids that play a key role in protein folding. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  George Karypis,et al.  Prediction of Contact Maps Using Support Vector Machines , 2005, Int. J. Artif. Intell. Tools.

[9]  Iosif I Vaisman,et al.  Graph theoretic properties of networks formed by the Delaunay tessellation of protein structures. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  V. Uversky,et al.  Why are “natively unfolded” proteins unstructured under physiologic conditions? , 2000, Proteins.

[11]  D. Baker,et al.  Prediction and design of macromolecular structures and interactions , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[12]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[13]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[14]  M. Levitt,et al.  Improved protein structure selection using decoy-dependent discriminatory functions , 2004, BMC Structural Biology.

[15]  K A Dill,et al.  Are proteins well-packed? , 2001, Biophysical journal.

[16]  P. Bourne CASP and CAFASP experiments and their findings. , 2003, Methods of biochemical analysis.

[17]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[18]  S. Strogatz Exploring complex networks , 2001, Nature.

[19]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[20]  George Karypis,et al.  Prediction of contact maps using support vector machines , 2003, Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings..

[21]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[22]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[23]  Michael Levitt,et al.  Design of an optimal Chebyshev‐expanded discrimination function for globular proteins , 2002, Protein science : a publication of the Protein Society.

[24]  A. Atilgan,et al.  Small-world communication of residues and significance for protein dynamics. , 2003, Biophysical journal.

[25]  R. Bruccoleri,et al.  Criteria that discriminate between native proteins and incorrectly folded models , 1988, Proteins.

[26]  B. McConkey,et al.  Discrimination of native protein structures using atom–atom contact scoring , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[27]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[28]  T. Subba Rao,et al.  Classification, Parameter Estimation and State Estimation: An Engineering Approach Using MATLAB , 2004 .