Analysis of protein folds using protein contact networks

Proteins are important biomolecules, which perform diverse structural and functional roles in living systems. Starting from a linear chain of amino acids, proteins fold to different secondary structures, which then fold through short- and long-range interactions to give rise to the final three-dimensional shapes useful to carry out the biophysical and biochemical functions. Proteins are defined as having a common ‘fold’ if they have major secondary structural elements with same topological connections. It is known that folding mechanisms are largely determined by a protein’s topology rather than its interatomic interactions. The native state protein structures can, thus, be modelled, using a graph-theoretical approach, as coarse-grained networks of amino acid residues as ‘nodes’ and the inter-residue interactions/contacts as ‘links’. Using the network representation of protein structures and their 2D contact maps, we have identified the conserved contact patterns (groups of contacts) representing two typical folds — the EF-hand and the ubiquitin-like folds. Our results suggest that this direct and computationally simple methodology can be used to infer about the presence of specific folds from the protein’s contact map alone.

[1]  Victoria A. Higman,et al.  Uncovering network systems within protein structures. , 2003, Journal of molecular biology.

[2]  M Bycroft,et al.  The UBX domain: a widespread ubiquitin-like module. , 2001, Journal of molecular biology.

[3]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[4]  S. Vishveshwara,et al.  Identification of side-chain clusters in protein structures by a graph spectral method. , 1999, Journal of molecular biology.

[5]  P. Argos,et al.  Relationships between protein sequence and structure patterns based on residue contacts , 1998, Proteins.

[6]  Michael Jünger,et al.  Graph Drawing Software , 2003, Graph Drawing Software.

[7]  S. Linse,et al.  Structural basis for the negative allostery between Ca2+‐ and Mg2+ ‐binding in the intracellular Ca2+ ‐receptor calbindin D9k , 1997, Protein science : a publication of the Protein Society.

[8]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[9]  Ganesh Bagler,et al.  Assortative mixing in Protein Contact Networks and protein folding kinetics , 2007, Bioinform..

[10]  W. Gronwald,et al.  Solution structure of the Ras binding domain of the protein kinase Byr2 from Schizosaccharomyces pombe. , 2001, Structure.

[11]  S. Vishveshwara,et al.  A network representation of protein structures: implications for protein stability. , 2005, Biophysical journal.

[12]  Saraswathi Vishveshwara,et al.  Dynamics of lysozyme structure network: probing the process of unfolding. , 2007, Biophysical journal.

[13]  J M Thornton,et al.  An atlas of protein topology cartoons available on the World-Wide Web. , 1998, Trends in biochemical sciences.

[14]  V. Lamzin,et al.  Crystal structure of the EF‐hand parvalbumin at atomic resolution (0.91 Å) and at low temperature (100 K). Evidence for conformational multistates within the hydrophobic core , 1999, Protein science : a publication of the Protein Society.

[15]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[16]  Susumu Nakayama,et al.  Evolution of EF-hand calcium-modulated proteins. II. Domains of several subfamilies have diverse evolutionary histories , 1992, Journal of Molecular Evolution.

[17]  Ganesh Bagler,et al.  Network properties of protein structures , 2004, q-bio/0408009.

[18]  J. Skolnick,et al.  Ab initio folding of proteins using restraints derived from evolutionary information , 1999, Proteins.

[19]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[20]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[21]  A. Fisher,et al.  The Rub Family of Ubiquitin-like Proteins , 1998, The Journal of Biological Chemistry.

[22]  Usha K Muppirala,et al.  A simple approach for protein structure discrimination based on the network pattern of conserved hydrophobic residues. , 2006, Protein engineering, design & selection : PEDS.

[23]  M Karplus,et al.  Small-world view of the amino acids that play a key role in protein folding. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  J. Baleja,et al.  Solution structure of the Reps1 EH domain and characterization of its binding to NPF target sequences. , 2001, Biochemistry.

[25]  A. Lesk,et al.  Assessment of novel fold targets in CASP4: Predictions of three‐dimensional structures, secondary structures, and interresidue contacts , 2001, Proteins.

[26]  M. Overduin,et al.  Solution structure of Eps15's third EH domain reveals coincident Phe-Trp and Asn-Pro-Phe binding sites. , 2000, Biochemistry.

[27]  Pierre Baldi,et al.  Prediction of contact maps by GIOHMMs and recurrent neural networks using lateral propagation from all four cardinal corners , 2002, ISMB.