Modeling Protein Contact Networks

Proteins are an important class of biomolecules that serve as essential building blocks of cells. Their three-dimensional structures determine their functions. In this thesis we investigate protein structures using a coarse-grained, network-theoretical approach, viz., complex network analysis. We model protein structures at two length scales, as Protein Contact Networks (PCNs) and as Long-range Interaction Networks (LINs). We found that proteins, being characterised by a high degree of clustering, are small-world networks. Apart from this small-world nature, we found that proteins share another general property, viz., assortativity. This is an interesting and exceptional finding, as other complex networks (except for social networks) are generally known to be disassortative. Importantly, by building appropriate controls we could identify one of the major topological determinants of assortativity.
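The quantities named above can be illustrated with a minimal sketch. It is not the thesis's actual pipeline. It assumes that residues are represented by one coordinate each (for example the C-alpha atom), that two residues are in contact when they lie within a distance cutoff (8 Å is used here as an assumed value), and that a LIN keeps only contacts between residues at least 12 positions apart in sequence (also an assumed threshold). The function names and the toy coordinates are hypothetical.

```python
import math
from itertools import combinations


def contact_network(coords, cutoff=8.0, min_seq_sep=1):
    """Build adjacency sets from residue coordinates.

    Residues i and j are linked when their distance is <= cutoff and they
    are at least min_seq_sep apart in sequence. min_seq_sep=1 gives a PCN;
    a larger value (assumed: 12) gives a LIN. Both defaults are assumptions.
    """
    adj = {i: set() for i in range(len(coords))}
    for i, j in combinations(range(len(coords)), 2):
        if j - i >= min_seq_sep and math.dist(coords[i], coords[j]) <= cutoff:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def average_clustering(adj):
    """Mean local clustering coefficient; nodes of degree < 2 count as 0."""
    if not adj:
        return float("nan")
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        # Count links among the neighbours of this node.
        links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)


def degree_assortativity(adj):
    """Pearson correlation of the degrees at the two ends of each edge.

    Each undirected edge is counted in both directions, so both ends share
    the same mean and variance. Returns NaN when all degrees are equal.
    """
    xs, ys = [], []
    for i, nbrs in adj.items():
        for j in nbrs:
            xs.append(len(adj[i]))
            ys.append(len(adj[j]))
    if not xs:
        return float("nan")
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs)
    if var == 0:
        return float("nan")
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys))
    return cov / var


if __name__ == "__main__":
    # Toy coordinates on a helix-like curve; NOT real protein data.
    toy = [(5 * math.cos(0.6 * t), 5 * math.sin(0.6 * t), 1.5 * t)
           for t in range(40)]
    pcn = contact_network(toy)                  # all contacts
    lin = contact_network(toy, min_seq_sep=12)  # long-range contacts only
    print(average_clustering(pcn), degree_assortativity(pcn))
    print(sum(len(n) for n in lin.values()) // 2, "long-range contacts")
```

A positive value from `degree_assortativity` indicates that highly connected residues tend to be in contact with other highly connected residues, and a negative value indicates the opposite. For real structures the coordinates would be read from a structure file, and the cutoff and sequence-separation threshold should be taken from the body of the thesis rather than from this sketch.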
