Visualization and analysis of non-covalent contacts using the Protein Contacts Atlas

Visualizations of biomolecular structures empower us to gain insights into biological functions, generate testable hypotheses, and communicate biological concepts. Typical visualizations (such as ball and stick) primarily depict covalent bonds. In contrast, non-covalent contacts between atoms, which govern normal physiology, pathogenesis, and drug action, are seldom visualized. We present the Protein Contacts Atlas, an interactive resource of non-covalent contacts from over 100,000 PDB crystal structures. We developed multiple representations for visualization and analysis of non-covalent contacts at different scales of organization: atoms, residues, secondary structure, subunits, and entire complexes. The Protein Contacts Atlas enables researchers from different disciplines to investigate diverse questions in the framework of non-covalent contacts, including the interpretation of allostery, disease mutations and polymorphisms, by exploring individual subunits, interfaces, and protein–ligand contacts and by mapping external information. The Protein Contacts Atlas is available at http://www.mrc-lmb.cam.ac.uk/pca/ and also through PDBe.The Protein Contacts Atlas is an interactive resource of non-covalent contacts that can generate multiple representations of non-covalent contacts from PDB structures at different scales, from atoms to subunits and entire complexes.

[1]  R. Nussinov,et al.  Residues crucial for maintaining short paths in network communication mediate signaling in proteins , 2006, Molecular systems biology.

[2]  Victoria A. Higman,et al.  Uncovering network systems within protein structures. , 2003, Journal of molecular biology.

[3]  S. Emamzadah,et al.  Reversal of the DNA-binding-induced loop L1 conformational switch in an engineered human p53 protein. , 2014, Journal of molecular biology.

[4]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[5]  Rahul Raman,et al.  Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds , 2010, PloS one.

[6]  Alex Bavelas,et al.  Communication Patterns in Task‐Oriented Groups , 1950 .

[7]  Linus Pauling,et al.  The Structure of Proteins , 1939 .

[8]  M. Seeber,et al.  Structure network analysis to gain insights into GPCR function. , 2016, Biochemical Society transactions.

[9]  Saraswathi Vishveshwara,et al.  PROTEIN STRUCTURE: INSIGHTS FROM GRAPH THEORY , 2002 .

[10]  Saraswathi Vishveshwara,et al.  Protein Structure and Function: Looking through the Network of Side-Chain Interactions. , 2015, Current protein & peptide science.

[11]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Nadezhda T. Doncheva,et al.  Topological analysis and interactive visualization of biological networks and protein structures , 2012, Nature Protocols.

[13]  Barbara Baird,et al.  Models and molecules , 1994, Nature.

[14]  Francesco Raimondi,et al.  Network analysis to uncover the structural communication in GPCRs. , 2013, Methods in cell biology.

[15]  M. Cheetham,et al.  Mechanisms of cell death in rhodopsin retinitis pigmentosa: implications for therapy. , 2005, Trends in molecular medicine.

[16]  M. Perutz THE HEMOGLOBIN MOLECULE. , 1964, Scientific American.

[17]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.

[18]  Oliver P. Ernst,et al.  A Ligand Channel through the G Protein Coupled Receptor Opsin , 2009, PloS one.

[19]  Bas Vroling,et al.  GPCRdb: an information system for G protein-coupled receptors , 2015, Nucleic Acids Res..

[20]  Nadezhda T. Doncheva,et al.  Analyzing and visualizing residue networks of protein structures. , 2011, Trends in biochemical sciences.

[21]  Xavier Deupi,et al.  Stabilized G protein binding site in the structure of constitutively active metarhodopsin-II , 2011, Proceedings of the National Academy of Sciences.

[22]  Francesca Fanelli,et al.  WebPSN: a web server for high-throughput investigation of structural communication in biomacromolecules , 2015, Bioinform..

[23]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[24]  A. J. Venkatakrishnan,et al.  Universal allosteric mechanism for Gα activation by GPCRs , 2015, Nature.

[25]  M. Babu,et al.  Molecular signatures of G-protein-coupled receptors , 2013, Nature.

[26]  Sarah A. Teichmann,et al.  3D Complex: A Structural Classification of Protein Complexes , 2006, PLoS Comput. Biol..

[27]  Abhik Mukhopadhyay,et al.  PDBe: improved accessibility of macromolecular structure data from PDB and EMDB , 2015, Nucleic Acids Res..

[28]  Yifan Cheng Single-Particle Cryo-EM at Crystallographic Resolution , 2015, Cell.

[29]  Silvio C. E. Tosatto,et al.  RING: networking interacting residues, evolutionary information and energetics in protein structures , 2011, Bioinform..

[30]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[31]  Alicia P. Higueruelo,et al.  Arpeggio: A Web Server for Calculating and Visualising Interatomic Interactions in Protein Structures , 2017, Journal of molecular biology.

[32]  Sarah A Teichmann,et al.  Evolution of protein structures and interactions from the perspective of residue contact networks. , 2013, Current opinion in structural biology.

[33]  S. Vishveshwara,et al.  Intra and inter-molecular communications through protein structure network. , 2009, Current protein & peptide science.

[34]  G. N. Ramachandran,et al.  Stereochemistry of polypeptide chain configurations. , 1963, Journal of molecular biology.

[35]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[36]  Gürol M. Süel,et al.  Evolutionarily conserved networks of residues mediate allosteric communication in proteins , 2003, Nature Structural Biology.

[37]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[38]  M. Perutz,et al.  Structure of Hæmoglobin: A Three-Dimensional Fourier Synthesis at 5.5-Å. Resolution, Obtained by X-Ray Analysis , 1960, Nature.

[39]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[40]  Yigong Shi A Glimpse of Structural Biology through X-Ray Crystallography , 2014, Cell.

[41]  Chuan-Tien Hung,et al.  Control of the negative IRES trans-acting factor KHSRP by ubiquitination , 2016, Nucleic acids research.

[42]  Kyongbum Lee,et al.  An algorithm for modularity analysis of directed and weighted biological networks based on edge-betweenness centrality , 2006, Bioinform..

[43]  M. Madan Babu,et al.  Selectivity determinants of GPCR–G-protein binding , 2017, Nature.

[44]  R. Stevens,et al.  High-Resolution Crystal Structure of an Engineered Human β2-Adrenergic G Protein–Coupled Receptor , 2007, Science.

[45]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[46]  S. Teichmann,et al.  Principles of assembly reveal a periodic table of protein complexes , 2015, Science.

[47]  Franca Fraternali,et al.  POPS: a fast algorithm for solvent accessible surface areas at atomic and residue level , 2003, Nucleic Acids Res..

[48]  Gert Sabidussi,et al.  The centrality index of a graph , 1966 .

[49]  R. G. Hart,et al.  Structure of Myoglobin: A Three-Dimensional Fourier Synthesis at 2 Å. Resolution , 1960, Nature.

[50]  A. Leslie,et al.  Structure of the adenosine A2A receptor bound to an engineered G protein , 2016, Nature.

[51]  J. Richardson β-Sheet topology and the relatedness of proteins , 1977, Nature.

[52]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[53]  A. J. Venkatakrishnan,et al.  Diverse activation pathways in class A GPCRs converge near the G-protein-coupling region , 2016, Nature.

[54]  R. Nussinov,et al.  Residue centrality, functionally important residues, and active site shape: Analysis of enzyme and non‐enzyme families , 2006, Protein science : a publication of the Protein Society.

[55]  C Chothia,et al.  Haemoglobin: the structural changes related to ligand binding and its allosteric mechanism. , 1979, Journal of molecular biology.

[56]  David S. Goodsell,et al.  The RCSB protein data bank: integrative view of protein, gene and 3D structural information , 2016, Nucleic Acids Res..

[57]  Chris de Graaf,et al.  Generic GPCR residue numbers - aligning topology maps while minding the gaps. , 2015, Trends in pharmacological sciences.

[58]  Bang Wong,et al.  Visualizing biological data—now and in the future , 2010, Nature Methods.

[59]  Roman A. Laskowski,et al.  LigPlot+: Multiple Ligand-Protein Interaction Diagrams for Drug Discovery , 2011, J. Chem. Inf. Model..

[60]  F. Crick,et al.  Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid , 1953, Nature.

[61]  M. Babu,et al.  Pharmacogenomics of GPCR Drug Targets , 2018, Cell.

[62]  K. Wüthrich The way to NMR structures of proteins , 2001, Nature Structural Biology.

[63]  Nobuhiko Saitô,et al.  Tertiary Structure of Proteins. I. : Representation and Computation of the Conformations , 1972 .

[64]  Silvio C. E. Tosatto,et al.  The RING 2.0 web server for high quality residue interaction networks , 2016, Nucleic Acids Res..

[65]  S. Vishveshwara,et al.  Identification of side-chain clusters in protein structures by a graph spectral method. , 1999, Journal of molecular biology.

[66]  Susan S. Taylor,et al.  Surface comparison of active and inactive protein kinases identifies a conserved activation mechanism , 2006, Proceedings of the National Academy of Sciences.

[67]  Nita Parekh,et al.  NAPS: Network Analysis of Protein Structures , 2016, Nucleic Acids Res..