A Real-Time All-Atom Structural Search Engine for Proteins

Protein designers use a wide variety of software tools for de novo design, yet their repertoire still lacks a fast and interactive all-atom search engine. To solve this, we have built the Suns program: a real-time, atomic search engine integrated into the PyMOL molecular visualization system. Users build atomic-level structural search queries within PyMOL and receive a stream of search results aligned to their query within a few seconds. This instant feedback cycle enables a new “designability”-inspired approach to protein design where the designer searches for and interactively incorporates native-like fragments from proven protein structures. We demonstrate the use of Suns to interactively build protein motifs, tertiary interactions, and to identify scaffolds compatible with hot-spot residues. The official web site and installer are located at http://www.degradolab.org/suns/ and the source code is hosted at https://github.com/godotgildor/Suns (PyMOL plugin, BSD license), https://github.com/Gabriel439/suns-cmd (command line client, BSD license), and https://github.com/Gabriel439/suns-search (search engine server, GPLv2 license).

[1]  Eugene I Shakhnovich,et al.  Lessons from the design of a novel atomic potential for protein folding , 2005, Protein science : a publication of the Protein Society.

[2]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[3]  Jason E. Donald,et al.  Salt bridges: Geometrically specific, designable interactions , 2011, Proteins.

[4]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[5]  Dong Xu,et al.  ProteinDBS: a real-time retrieval system for protein structure comparison , 2004, Nucleic Acids Res..

[6]  M. Rossmann,et al.  X-ray Crystallographic Structure of the Norwalk Virus , 1999 .

[7]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[8]  T. P. Flores,et al.  Multiple protein structure alignment , 1994, Protein science : a publication of the Protein Society.

[9]  Shawn M. Gomez,et al.  Mapping Protein Interactions between Dengue Virus and Its Human and Insect Hosts , 2011, PLoS neglected tropical diseases.

[10]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[11]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[12]  Nikolay V. Dokholyan,et al.  Rigid substructure search , 2011, Bioinform..

[13]  C. Sander,et al.  The FSSP database of structurally aligned protein fold families. , 1994, Nucleic acids research.

[14]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[15]  Gevorg Grigoryan,et al.  Mining tertiary structural motifs for assessment of designability. , 2013, Methods in enzymology.

[16]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[17]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[18]  S Ramaseshan,et al.  Crystal Physics, Diffraction, Theoretical and General Crystallography , 1981 .

[19]  M G Rossmann,et al.  X-ray crystallographic structure of the Norwalk virus capsid. , 1999, Science.

[20]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[21]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[22]  R. Sankaranarayanan,et al.  Crystal structure of a fungal protease inhibitor from Antheraea mylitta. , 2009, Journal of structural biology.

[23]  W. DeGrado,et al.  Computational Design of Virus-Like Protein Assemblies on Carbon Nanotube Surfaces , 2011, Science.

[24]  Stephen A. Cook,et al.  The complexity of theorem-proving procedures , 1971, STOC.