KNOTTIN: the database of inhibitor cystine knot scaffold after 10 years, toward a systematic structure modeling

Abstract Knottins, or inhibitor cystine knots (ICKs), are ultra-stable miniproteins with multiple applications in drug design and medical imaging. These widespread and functionally diverse proteins are characterized by the presence of three interwoven disulfide bridges in their structure, which form a unique pseudoknot. Since 2004, the KNOTTIN database (www.dsimb.inserm.fr/KNOTTIN/) has been gathering standardized information about knottin sequences, structures, functions and evolution. The website also provides access to bibliographic data and to computational tools that have been specifically developed for ICKs. Here, we present a major upgrade of our database, both in terms of data content and user interface. In addition to the new features, this article describes how KNOTTIN has seen its size multiplied over the past ten years (since its last publication), notably with the recent inclusion of predicted ICKs structures. Finally, we report how our web resource has proved usefulness for the researchers working on ICKs, and how the new version of the KNOTTIN website will continue to serve this active community.

[1]  Jean-Christophe Gelly,et al.  KNOTTIN: the knottin or inhibitor cystine knot scaffold in 2007 , 2007, Nucleic Acids Res..

[2]  M. Currie,et al.  Two randomized trials of linaclotide for chronic constipation. , 2011, The New England journal of medicine.

[3]  Conan K. L. Wang,et al.  ConoServer, a database for conopeptide sequences and structures , 2008, Bioinform..

[4]  J. Cochran,et al.  Cystine-knot peptides: emerging tools for cancer imaging and therapy , 2014, Expert review of proteomics.

[5]  Chris Sander,et al.  MView: a web-compatible database search or multiple alignment viewer , 1998, Bioinform..

[6]  Conan K. L. Wang,et al.  CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering , 2007, Nucleic Acids Res..

[7]  Jason P. Mulvenna,et al.  CyBase: a database of cyclic protein sequence and structure , 2005, Nucleic Acids Res..

[8]  Marco Biasini,et al.  SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information , 2014, Nucleic Acids Res..

[9]  Jean-Christophe Gelly,et al.  The KNOTTIN website and database: a new information system dedicated to the knottin scaffold , 2004, Nucleic Acids Res..

[10]  J. Sussman,et al.  JSmol and the Next-Generation Web-Based Representation of 3D Molecular Structure as Applied to Proteopedia , 2013 .

[11]  S. M. Ashiqul Islam,et al.  PredSTP: a highly accurate SVM based model to predict sequential cystine stabilized peptides , 2015, BMC Bioinformatics.

[12]  Marc A. Martí-Renom,et al.  MODBASE: a database of annotated comparative protein structure models and associated resources , 2005, Nucleic Acids Res..

[13]  Jérôme Gracy,et al.  PAT: a protein analysis toolkit for integrated biocomputing on the web , 2005, Nucleic Acids Res..

[14]  N. Ayoub,et al.  Dramatic expansion of the black widow toxin arsenal uncovered by multi-tissue transcriptomics and venom proteomics , 2014, BMC Genomics.

[15]  Jennifer R Cochran,et al.  Engineered knottin peptides as diagnostics, therapeutics, and drug delivery vehicles. , 2016, Current opinion in chemical biology.

[16]  Yaping Zhang,et al.  A sodium channel inhibitor ISTX-I with a novel structure provides a new hint at the evolutionary link between two toxin folds , 2016, Scientific Reports.

[17]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[18]  Jérôme Gracy,et al.  Optimizing structural modeling for a specific protein scaffold: knottins or inhibitor cystine knots , 2010, BMC Bioinformatics.

[19]  David J. Craik,et al.  ConoServer: updated content, knowledge, and discovery tools in the conopeptide database , 2011, Nucleic Acids Res..

[20]  Adam Zemla,et al.  MvirDB—a microbial database of protein toxins, virulence factors and antibiotic resistance genes for bio-defence applications , 2006, Nucleic Acids Res..

[21]  Julie D Thompson,et al.  Multiple Sequence Alignment Using ClustalW and ClustalX , 2003, Current protocols in bioinformatics.