PASS2 version 6: a database of structure-based sequence alignments of protein domain superfamilies in accordance with SCOPe

Abstract The number of protein structures is increasing due to the individual initiatives and rapid development of structure determination techniques. Structure-based sequence alignments of distantly related proteins enable the investigation of structural, evolutionary and functional relationships between proteins and their domains leading to their common evolutionary origin. Protein Alignments organized as Structural Superfamilies (PASS2) is a database that provides such alignments of members of protein domain superfamilies of known structure and with less than 40% sequence identity. PASS2 has been continuously updated in accordance to Structural Classification of Proteins (SCOP), and now Structural Classification of Proteins - extended (SCOPe). The current update directly corresponds to SCOPe 2.06, dealing with 2006 domain superfamilies of known structure and about 14 000 domains. Alignments have been augmented by features such as hidden Markov models, highly conserved residues, structural motifs and gene ontology terms, which are available for download. In this update, we introduce the concepts of ‘extreme structural outliers’ and ‘split superfamilies’ as well.

[1]  Ponnuthurai N. Suganthan,et al.  SMotif: a server for structural motifs in proteins , 2007, Bioinform..

[2]  M. A. McClure,et al.  Hidden Markov models of biological primary sequence information. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Charlotte M. Deane,et al.  JOY: protein sequence-structure representation and analysis , 1998, Bioinform..

[4]  Narayanaswamy Srinivasan,et al.  CUSP : an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations , 2008 .

[5]  Changhoon Kim,et al.  Accuracy of structure-based sequence alignment of automatic methods , 2007, BMC Bioinformatics.

[6]  Ramanathan Sowdhamini,et al.  PASS2: a semi-automated database of Protein Alignments Organised as Structural Superfamilies , 2002, Nucleic Acids Res..

[7]  Ramanathan Sowdhamini,et al.  Sequence and Structural Analyses of Interleukin-8-Like Chemokine Superfamily , 2008, Silico Biol..

[8]  Samuel A. Smits,et al.  jsPhyloSVG: A Javascript Library for Visualizing Interactive and Vector-Based Phylogenetic Trees on the Web , 2010, PloS one.

[9]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[10]  Ramanathan Sowdhamini,et al.  BMC Bioinformatics BioMed Central Database , 2004 .

[11]  T. Blundell,et al.  Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming. , 1990, Journal of molecular biology.

[12]  Arunmozhiarasi Armugam,et al.  microRNAs Involved in Regulating Spontaneous Recovery in Embolic Stroke Model , 2013, PLoS ONE.

[13]  Ramanathan Sowdhamini,et al.  PASS2 version 4: An update to the database of structure-based sequence alignments of structural domain superfamilies , 2011, Nucleic Acids Res..

[14]  Lenore Cowen,et al.  Matt: Local Flexibility Aids Protein Multiple Structure Alignment , 2008, PLoS Comput. Biol..

[15]  M. Baggiolini,et al.  Neutrophil-activating peptide-1/interleukin 8, a novel cytokine that activates neutrophils. , 1989, The Journal of clinical investigation.

[16]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[17]  Ramanathan Sowdhamini,et al.  PASS2 database for the structure-based sequence alignment of distantly related SCOP domain superfamilies: update to version 5 and added features , 2015, Nucleic Acids Res..

[18]  Ramanathan Sowdhamini,et al.  RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information , 2016, BMC Bioinformatics.

[19]  Janet M Thornton,et al.  Evolution of binding sites for zinc and calcium ions playing structural roles , 2008, Proteins.

[20]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[21]  A. D. McLachlan,et al.  A mathematical procedure for superimposing atomic coordinates of proteins , 1972 .

[22]  Steven E. Brenner,et al.  SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures , 2013, Nucleic Acids Res..

[23]  T. Blundell,et al.  Knowledge based modelling of homologous proteins, Part I: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures. , 1987, Protein engineering.

[24]  Xueyan Ma,et al.  Crystal Structure of Vinorine Synthase, the First Representative of the BAHD Superfamily* , 2005, Journal of Biological Chemistry.

[25]  Saikat Chakrabarti,et al.  Regions of minimal structural variation among members of protein domain superfamilies: application to remote homology detection and modelling using distant relationships , 2004, FEBS letters.

[26]  Sridhar Hariharaputran,et al.  Rebelling for a Reason: Protein Structural “Outliers” , 2013, PloS one.

[27]  Ni Li,et al.  Gene Ontology Annotations and Resources , 2012, Nucleic Acids Res..

[28]  Ramanathan Sowdhamini,et al.  PASS2: A Database of Structure-Based Sequence Alignments of Protein Structural Domain Superfamilies , 2011, Int. J. Knowl. Discov. Bioinform..