STRUCLA: a WWW meta-server for protein structure comparison and evolutionary classification

MOTIVATION Evolutionary relationships of proteins have long been derived from the alignment of protein sequences. But from the view of function, most restraints of evolutionary divergence operate at the level of tertiary structure. It has been demonstrated that quantitative measures of dissimilarity in families of structurally similar proteins can be applied to the construction of trees from a comparison of their three-dimensional structures. However, no convenient tool is publicly available to carry out such analyses. RESULTS We developed STRUCLA (STRUcture CLAssification), a WWW tool for generation of trees based on evolutionary distances inferred from protein structures according to various methods. The server takes as an input a list of PDB files or the initial alignment of protein coordinates provided by the user (for instance exported from SWISS PDB VIEWER). The user specifies the distance cutoff and selects the distance measures. The server returns series of unrooted trees in the NEXUS format and corresponding distance matrices, as well as a consensus tree. The results can be used as an alternative and a complement to a fixed hierarchy of current protein structure databases. It can complement sequence-based phylogenetic analysis in the 'twilight zone of homology', where amino acid sequences are too diverged to provide reliable relationships.

[1]  B Honig,et al.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. , 2000, Journal of molecular biology.

[2]  M Levitt,et al.  Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins , 1998, Protein science : a publication of the Protein Society.

[3]  Arne Elofsson,et al.  A study of quality measures for protein threading models , 2001, BMC Bioinformatics.

[4]  Janusz M. Bujnicki,et al.  Comparison of protein structures reveals monophyletic origin of AdoMet-dependent methyltransferase family and mechanistic convergence rather than recent differentiation of N4-cytosine and N6-adenine DNA methylation , 1999, Silico Biol..

[5]  Patrice Koehl,et al.  Sequence variations within protein families are linearly related to structural variations. , 2002, Journal of molecular biology.

[6]  A. Efimov Structural trees for protein superfamilies , 1997, Proteins.

[7]  J. Whisstock,et al.  Protein structural alignments and functional genomics , 2001, Proteins.

[8]  A. Murzin How far divergent evolution goes in proteins. , 1998, Current opinion in structural biology.

[9]  M. Gerstein,et al.  Average core structures and variability measures for protein families: application to the immunoglobulins. , 1995, Journal of molecular biology.

[10]  S. Pongor,et al.  Protein fold similarity estimated by a probabilistic approach based on Cα-Cα distance comparison , 2002 .

[11]  A. Godzik The structural alignment between two proteins: Is there a unique answer? , 1996, Protein science : a publication of the Protein Society.

[12]  A C May,et al.  Toward more meaningful hierarchical classification of protein three‐dimensional structures , 1999, Proteins.

[13]  Janusz M. Bujnicki,et al.  Phylogeny of the Restriction Endonuclease-Like Superfamily Inferred from Comparison of Protein Structures , 2000, Journal of Molecular Evolution.

[14]  J A Eisen,et al.  A phylogenomic study of the MutS family of proteins. , 1998, Nucleic acids research.

[15]  Georg E. Schulz,et al.  Recognition of phylogenetic relationships from polypeptide chain fold similarities , 1977, Journal of Molecular Evolution.

[16]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .

[17]  Tom L. Blundell,et al.  Molecular anatomy: Phyletic relationships derived from three-dimensional structures of proteins , 2005, Journal of Molecular Evolution.

[18]  P Bork,et al.  Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[19]  W. Pearson,et al.  Evolution of protein sequences and structures. , 1999, Journal of molecular biology.

[20]  A. Poupon,et al.  The immunoglobulin fold family: sequence analysis and 3D structure comparisons. , 1999, Protein engineering.

[21]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[22]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[23]  M. Levitt,et al.  A unified statistical framework for sequence comparison and structure comparison. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[25]  B. Rost,et al.  Protein structures sustain evolutionary drift. , 1997, Folding & design.

[26]  S. Pongor,et al.  A normalized root‐mean‐spuare distance for comparing protein three‐dimensional structures , 2001, Protein science : a publication of the Protein Society.

[27]  M J Sippl,et al.  Optimum superimposition of protein structures: ambiguities and implications. , 1996, Folding & design.