Validity of Protein Structure Alignment Method Based on Backbone Torsion Angles

Previous researches noticed that a 3D backbone structure can be mathematically represented with a 1D ? and ? dihedral angle array. However, performance of the backbone dihedral angle alignment was not supported with sufficiently large test sets to be quantified; i.e. only 2 pairs or 4 pairs of proteins were analyzed. Here we showed that it is more effective to accurately anticipate homology among 1891 pairs of proteins of 62 different proteases with the string of ? and ? dihedral angle array than famous 3D structural alignment tool TM-align. Gapless global alignment between protein structures was conducted to validate the effectiveness of performing structural alignment with strings of backbone torsion angles. Representation of 3D structure by 1D torsion angle strings allows local alignment, profile construction, hidden Markov models to be implemented with minor modifications and with almost no loss of speed compared with sequence alignment. By our further validation from the previous studies, the utility of backbone dihedral angle method could be more evident.

[1]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[2]  Roland L Dunbrack,et al.  Testing computational prediction of missense mutation phenotypes: Functional characterization of 204 mutations of human cystathionine beta synthase , 2010, Proteins.

[3]  M. Levitt,et al.  A unified statistical framework for sequence comparison and structure comparison. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Ralf Bundschuh,et al.  Simple is beautiful: a straightforward approach to improve the delineation of true and false positives in PSI-BLAST searches , 2008, Bioinform..

[5]  Manfred J. Sippl,et al.  On distance and similarity in fold space , 2008, Bioinform..

[6]  Joël Pothier,et al.  YAKUSA: A fast structural database scanning method , 2005, Proteins.

[7]  A H Louie,et al.  Differential geometry of proteins: a structural and dynamical representation of patterns. , 1982, Journal of theoretical biology.

[8]  K Henrick,et al.  Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. , 2004, Acta crystallographica. Section D, Biological crystallography.

[9]  Manfred J. Sippl,et al.  A note on difficult structure alignment problems , 2008, Bioinform..

[10]  K. Karplus,et al.  Hidden Markov models that use predicted local structure for fold recognition: Alphabets of backbone geometry , 2003, Proteins.

[11]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[12]  Homayoun Valafar,et al.  Tali: Local Alignment of protein Structures Using Backbone Torsion Angles , 2008, J. Bioinform. Comput. Biol..

[13]  B Honig,et al.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. , 2000, Journal of molecular biology.

[14]  N. Srinivasan,et al.  A substitution matrix for structural alphabet based on structural alignment of homologous proteins and its applications , 2006, Proteins.

[15]  R. Mannhold,et al.  Substructure and Whole Molecule Approaches for Calculating Log P , 2001 .

[16]  J. Jung,et al.  Protein structure alignment using environmental profiles. , 2000, Protein engineering.

[17]  R. Lavery,et al.  Describing protein structure: A general algorithm yielding complete helicoidal parameters and a unique overall axis , 1989, Proteins.

[18]  Min-Sung Kim,et al.  Whole Genome Alignment with BLAST on Grid Environment , 2006, The Sixth IEEE International Conference on Computer and Information Technology (CIT'06).

[19]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[20]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[21]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[22]  J. Skolnick,et al.  The PDB is a covering set of small protein structures. , 2003, Journal of molecular biology.

[23]  A H Louie,et al.  Differential geometry of proteins. Helical approximations. , 1983 .

[24]  S. Rackovsky,et al.  Differential Geometry and Polymer Conformation. 2. Development of a Conformational Distance Function , 1980 .

[25]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[26]  Narayanaswamy Srinivasan,et al.  Protein Block Expert (PBE): a web-based protein structure analysis server using a structural alphabet , 2006, Nucleic Acids Res..

[27]  A Elofsson,et al.  Assessing the performance of fold recognition methods by means of a comprehensive benchmark. , 1996, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[28]  F. Eisenmenger,et al.  A fast unbiased comparison of protein structures by means of the Needleman-Wunsch algorithm , 1991, Journal of Molecular Evolution.

[29]  B. Rost,et al.  Critical assessment of methods of protein structure prediction (CASP)—Round 6 , 2005, Proteins.

[30]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[31]  Osvaldo Olmea,et al.  MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparison , 2002, Protein science : a publication of the Protein Society.

[32]  Pierre Tufféry,et al.  SA-Search: a web tool for protein structure mining based on a Structural Alphabet , 2004, Nucleic Acids Res..

[33]  Adam Godzik,et al.  Flexible structure alignment by chaining aligned fragment pairs allowing twists , 2003, ECCB.

[34]  Kevin Karplus,et al.  Evaluation of local structure alphabets based on residue burial , 2004, Proteins.

[35]  S Rackovsky,et al.  Protein comparison and classification: a differential geometric approach. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[36]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[37]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[38]  Kenji Mizuguchi,et al.  Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites , 2010, Bioinform..

[39]  Peter J. Stuckey,et al.  Fast and accurate protein substructure searching with simulated annealing and GPUs , 2010, BMC Bioinformatics.

[40]  Han van de Waterbeemd,et al.  Substructure and whole molecule approaches for calculating log P , 2001, J. Comput. Aided Mol. Des..

[41]  Jacquelyn S. Fetrow,et al.  Structural genomics and its importance for gene function analysis , 2000, Nature Biotechnology.

[42]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[43]  M. E. Karpen,et al.  Comparing short protein substructures by a method based on backbone torsion angles , 1989, Proteins.

[44]  Christine A. Orengo,et al.  Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint , 2007, BMC Bioinformatics.

[45]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[46]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[47]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.