An effective sequence-alignment-free superpositioning of pairwise or multiple structures with missing data

BackgroundSuperpositioning is an important problem in structural biology. Determining an optimal superposition requires a one-to-one correspondence between the atoms of two proteins structures. However, in practice, some atoms are missing from their original structures. Current superposition implementations address the missing data crudely by ignoring such atoms from their structures.ResultsIn this paper, we propose an effective method for superpositioning pairwise and multiple structures without sequence alignment. It is a two-stage procedure including data reduction and data registration.ConclusionsNumerical experiments demonstrated that our method is effective and efficient. The code package of protein structure superposition method for addressing the cases with missing data is implemented by MATLAB, and it is freely available from: http://sourceforge.net/projects/pssm123/files/?source=navbar

[1]  R. L. Somorjai,et al.  The alignment of protein structures in three dimensions , 1989 .

[2]  José Mario Martínez,et al.  Convergent algorithms for protein structural alignment , 2007, BMC Bioinformatics.

[3]  S. Balaji,et al.  PALI: a database of alignments and phylogeny of homologous protein structures , 2001, Bioinform..

[4]  D R Flower,et al.  Rotational superposition: a review of methods. , 1999, Journal of molecular graphics & modelling.

[5]  Ping-Chiang Lyu,et al.  CPSARST: an efficient circular permutation search tool applied to the detection of novel protein structural relationships , 2008, Genome Biology.

[6]  J. Whisstock,et al.  Protein structural alignments and functional genomics , 2001, Proteins.

[7]  N. Grishin Fold change in evolution of protein structures. , 2001, Journal of structural biology.

[8]  Jan Gorodkin,et al.  Multiple structural alignment and clustering of RNA sequences , 2007, Bioinform..

[9]  R. Diamond On the comparison of conformations using linear and quadratic transformations , 1976 .

[10]  Adam Godzik,et al.  Multiple flexible structure alignment using partial order graphs , 2005, Bioinform..

[11]  D Fischer,et al.  A computer vision based technique for 3-D sequence-independent structural comparison of proteins. , 1993, Protein engineering.

[12]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Douglas L. Theobald,et al.  Accurate Structural Correlations from Maximum Likelihood Superpositions , 2008, PLoS Comput. Biol..

[14]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[15]  David S. Wishart,et al.  SuperPose: a simple server for sophisticated structural superposition , 2004, Nucleic Acids Res..

[16]  Lenore Cowen,et al.  Matt: Local Flexibility Aids Protein Multiple Structure Alignment , 2008, PLoS Comput. Biol..

[17]  Christos H. Papadimitriou,et al.  Algorithmic aspects of protein structure similarity , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[18]  Tao Jiang,et al.  On the Complexity of Multiple Sequence Alignment , 1994, J. Comput. Biol..

[19]  Randy J. Read,et al.  Overview of the CCP4 suite and current developments , 2011, Acta crystallographica. Section D, Biological crystallography.

[20]  A. Konagurthu,et al.  MUSTANG: A multiple structural alignment algorithm , 2006, Proteins.

[21]  Roland L. Dunbrack Sequence comparison and protein structure prediction. , 2006, Current opinion in structural biology.

[22]  A. Panchenko,et al.  Threading with explicit models for evolutionary conservation of structure and sequence , 1999, Proteins.

[23]  G. Cohen Align : A program to superimpose protein coordinates, accounting for insertions and deletions , 1997 .

[24]  Edmund K. Burke,et al.  ProCKSI: a decision support system for Protein (Structure) Comparison, Knowledge, Similarity and Information , 2007, BMC Bioinformatics.

[25]  S. Kearsley On the orthogonal transformation used for structural comparisons , 1989 .

[26]  Douglas L. Theobald,et al.  Optimal simultaneous superpositioning of multiple structures with missing data , 2012, Bioinform..

[27]  Concettina Guerra,et al.  A global optimization algorithm for protein surface alignment , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop.

[28]  Robert C. Edgar,et al.  Multiple sequence alignment. , 2006, Current opinion in structural biology.

[29]  H. Wolfson,et al.  Multiple structural alignment by secondary structures: Algorithm and applications , 2003, Protein science : a publication of the Protein Society.

[30]  K. Dill,et al.  Using quaternions to calculate RMSD , 2004, J. Comput. Chem..