Detecting similarities among distant homologous proteins by comparison of domain flexibilities.

Aim of this work is to assess the informativeness of protein dynamics in the detection of similarities among distant homologous proteins. To this end, an approach to perform large-scale comparisons of protein domain flexibilities is proposed. CONCOORD is confirmed as a reliable method for fast conformational sampling. The root mean square fluctuation of alpha carbon positions in the essential dynamics subspace is employed as a measure of local flexibility and a synthetic index of similarity is presented. The dynamics of a large collection of protein domains from ASTRAL/SCOP40 is analyzed and the possibility to identify relationships, at both the family and the superfamily levels, on the basis of the dynamical features is discussed. The obtained picture is in agreement with the SCOP classification, and furthermore suggests the presence of a distinguishable familiar trend in the flexibility profiles. The results support the complementarity of the dynamical and the structural information, suggesting that information from dynamics analysis can arise from functional similarities, often partially hidden by a static comparison. On the basis of this first test, flexibility annotation can be expected to help in automatically detecting functional similarities otherwise unrecoverable.

[1]  A. Mattevi,et al.  Structure-Function Relationships in Flavoenzyme-dependent Amine Oxidations , 2002, The Journal of Biological Chemistry.

[2]  B. Entsch,et al.  Dynamics involved in catalysis by single-component and two-component flavin-dependent aromatic hydroxylases. , 2005, Biochemical and biophysical research communications.

[3]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[4]  Brian Everitt,et al.  Cluster analysis , 1974 .

[5]  F. Fraternali,et al.  Leap‐dynamics: efficient sampling of conformational space of proteins and peptides in solution , 2000, FEBS letters.

[6]  N. Sampson,et al.  Crystal structure determination of cholesterol oxidase from Streptomyces and structural characterization of key active site mutants. , 1999, Biochemistry.

[7]  Patrice Koehl,et al.  The ASTRAL compendium for protein structure and sequence analysis , 2000, Nucleic Acids Res..

[8]  Berk Hess,et al.  Improving efficiency of large time‐scale molecular dynamics simulations of hydrogen‐rich systems , 1999, Journal of computational chemistry.

[9]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[10]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[11]  Hans Frauenfelder,et al.  Temperature-dependent X-ray diffraction as a probe of protein structural dynamics , 1979, Nature.

[12]  Eugene I Shakhnovich,et al.  Expanding protein universe and its origin from the biological Big Bang , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997 .

[14]  Hassan A. Karimi,et al.  iGNM: a database of protein functional motions based on Gaussian Network Model , 2005, Bioinform..

[15]  Stuart Murdock,et al.  BioSimGrid: towards a worldwide repository for biomolecular simulations. , 2004, Organic & biomolecular chemistry.

[16]  M. S. Pilone,et al.  D-Amino acid oxidase: new findings , 2000, Cellular and Molecular Life Sciences CMLS.

[17]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[18]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[19]  Julian Echave,et al.  Exploring the common dynamics of homologous proteins. Application to the globin family. , 2005, Biophysical journal.

[20]  Jocelyne Vreede,et al.  PAS Domains , 2003, The Journal of Biological Chemistry.

[21]  J. Skolnick,et al.  The PDB is a covering set of small protein structures. , 2003, Journal of molecular biology.

[22]  Teri A. Crosby,et al.  How to Detect and Handle Outliers , 1993 .

[23]  G. Vriend,et al.  Prediction of protein conformational freedom from distance constraints , 1997, Proteins.

[24]  Benjamin A Hall,et al.  Dynamite: a simple way to gain insight into protein motions. , 2004, Acta crystallographica. Section D, Biological crystallography.

[25]  A. Lesk,et al.  Structural mechanisms for domain movements in proteins. , 1994, Biochemistry.

[26]  C. Enroth,et al.  The crystal structure of phenol hydroxylase in complex with FAD and phenol provides evidence for a concerted conformational change in the enzyme and its cofactor during catalysis. , 1998, Structure.

[27]  A. Mattevi,et al.  Structure and mechanism of monoamine oxidase. , 2004 .

[28]  A Pandini,et al.  Conservation and specialization in PAS domain dynamics. , 2005, Protein engineering, design & selection : PEDS.

[29]  B. Halle,et al.  Flexibility and packing in proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[31]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[32]  M. Saraste,et al.  FEBS Lett , 2000 .

[33]  W. Pearson,et al.  The limits of protein sequence comparison? , 2005, Current opinion in structural biology.

[34]  M. Sansom,et al.  Molecular dynamics simulations of a K+ channel blocker: Tc1 toxin from Tityus cambridgei , 2003, FEBS letters.

[35]  R. Coulombe,et al.  The structure of L‐amino acid oxidase reveals the substrate trajectory into an enantiomerically conserved active site , 2000, The EMBO journal.

[36]  W. Pearson,et al.  Sensitivity and selectivity in protein structure comparison , 2004, Protein science : a publication of the Protein Society.

[37]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[38]  Patrice Koehl,et al.  ASTRAL compendium enhancements , 2002, Nucleic Acids Res..

[39]  A. Mark,et al.  Fluctuation and cross-correlation analysis of protein motions observed in nanosecond molecular dynamics simulations. , 1995, Journal of molecular biology.

[40]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[41]  Michael Gribskov,et al.  Use of Receiver Operating Characteristic (ROC) Analysis to Evaluate Sequence Matching , 1996, Comput. Chem..

[42]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[43]  G. Wasserburg The chemical record , 1989 .

[44]  Graham Richards,et al.  Intermolecular forces , 1978, Nature.

[45]  R. Miura,et al.  Versatility and specificity in flavoenzymes: control mechanisms of flavin reactivity. , 2001, Chemical record.

[46]  Berk Hess,et al.  GROMACS 3.0: a package for molecular simulation and trajectory analysis , 2001 .

[47]  M. Fraaije,et al.  Flavoenzymes: diverse catalysts with recurrent features. , 2000, Trends in biochemical sciences.

[48]  Jeremy C. Smith,et al.  Fluctuations and correlations in crystalline protein dynamics: a simulation analysis of staphylococcal nuclease. , 2005, Biophysical journal.

[49]  D. Thirumalai,et al.  Network of dynamically important residues in the open/closed transition in polymerases is strongly conserved. , 2005, Structure.

[50]  A. Atilgan,et al.  Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. , 1997, Folding & design.

[51]  Sebastian Doniach,et al.  Protein flexibility in solution and in crystals , 1999 .

[52]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[53]  Liisa Holm,et al.  DaliLite workbench for protein structure comparison , 2000, Bioinform..

[54]  M. Bolognesi,et al.  Active site plasticity in D-amino acid oxidase: a crystallographic analysis. , 1997, Biochemistry.

[55]  K. Tai Conformational sampling for the impatient. , 2004, Biophysical chemistry.

[56]  P. Radivojac,et al.  Protein flexibility and intrinsic disorder , 2004, Protein science : a publication of the Protein Society.

[57]  P. Radivojac,et al.  Improved amino acid flexibility parameters , 2003, Protein science : a publication of the Protein Society.

[58]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[59]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[60]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[61]  Roberto Todeschini,et al.  Data correlation, number of significant principal components and shape of molecules. The K correlation index , 1997 .

[62]  C. A. Andersen,et al.  Continuum secondary structure captures protein flexibility. , 2002, Structure.