SSTmPROT, A POINT mu TATION SENSITIVE TOO l T O CO mb INE RES ulTS AND TO PREDICT THE CONSENS u S SEQ u ENCE , AND SECONDARY STR u CT u RE OF TRANS m E mb RANE PROTEINS

Abstract: The efficiency of predictions of protein secondary structures can be increased by treating this process as a cycle of steps, where each step is an approach to the single natural event in folding process (model and simulation of successive events). The set of simulation steps qualifies reliability of in silico simulation of single steps, allowing to verify the correctness of each step as well as to retain sensitivity in case of a single amino acidic substitution. For this purpose the three-part algorithm (SSTMProt) has been designed. This algorithm combines the results of known methods of prediction of proteins secondary structure. Furthermore, the efficiency of this algorithm has been verified using the models received from the RCSB PDB (the Research Collaboratory for Structural Bioinformatics – Protein Data Base; http://www.rcsb.org). The accuracy of known methods has been compared with the accuracy of designed algorithms. The accuracy has been tested by the comparison of true secondary structure with predicted secondary structure of a given protein. The results of accuracy test has been presented as percentage values of similarity between both secondary structures: predicted structure using known method vs. true structure and predicted structure using designed method vs. true structure. The results demonstrate 20-30% higher accuracy of prediction for designed algorithms then for adequate known methods. The test of sensitivity has been done for proteins of a very conservative and stable structure (subunits of bovine cytochrome c oxidase and bacterial ATP-ase, bovine rhodopsin and human hemoblobin as a globular but alpha-helical protein). The influence of a single amino acid substitution on a resulted secondary structure predicted by SSTMProt algorithms has been examined. The repeatability of elaborated algorithms is 100% and each of all 12 tested combinations of methods were sensitive on a single amino acid substitution. All tests have been done for 10 models of native forms of proteins of known structure (models down-loaded from the RCSB PDB 1HBB, 1HBS, 1OCC, 1U17, 1C17) and over 500 modified models; 30 known methods of prediction of secondary structure of proteins and 40 combinations of these methods included in three versions of elaborated algorithms have been examined for each protein model.

[1]  R A Goldstein,et al.  Predicting protein secondary structure with probabilistic schemata of evolutionarily derived information , 1997, Protein science : a publication of the Protein Society.

[2]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[3]  Geoffrey J. Barton,et al.  The Jalview Java alignment editor , 2004, Bioinform..

[4]  Shigeki Mitaku,et al.  SOSUI: classification and secondary structure prediction system for membrane proteins , 1998, Bioinform..

[5]  E. Padlan,et al.  Refined crystal structure of deoxyhemoglobin S. I. Restrained least-squares refinement at 3.0-A resolution. , 1982, The Journal of biological chemistry.

[6]  Marcella Attimonelli,et al.  HmtDB, a Human Mitochondrial Genomic Resource Based on Variability Studies Supporting Population Genetics and Biomedical Research , 2005, BMC Bioinformatics.

[7]  J. Gibrat,et al.  GOR method for predicting protein secondary structure from amino acid sequence. , 1996, Methods in enzymology.

[8]  M. Girvin,et al.  Structural changes linked to proton translocation by subunit c of the ATP synthase , 1999, Nature.

[9]  T. A. Hall,et al.  BIOEDIT: A USER-FRIENDLY BIOLOGICAL SEQUENCE ALIGNMENT EDITOR AND ANALYSIS PROGRAM FOR WINDOWS 95/98/ NT , 1999 .

[10]  T. Tomizaki,et al.  The Whole Structure of the 13-Subunit Oxidized Cytochrome c Oxidase at 2.8 Å , 1996, Science.

[11]  J U Bowie,et al.  Helix packing in membrane proteins. , 1997, Journal of molecular biology.

[12]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[13]  Christophe Blanchet,et al.  ANTHEPROT: An integrated protein sequence analysis software with client/server capabilities , 2001, Comput. Biol. Medicine.

[14]  Christian Th. Klein,et al.  Influence of solvation on helix formation of poly-alanine studied by multiple annealing simulations , 1996 .

[15]  S. Yoshikawa,et al.  The low-spin heme of cytochrome c oxidase as the driving element of the proton-pumping process , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[16]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[17]  M. Buehler,et al.  Energy landscape, structure and rate effects on strength properties of alpha-helical proteins , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[18]  T. Hirokawa,et al.  Physicochemical factors for discriminating between soluble and membrane proteins: hydrophobicity of helical segments and protein length. , 1999, Protein engineering.

[19]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[20]  R. Hodges,et al.  Effect of chain length on the formation and stability of synthetic alpha-helical coiled coils. , 1994, Biochemistry.

[21]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.

[22]  R S Cármenes,et al.  Predict7, a program for protein structure prediction , 1989, Biochemical and Biophysical Research Communications.

[23]  C. Dobson,et al.  The folding of hen lysozyme involves partially structured intermediates and multiple pathways , 1992, Nature.

[24]  S. Venyaminov,et al.  The Effects of Chain Length and Thermal Denaturation on Helix-Forming Peptides: A Mode-Specific Analysis Using 2D FT-IR , 1997 .

[25]  R. Hodges,et al.  Synthesis of a model protein of defined secondary and quaternary structure. Effect of chain length on the stabilization and formation of two-stranded alpha-helical coiled-coils. , 1984, The Journal of biological chemistry.

[26]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[27]  COMPUTATIONAL METHODS FOR PREDICTING TRANSMEMBRANE ALPHA HELICES , 2002 .

[28]  Kurt Wüthrich,et al.  An EB1-Binding Motif Acts as a Microtubule Tip Localization Signal , 2009, Cell.

[29]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[30]  Pierre Baldi,et al.  An enhanced MITOMAP with a global mtDNA mutational phylogeny , 2006, Nucleic Acids Res..

[31]  P. S. Kim,et al.  Reexamination of the folding of BPTI: predominance of native intermediates , 1991, Science.

[32]  Andrzej Kloczkowski,et al.  GOR V server for protein secondary structure prediction , 2005, Bioinform..

[33]  Torsten Schwede,et al.  Automated comparative protein structure modeling with SWISS‐MODEL and Swiss‐PdbViewer: A historical perspective , 2009, Electrophoresis.

[34]  C. Cohen,et al.  Rigor-like structures from muscle myosins reveal key mechanical elements in the transduction pathways of this allosteric motor. , 2007, Structure.

[35]  G. Heijne Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[36]  E. Kanaya,et al.  Indication of possible post-translational formation of disulphide bonds in the beta-sheet domain of human lysozyme. , 1993, The Biochemical journal.

[37]  F. Sanger,et al.  Sequence and organization of the human mitochondrial genome , 1981, Nature.

[38]  P. Argos,et al.  Seventy‐five percent accuracy in protein secondary structure prediction , 1997, Proteins.

[39]  Marcus Elstner,et al.  The retinal conformation and its environment in rhodopsin in light of a new 2.2 A crystal structure. , 2004, Journal of molecular biology.

[40]  A. Doig,et al.  The alpha-helix folds on the millisecond time scale. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[41]  J. Gibrat,et al.  Further developments of protein secondary structure prediction using information theory. New parameters and consideration of residue pairs. , 1987, Journal of molecular biology.

[42]  T. Lockett,et al.  Influence of Helix Length on Cleavage Efficiency of Hammerhead Ribozymes , 2005 .