A new decoding algorithm for hidden Markov models improves the prediction of the topology of all-beta membrane proteins

BackgroundStructure prediction of membrane proteins is still a challenging computational problem. Hidden Markov models (HMM) have been successfully applied to the problem of predicting membrane protein topology. In a predictive task, the HMM is endowed with a decoding algorithm in order to assign the most probable state path, and in turn the labels, to an unknown sequence. The Viterbi and the posterior decoding algorithms are the most common. The former is very efficient when one path dominates, while the latter, even though does not guarantee to preserve the HMM grammar, is more effective when several concurring paths have similar probabilities. A third good alternative is 1-best, which was shown to perform equal or better than Viterbi.ResultsIn this paper we introduce the posterior-Viterbi (PV) a new decoding which combines the posterior and Viterbi algorithms. PV is a two step process: first the posterior probability of each state is computed and then the best posterior allowed path through the model is evaluated by a Viterbi algorithm.ConclusionWe show that PV decoding performs better than other algorithms when tested on the problem of the prediction of the topology of beta-barrel membrane proteins.

[1]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[2]  M. A. McClure,et al.  Hidden Markov models of biological primary sequence information. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Anders Krogh Hidden Markov models for labeled sequences , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[4]  Anders Krogh,et al.  Two Methods for Improving Performance of a HMM and their Application for Gene Finding , 1997, ISMB.

[5]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[6]  G. Tusnády,et al.  Principles governing amino acid composition of integral membrane proteins: application to topology prediction. , 1998, Journal of molecular biology.

[7]  Ian Holmes,et al.  Dynamic programming alignment accuracy , 1998, RECOMB '98.

[8]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[9]  H Mamitsuka,et al.  Predicting peptides that bind to MHC molecules using supervised learning of hidden markov models , 1998, Proteins.

[10]  B. Rost,et al.  A modified definition of Sov, a segment‐based measure for protein secondary structure prediction assessment , 1999, Proteins.

[11]  G. Schulz β-Barrel membrane proteins , 2000 .

[12]  G. Schulz beta-Barrel membrane proteins. , 2000, Current opinion in structural biology.

[13]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[14]  R. Casadio,et al.  A 3D model of the voltage‐dependent anion channel (VDAC) , 2002, FEBS letters.

[15]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[16]  Piero Fariselli,et al.  A sequence-profile-based HMM for predicting and discriminating beta barrel membrane proteins , 2002, ISMB.

[17]  Piero Fariselli,et al.  MaxSubSeq: an algorithm for segment-length optimization. The case study of the transmembrane spanning segments , 2003, Bioinform..

[18]  R. Casadio,et al.  Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram‐negative bacteria , 2003, Protein science : a publication of the Protein Society.

[19]  Qi Liu,et al.  A HMM-based method to predict the transmembrane regions of \beta-barrel membrane proteins , 2003, Comput. Biol. Chem..

[20]  Piero Fariselli,et al.  In silico prediction of the structure of membrane proteins: Is it feasible? , 2003, Briefings Bioinform..

[21]  Piero Fariselli,et al.  An ENSEMBLE machine learning approach for the prediction of all-alpha membrane proteins , 2003, ISMB.

[22]  Stavros J. Hamodrakas,et al.  PRED-TMBB: a web server for predicting the topology of ?barrel outer membrane proteins , 2004, Nucleic Acids Res..

[23]  Simon Parsons,et al.  Bioinformatics: The Machine Learning Approach by P. Baldi and S. Brunak, 2nd edn, MIT Press, 452 pp., $60.00, ISBN 0-262-02506-X , 2004, The Knowledge Engineering Review.

[24]  Henry R. Bigelow,et al.  Predicting transmembrane beta-barrels in proteomes. , 2004, Nucleic acids research.

[25]  Stavros J. Hamodrakas,et al.  Evaluation of methods for predicting the topology of β-barrel outer membrane proteins and a consensus prediction method , 2005, BMC Bioinformatics.

[26]  Zsuzsanna Dosztányi,et al.  Transmembrane proteins in the Protein Data Bank: identification and classification , 2004, Bioinform..

[27]  A. Elofsson,et al.  Best α‐helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information , 2004 .