论文信息 - Algorithms for variable length Markov chain modeling

Algorithms for variable length Markov chain modeling

UNLABELLED We present a general purpose implementation of variable length Markov models. Contrary to fixed order Markov models, these models are not restricted to a predefined uniform depth. Rather, by examining the training data, a model is constructed that fits higher order Markov dependencies where such contexts exist, while using lower order Markov dependencies elsewhere. As both theoretical and experimental results show, these models are capable of capturing rich signals from a modest amount of training data, without the use of hidden states. AVAILABILITY The source code is freely available at http://www.soe.ucsc.edu/~jill/src/

Gill Bejerano

[1] Gill Bejerano,et al. Automata Learning and Stochastic Modeling for Biosequence Analysis , 2003 .

[2] Sean R. Eddy,et al. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[3] Dana Ron,et al. The power of amnesia: Learning probabilistic automata with variable memory length , 1996, Machine Learning.

[4] Golan Yona,et al. Variations on probabilistic suffix trees: statistical modeling and prediction of protein families , 2001, Bioinform..