论文信息 - An efficient viterbi algorithm on DBNs

An efficient viterbi algorithm on DBNs

DBNs (Dynamic Bayesian Networks) [1] are powerful tool in modeling time-series data, and have been used in speech recognition recently [2,3,4]. The “decoding” task in speech recognition means to find the viterbi path [5](in graphical model community, “viterbi path” has the same meaning as MPE “Most Probable Explanation”) for a given acoustic observations. In this paper we describe a new algorithm utilizes a new data structure “backpointer”, which is produced in the “marginalization” procedure in probability inference. With these backpointers, the viterbi path can be found in a simple backtracking. We first introduce the concept of backpointer and backtracking; then give the algorithm to compute the viterbi path for DBNs based on backpointer and backtracking. We prove that the new algorithm is correct, faster and more memory saving comparison with old algorithm. Several experiments are conducted to demonstrate the effectiveness of the algorithm on several well known DBNs. We also test the algorithm on a real world DBN model that can recognize continuous digit numbers.

[1] Geoffrey Zweig,et al. The graphical models toolkit: An open source software system for speech and time-series processing , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] David J. Spiegelhalter,et al. Probabilistic Networks and Expert Systems , 1999, Information Science and Statistics.

[3] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4] Michael C. Horsch,et al. Dynamic Bayesian networks , 1990 .

[5] Adnan Darwiche,et al. Inference in belief networks: A procedural guide , 1996, Int. J. Approx. Reason..

[6] Stuart J. Russell,et al. Dynamic bayesian networks: representation, inference and learning , 2002 .

[7] Kevin Murphy,et al. Bayes net toolbox for Matlab , 1999 .

[8] Jeff A. Bilmes,et al. DBN based multi-stream models for speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9] A. P. Dawid,et al. Applications of a general propagation algorithm for probabilistic expert systems , 1992 .