论文信息 - Clustering Markov States into Equivalence Classes using SVD and Heuristic Search Algorithms

Clustering Markov States into Equivalence Classes using SVD and Heuristic Search Algorithms

This paper investigates the problem of finding a K-state first-order Markov chain that approximates an M -state first-order Markov chain, where K is typically much smaller than M . A variety of greedy heuristic search algorithms that maximize the data likelihood are investigated and found to work well empirically. The proposed algorithms are demonstrated on two applications: learning user models from traces of Unix commands, and word segmentation in language modeling.

Sridevi Parise | Padhraic Smyth | Xianping Ge

[1] Andrew V. Goldberg,et al. An efficient cost scaling algorithm for the assignment problem , 1995, Math. Program..

[2] J. Ponte. USe: A Retargetable Word Segmentation Procedure for Information Retrieval , 1996 .

[3] Pietro Perona,et al. A Factorization Approach to Grouping , 1998, ECCV.

[4] Jianbo Shi,et al. A Random Walks View of Spectral Segmentation , 2001, AISTATS.

[5] J. Besag. On the Statistical Analysis of Dirty Pictures , 1986 .

[6] Hao Li,et al. Regulatory Element Detection Using a Probabilistic Segmentation Model , 2000, ISMB.

[7] Yair Weiss,et al. Segmentation using eigenvectors: a unifying view , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8] Jian Zhang,et al. On the use of words and n-grams for Chinese information retrieval , 2000, IRAL '00.

[9] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[10] Terran Lane,et al. Hidden Markov Models for Human/Computer Interface Modeling , 1999 .

[11] Hermann Ney,et al. Algorithms for bigram and trigram word clustering , 1995, Speech Commun..

[12] Hector J. Levesque,et al. A New Method for Solving Hard Satisfiability Problems , 1992, AAAI.