论文信息 - Learning Probabilistic Residual Finite State Automata

Learning Probabilistic Residual Finite State Automata

We introduce a new class of probabilistic automata: Probabilistic Residual Finite State Automata. We show that this class can be characterized by a simple intrinsic property of the stochastic languages they generate (the set of residual languages is finitely generated by residuals) and that it admits canonical minimal forms. We prove that there are more languages generated by PRFA than by Probabilistic Deterministic Finite Automata (PDFA). We present a first inference algorithm using this representation and we show that stochastic languages represented by PRFA can be identified from a characteristic sample if words are provided with their probabilities of appearance in the target language.

[1] Andreas Stolcke,et al. Inducing Probabilistic Grammars by Bayesian Model Merging , 1994, ICGI.

[2] Aurélien Lemay,et al. Learning Regular Languages Using Non Deterministic Finite Automata , 2000, ICGI.

[3] Aurélien Lemay,et al. Learning regular languages using RFSAs , 2004, Theor. Comput. Sci..

[4] J. Oncina,et al. INFERRING REGULAR LANGUAGES IN POLYNOMIAL UPDATED TIME , 1992 .

[5] José Oncina,et al. Learning Stochastic Regular Grammars by Means of a State Merging Method , 1994, ICGI.

[6] Aurélien Lemay,et al. Residual Finite State Automata , 2002, Fundam. Informaticae.

[7] Colin de la Higuera,et al. Probabilistic DFA Inference using Kullback-Leibler Divergence and Minimality , 2000, ICML.

[8] José Oncina,et al. Learning deterministic regular grammars from stochastic samples in polynomial time , 1999, RAIRO Theor. Informatics Appl..

[9] Barak A. Pearlmutter,et al. Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm , 1998, ICGI.

[10] Aurélien Lemay,et al. Learning Regular Languages Using RFSA , 2001, ALT.