A similarity between probabilistic tree languages: application to XML document families
暂无分享,去创建一个
We describe a general approach to compute a similarity measure between distributions generated by probabilistic tree automata that may be used in a number of applications in the pattern recognition field. In particular, we show how this similarity can be computed for families of structured (XML) documents can be computed. In such case, the use of regular expressions to specify the right part of the expansion rules adds some complexity to the task.
[1] Derick Wood,et al. One-Unambiguous Regular Languages , 1998, Inf. Comput..
[2] Christian N. S. Pedersen,et al. Metrics and Similarity Measures for Hidden Markov Models , 1999, ISMB.
[3] Jorge Calera-Rubio,et al. Computing the Relative Entropy Between Regular Tree Languages , 1998, Inf. Process. Lett..
[4] Rafael C. Carrasco. Accurate Computation of the Relative Entropy Between Stochastic Regular Grammars , 1997, RAIRO Theor. Informatics Appl..