Maximum mutual information and conditional maximum likelihood estimation of stochastic regular syntax-directed translation schemes

Formal translations have become of great interest for modeling some Pattern Recognition problems, but they require a stochastic extension in order to deal with noisy and distorted patterns. A Maximum Likelihood estimation has been recently developed for learning the statistical parameters of Stochastic Regular Syntax-Directed Translation Schemes. The goal of this paper is the study of estimation criteria in order to take into account the problem of sparse training data. In particular, these are the Maximum Mutual Information criterion and the Conditional Maximum Likelihood criterion. Some experimental results are reported to compare the three criteria.

[1]  Enrique Vidal,et al.  Learning Subsequential Transducers for Pattern Recognition Interpretation Tasks , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[3]  Dimitri Kanevsky,et al.  An inequality for rational functions with applications to some statistical estimation problems , 1991, IEEE Trans. Inf. Theory.

[4]  Francisco Casacuberta Growth Transformations for Probability Functions of Stochastic Grammars , 1996, Int. J. Pattern Recognit. Artif. Intell..

[5]  Michael Picheny,et al.  On a model-robust training method for speech recognition , 1988, IEEE Trans. Acoust. Speech Signal Process..

[6]  Francisco Casacuberta,et al.  Grammatical Inference and Automatic Speech Recognition , 1995 .

[7]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  King-Sun Fu,et al.  Syntactic Pattern Recognition And Applications , 1968 .

[9]  Emden R. Gansner,et al.  A Technique for Drawing Directed Graphs , 1993, IEEE Trans. Software Eng..

[10]  Peter F. Brown,et al.  The acoustic-modeling problem in automatic speech recognition , 1987 .

[11]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[12]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[13]  Michael G. Thomason,et al.  Syntactic Pattern Recognition, An Introduction , 1978, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  I. H. Öğüş,et al.  NATO ASI Series , 1997 .