Empirical Risk Minimization with Approximations of Probabilistic Grammars

Probabilistic grammars are generative statistical models that are useful for modeling compositional and sequential structures. We present a framework, reminiscent of structural risk minimization, for empirical risk minimization of the parameters of a fixed probabilistic grammar using the log-loss. We derive sample complexity bounds in this framework that apply to both the supervised and the unsupervised setting.
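As an illustrative sketch (not taken from the paper itself), the log-loss empirical risk minimization objective for a fixed grammar can be written as follows, assuming parameters $\theta \in \Theta$ and a training sample of $n$ derivation trees $z_1, \ldots, z_n$:

$$\hat{\theta} \;=\; \operatorname*{arg\,min}_{\theta \in \Theta} \;\frac{1}{n} \sum_{i=1}^{n} -\log p_{\theta}(z_i).$$

In the unsupervised setting the derivations are latent, so the loss would instead be evaluated on observed sentences $x_1, \ldots, x_n$, with $p_{\theta}(x_i) = \sum_{z} p_{\theta}(x_i, z)$ marginalizing over the hidden trees.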
