Paper Information - Dimension Correction for Hierarchical Latent Class Models

Dimension Correction for Hierarchical Latent Class Models

Model complexity is an important factor to consider when selecting among graphical models. When all variables are observed, the complexity of a model can be measured by its standard dimension, i.e. the number of independent parameters. When hidden variables are present, however, standard dimension might no longer be appropriate. One should instead use effective dimension (Geiger et al. 1996). This paper is concerned with the computation of effective dimension. First we present an upper bound on the effective dimension of a latent class (LC) model. This bound is tight and its computation is easy. We then consider a generalization of LC models called hierarchical latent class (HLC) models (Zhang 2002). We show that the effective dimension of an HLC model can be obtained from the effective dimensions of some related LC models. We also demonstrate empirically that using effective dimension in place of standard dimension improves the quality of models learned from data.
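To make the notion of standard dimension concrete, here is a minimal sketch for an LC model with a single hidden variable of cardinality k and observed variables of cardinalities n_1, ..., n_m. Counting independent parameters gives (k - 1) + k * sum(n_i - 1). The sketch also computes a generic ceiling on effective dimension: the joint distribution over the observed variables has only prod(n_i) - 1 free entries, so no model can have more effective parameters than that. This ceiling is an assumption used for illustration and is not necessarily the bound derived in the paper; the function names are hypothetical.

```python
from math import prod


def standard_dimension(hidden_card, observed_cards):
    """Number of independent parameters of an LC model.

    (hidden_card - 1) free parameters for P(H), plus
    hidden_card * (n - 1) for each conditional table P(X | H).
    """
    return (hidden_card - 1) + hidden_card * sum(n - 1 for n in observed_cards)


def simplex_ceiling(observed_cards):
    """Free entries in the joint distribution over the observed variables."""
    return prod(observed_cards) - 1


def effective_dimension_ceiling(hidden_card, observed_cards):
    """Illustrative upper limit on effective dimension (an assumption here,
    not necessarily the bound presented in the paper)."""
    return min(standard_dimension(hidden_card, observed_cards),
               simplex_ceiling(observed_cards))


# Three binary observed variables, hidden variable with 3 states:
# standard dimension is 2 + 3 * 3 = 11, but the observed joint
# distribution has only 2**3 - 1 = 7 free entries.
print(standard_dimension(3, [2, 2, 2]))
print(effective_dimension_ceiling(3, [2, 2, 2]))
```

In this example the standard dimension overstates the model's complexity, which is the kind of gap that motivates using effective dimension in model selection scores.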

Tomas Kocka | Nevin Lianwen Zhang

[1] Gregory Piatetsky-Shapiro, et al. Advances in Knowledge Discovery and Data Mining, 2004, Lecture Notes in Computer Science.

[2] Jim Q. Smith, et al. On the Geometry of Bayesian Graphical Models with Hidden Variables, 1998, UAI.

[3] Peter C. Cheeseman, et al. Bayesian Classification (AutoClass): Theory and Results, 1996, Advances in Knowledge Discovery and Data Mining.

[4] David Maxwell Chickering, et al. Efficient Approximations for the Marginal Likelihood of Bayesian Networks with Hidden Variables, 1997, Machine Learning.

[5] G. Schwarz. Estimating the Dimension of a Model, 1978.

[6] Gregory F. Cooper, et al. A Bayesian method for the induction of probabilistic networks from data, 1992, Machine Learning.

[7] Nevin Lianwen Zhang, et al. Hierarchical latent class models for cluster analysis, 2002, J. Mach. Learn. Res.

[8] David Heckerman, et al. Asymptotic Model Selection for Directed Networks with Hidden Variables, 1996, UAI.

[9] Jim Q. Smith, et al. Geometry, moments and Bayesian networks with hidden variables, 1999, AISTATS.