论文信息 - An Information-Geometric Approach to Learning Bayesian Network Topologies from Data

An Information-Geometric Approach to Learning Bayesian Network Topologies from Data

This work provides a general overview of structure learning of Bayesian networks (BNs), and goes on to explore the feasibility of applying an information-geometric approach to the task of learning the topology of a BN from data. An information-geometric scoring function based on the Minimum Description Length Principle is described. The info-geometric score takes into account the effects of complexity due to both the number of parameters in the BN, and the geometry of the statistical manifold on which the parametric family of probability distributions of the BN is mapped. The paper provides an introduction to information geometry, and lays out a theoretical framework supported by empirical evidence that shows that this info-geometric scoring function is at least as efficient as applying BIC (Bayesian information criterion); and that, for certain BN topologies, it can drastically increase the accuracy in the selection of the best possible BN.

Eitel J. M. Lauría

[1] N. Metropolis,et al. Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[2] Bin Yu,et al. Model Selection and the Principle of Minimum Description Length , 2001 .

[3] G. Schwarz. Estimating the Dimension of a Model , 1978 .

[4] Carlos C. Rodriguez,et al. The Volume of Bitnets , 2004 .

[5] J. Rissanen. Stochastic Complexity and Modeling , 1986 .

[6] Nir Friedman,et al. Being Bayesian about Network Structure , 2000, UAI.

[7] Gregory F. Cooper,et al. A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[8] J. Rissanen,et al. Modeling By Shortest Data Description* , 1978, Autom..

[9] Carlos C. Rodriguez,et al. The Metrics Induced by the Kullback Number , 1989 .

[10] Gregory J. Chaitin,et al. A recent technical report , 1974, SIGA.

[11] Shun-ichi Amari,et al. Methods of information geometry , 2000 .