A Method for Integrating Expert Knowledge When Learning Bayesian Networks From Data

Automatic learning of Bayesian networks from data is a challenging task, particularly when the data are scarce and the problem domain contains a high number of random variables. The introduction of expert knowledge is recognized as an excellent solution for reducing the inherent uncertainty of the models retrieved by automatic learning methods. Previous approaches to this problem based on Bayesian statistics introduce the expert knowledge by the elicitation of informative prior probability distributions of the graph structures. In this paper, we present a new methodology for integrating expert knowledge, based on Monte Carlo simulations and which avoids the costly elicitation of these prior distributions and only requests from the expert information about those direct probabilistic relationships between variables which cannot be reliably discerned with the help of the data.

[1]  Satoru Miyano,et al.  Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[2]  Daphne Koller,et al.  Active Learning for Structure in Bayesian Networks , 2001, IJCAI.

[3]  Judea Pearl,et al.  Fusion, Propagation, and Structuring in Belief Networks , 1986, Artif. Intell..

[4]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[5]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[6]  Rui Chang,et al.  Modeling semantics of inconsistent qualitative knowledge for quantitative Bayesian network inference , 2008, Neural Networks.

[7]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[8]  James G. Scott,et al.  An exploration of aspects of Bayesian multiple testing , 2006 .

[9]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[10]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[11]  Richard E. Neapolitan,et al.  Learning Bayesian networks , 2007, KDD '07.

[12]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[13]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2006, Nucleic Acids Res..

[14]  Luis Daniel Hernández Molinero Diseño y validación de nuevos algoritmos para el tratamiento de grafos de dependencias , 1995 .

[15]  James G. Scott,et al.  Objective Bayesian model selection in Gaussian graphical models , 2009 .

[16]  Blaz Zupan,et al.  Towards knowledge-based gene expression data mining , 2007, J. Biomed. Informatics.

[17]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[18]  Andrés R. Masegosa,et al.  An Importance Sampling Approach to Integrate Expert Knowledge When Learning Bayesian Networks From Data , 2010, IPMU.

[19]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[20]  Lyle H. Ungar,et al.  Using prior knowledge to improve genetic network reconstruction from microarray data , 2004, Silico Biol..

[21]  Kevin Murphy,et al.  Active Learning of Causal Bayes Net Structure , 2006 .

[22]  Luis M. de Campos,et al.  Bayesian network learning algorithms using structural restrictions , 2007, Int. J. Approx. Reason..

[23]  Rui Chang,et al.  Quantitative Inference by Qualitative Semantic Knowledge Mining with Bayesian Model Averaging , 2008, IEEE Transactions on Knowledge and Data Engineering.