Paper Information - Learning Bayesian Networks from Ordinal Data

Learning Bayesian Networks from Ordinal Data

Bayesian networks are a powerful framework for studying the dependency structure of variables in a complex system. The problem of learning Bayesian networks is tightly associated with the given data type. Ordinal data, such as stages of cancer, rating scale survey questions, and letter grades for exams, are ubiquitous in applied research. However, existing solutions are mainly for continuous and nominal data. In this work, we propose an iterative score-and-search method, called the Ordinal Structural EM (OSEM) algorithm, for learning Bayesian networks from ordinal data. Unlike traditional approaches designed for nominal data, we explicitly respect the ordering amongst the categories. More precisely, we assume that the ordinal variables originate from marginally discretizing a set of Gaussian variables, whose structural dependence in the latent space follows a directed acyclic graph. Then, we adopt the Structural EM algorithm and derive closed-form scoring functions for efficient graph searching. Through simulation studies, we illustrate the superior performance of the OSEM algorithm compared to the alternatives and analyze various factors that may influence the learning accuracy. Finally, we demonstrate the practicality of our method with a real-world application on psychological survey data from 408 patients with co-morbid symptoms of obsessive-compulsive disorder and depression.
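
The modeling assumption in the abstract (ordinal variables obtained by marginally discretizing Gaussian variables whose dependence follows a directed acyclic graph) can be illustrated with a minimal sketch. Everything specific here is hypothetical and chosen only for illustration: the three-node chain z1 -> z2 -> z3, the edge weights, the cut points, and the function names. The sketch covers only the assumed data-generating process, not the OSEM algorithm or its scoring functions.

```python
import random
from bisect import bisect_right

# Hypothetical cut points: one sorted list per variable.
# A variable with k thresholds has k + 1 ordered levels (0..k).
THRESHOLDS = [[-0.5, 0.5], [0.0], [-1.0, 0.0, 1.0]]


def sample_latent(rng):
    """Draw one sample from a linear Gaussian model on the chain z1 -> z2 -> z3."""
    z1 = rng.gauss(0.0, 1.0)
    z2 = 0.8 * z1 + rng.gauss(0.0, 1.0)   # hypothetical edge weight 0.8
    z3 = -0.6 * z2 + rng.gauss(0.0, 1.0)  # hypothetical edge weight -0.6
    return [z1, z2, z3]


def discretize(value, thresholds):
    """Map a latent value to an ordinal level by counting thresholds <= value."""
    return bisect_right(thresholds, value)


def sample_ordinal(n, seed=0):
    """Generate n ordinal observations by discretizing each latent variable marginally."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        latent = sample_latent(rng)
        data.append([discretize(z, t) for z, t in zip(latent, THRESHOLDS)])
    return data


data = sample_ordinal(5)
```

Each row of `data` holds one ordinal level per variable. The ordering of the levels is meaningful because it mirrors the ordering of the latent values, which is the property a nominal-data method would ignore.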

Giusi Moffa | Jack Kuipers | Xiang Ge Luo
