Learning Bayesian Networks Under Sparsity Constraints: A Parameterized Complexity Analysis

We study the problem of learning the structure of an optimal Bayesian network $D$ when additional constraints are posed on the DAG $D$ or on its moralized graph. More precisely, we consider the constraint that the moralized graph can be transformed to a graph from a sparse graph class $\Pi$ by at most $k$ vertex deletions. We show that for $\Pi$ being the graphs with maximum degree $1$, an optimal network can be computed in polynomial time when $k$ is constant, extending previous work that gave an algorithm with such a running time for $\Pi$ being the class of edgeless graphs [Korhonen & Parviainen, NIPS 2015]. We then show that further extensions or improvements are presumably impossible. For example, we show that when $\Pi$ is the set of graphs with maximum degree $2$ or when $\Pi$ is the set of graphs in which each component has size at most three, then learning an optimal network is NP-hard even if $k=0$. Finally, we show that learning an optimal network with at most $k$ edges in the moralized graph presumably has no $f(k)\cdot |I|^{\mathcal{O}(1)}$-time algorithm and that, in contrast, an optimal network with at most $k$ arcs in the DAG $D$ can be computed in $2^{\mathcal{O}(k)}\cdot |I|^{\mathcal{O}(1)}$ time where $|I|$ is the total input size.

[1]  Stefan Szeider,et al.  Parameterized Complexity Results for Exact Bayesian Network Structure Learning , 2014, J. Artif. Intell. Res..

[2]  Mikko Koivisto,et al.  Exact Structure Discovery in Bayesian Networks with Less Space , 2009, UAI.

[3]  David Maxwell Chickering,et al.  Learning Bayesian Networks is , 1994 .

[4]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[5]  S. Miyano,et al.  Finding optimal gene networks using biological constraints. , 2003, Genome informatics. International Conference on Genome Informatics.

[6]  Stephen Gould,et al.  Learning Bounded Treewidth Bayesian Networks , 2008, NIPS.

[7]  Shimon Even,et al.  An O (N2.5) algorithm for maximum matching in general graphs , 1975, 16th Annual Symposium on Foundations of Computer Science (sfcs 1975).

[8]  Adnan Darwiche,et al.  Modeling and Reasoning with Bayesian Networks , 2009 .

[9]  Michael R. Fellows,et al.  On the parameterized complexity of multiple-interval graph problems , 2009, Theor. Comput. Sci..

[10]  Christian Komusiewicz,et al.  On the Relation of Strong Triadic Closure and Cluster Deletion , 2018, Algorithmica.

[11]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[12]  Eric Horvitz,et al.  Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (1996) , 2013, ArXiv.

[13]  Anil K. Kesarwani,et al.  Genome Informatics , 2019, Encyclopedia of Bioinformatics and Computational Biology.

[14]  Tomi Silander,et al.  A Simple Approach for Finding the Globally Optimal Bayesian Network Structure , 2006, UAI.

[15]  Janne H. Korhonen,et al.  Exact Learning of Bounded Tree-width Bayesian Networks , 2013, AISTATS.

[16]  R. Downey,et al.  Parameterized Computational Feasibility , 1995 .

[17]  Janne H. Korhonen,et al.  Tractable Bayesian Network Structure Learning with Bounded Vertex Cover Number , 2015, NIPS.

[18]  Michal Pilipczuk,et al.  Parameterized Algorithms , 2015, Springer International Publishing.

[19]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[20]  Christopher Meek,et al.  Finding a path is harder than finding a tree , 2001, AISTATS.

[21]  Christian Komusiewicz,et al.  On the Relation of Strong Triadic Closure and Cluster Deletion , 2019, Algorithmica.