Injecting domain knowledge in neural networks: a controlled experiment on a constrained problem

Given enough data, Deep Neural Networks (DNNs) are capable of learning complex input-output relations with high accuracy. In several domains, however, data is scarce or expensive to retrieve, while a substantial amount of expert knowledge is available. It seems reasonable that if we can inject this additional information in the DNN, we could ease the learning process. One such case is that of Constraint Problems, for which declarative approaches exists and pure ML solutions have obtained mixed success. Using a classical constrained problem as a case study, we perform controlled experiments to probe the impact of progressively adding domain and empirical knowledge in the DNN. Our results are very encouraging, showing that (at least in our setup) embedding domain knowledge at training time can have a considerable effect and that a small amount of empirical knowledge is sufficient to obtain practically useful results.

[1]  Richard Evans,et al.  Learning Explanatory Rules from Noisy Data , 2017, J. Artif. Intell. Res..

[2]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[3]  Luc De Raedt,et al.  DeepProbLog: Neural Probabilistic Logic Programming , 2018, BNAIC/BENELEARN.

[4]  Mark D. Johnston,et al.  A discrete stochastic neural network algorithm for constraint satisfaction problems , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[5]  Tim Rocktäschel,et al.  End-to-end Differentiable Proving , 2017, NIPS.

[6]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[7]  Velibor V. Misic,et al.  Optimization of Tree Ensembles , 2017, Oper. Res..

[8]  Matteo Fischetti,et al.  Deep Neural Networks as 0-1 Mixed Integer Linear Programs: A Feasibility Study , 2017, ArXiv.

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  Marco Gori,et al.  Integrating Learning and Reasoning with Deep Logic Models , 2019, ECML/PKDD.

[11]  Marco Gori,et al.  Semantic-based regularization for learning and inference , 2017, Artif. Intell..

[12]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[13]  Paola Mello,et al.  Model Agnostic Solution of CSPs via Deep Learning: A Preliminary Study , 2018, CPAIOR.

[14]  Velibor V. Mivsi'c,et al.  Optimization of Tree Ensembles , 2017, Oper. Res..

[15]  Artur S. d'Avila Garcez,et al.  Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge , 2016, NeSy@HLAI.

[16]  Samy Bengio,et al.  Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.

[17]  Toby Walsh,et al.  Handbook of Constraint Programming , 2006, Handbook of Constraint Programming.

[18]  Frank van Harmelen,et al.  Semi-supervised Learning using Differentiable Reasoning , 2019, FLAP.

[19]  Abderrahim El Qadi,et al.  Scheduling meeting solved by neural network and Min-conflict heuristic , 2016, 2016 4th IEEE International Colloquium on Information Science and Technology (CiSt).

[20]  Barry O'Sullivan,et al.  Constraint acquisition , 2017, Artif. Intell..

[21]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[22]  Andrea Bartolini,et al.  Empirical decision model learning , 2017, Artif. Intell..

[23]  Yoshua Bengio,et al.  Machine Learning for Combinatorial Optimization: a Methodological Tour d'Horizon , 2018, Eur. J. Oper. Res..

[24]  Yingqian Zhang,et al.  Auction optimization using regression trees and linear models as integer programs , 2017, Artif. Intell..

[25]  Guosheng Lin,et al.  Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Wouter Kool,et al.  Attention Solves Your TSP, Approximately , 2018 .