Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces

Recent research has proposed neural architectures for solving combinatorial problems in structured output spaces. In many such problems, there may exist multiple solutions for a given input; e.g., a partially filled Sudoku puzzle may have many completions that satisfy all constraints. Further, we are often interested in finding {\em any one} of the possible solutions, with no preference among them. Existing approaches completely ignore this solution multiplicity. In this paper, we argue that being oblivious to the presence of multiple solutions can severely hamper training. Our contribution is twofold. First, we formally define the task of learning one-of-many solutions for combinatorial problems in structured output spaces, which applies to several problems of interest such as N-Queens and Sudoku. Second, we present a generic learning framework that adapts an existing prediction network for a combinatorial problem to handle solution multiplicity. Our framework uses a selection module whose goal is to dynamically determine, for every input, the solution that is most effective for training the network parameters in a given learning iteration. We propose an RL-based approach to jointly train the selection module with the prediction network. Experiments on three different domains, using two different prediction networks, demonstrate that our framework significantly improves accuracy in our setting, obtaining up to a $21$-point gain over the baselines.
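
To make the framework concrete, below is a minimal PyTorch sketch of one way a selection module could be trained jointly with a prediction network via REINFORCE. The names `pred_net`, `embed`, `SelectionModule`, and all shapes are illustrative assumptions, not the paper's exact architecture or reward design; it only demonstrates the stated idea of sampling one of many valid targets and rewarding selections that yield a low prediction loss.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelectionModule(nn.Module):
    """Scores each candidate solution embedding against the input embedding."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Bilinear(dim, dim, 1)

    def forward(self, x_emb, sol_embs):
        # x_emb: (dim,), sol_embs: (k, dim) -> one logit per candidate solution
        k = sol_embs.size(0)
        return self.score(x_emb.expand(k, -1), sol_embs).squeeze(-1)

def train_step(pred_net, selector, embed, x, solutions, opt_pred, opt_sel):
    """One joint update on a single instance x, where `solutions` is a
    (k, num_cells) LongTensor holding all k valid completions of x.
    `pred_net` and `embed` are hypothetical networks assumed here."""
    x_emb = embed(x)                          # (dim,)
    sol_embs = embed(solutions)               # (k, dim)
    dist = torch.distributions.Categorical(logits=selector(x_emb, sol_embs))
    i = dist.sample()                         # pick one of the k solutions
    target = solutions[i]                     # (num_cells,)

    # Supervised update of the prediction network on the chosen solution;
    # pred_net(x) is assumed to return per-cell logits of shape
    # (num_cells, num_classes).
    loss = F.cross_entropy(pred_net(x), target)
    opt_pred.zero_grad(); loss.backward(); opt_pred.step()

    # REINFORCE update of the selector: a lower prediction loss means the
    # sampled solution was a more effective training target, so use -loss
    # as the reward signal.
    reward = -loss.detach()
    sel_loss = -dist.log_prob(i) * reward
    opt_sel.zero_grad(); sel_loss.backward(); opt_sel.step()
```

In practice such a scheme would typically add a reward baseline to reduce the variance of the REINFORCE estimator; the essential point is only that the selector learns, per input, which of the many valid targets provides the most useful gradient signal for the prediction network.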
