Generalization of machine learning for problem reduction: a case study on travelling salesman problems

Combinatorial optimization plays an important role in real-world problem solving. In the big data era, the dimensionality of a combinatorial optimization problem is usually very large, which poses a significant challenge to existing solution methods. In this paper, we examine the generalization capability of a machine learning model for problem reduction on the classic travelling salesman problems (TSP). We demonstrate that our method can greedily remove decision variables from an optimization problem that are predicted not to be part of an optimal solution. More specifically, we investigate our model's capability to generalize on test instances that have not been seen during the training phase. We consider three scenarios where training and test instances are different in terms of: 1) problem characteristics; 2) problem sizes; and 3) problem types. Our experiments show that this machine learning based technique can generalize reasonably well over a wide range of TSP test instances with different characteristics or sizes. While the accuracy of predicting unused variables naturally deteriorates as a test instance is further away from the training set, we observe that even when tested on a different TSP problem variant, the machine learning model still makes useful predictions about which variables can be eliminated without significantly impacting solution quality.

[1]  A. Volgenant,et al.  Nonoptimal Edges for the Symmetric Traveling Salesman Problem , 1984, Oper. Res..

[2]  William J. Cook,et al.  The Traveling Salesman Problem: A Computational Study , 2007 .

[3]  Samy Bengio,et al.  Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  Juho Lauri,et al.  Fine-grained Search Space Classification for Hard Enumeration Variants of Subset Problems , 2019, AAAI.

[6]  Sergiy Butenko,et al.  Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem , 2011, Oper. Res..

[7]  Andreas T. Ernst,et al.  Using Statistical Measures and Machine Learning for Graph Reduction to Solve Maximum Weight Clique Problems , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  David S. Johnson,et al.  The Traveling Salesman Problem: A Case Study in Local Optimization , 2008 .

[9]  Hanif D. Sherali,et al.  On Tightening the Relaxations of Miller-Tucker-Zemlin Formulations for Asymmetric Traveling Salesman Problems , 2002, Oper. Res..

[10]  He He,et al.  Learning to Search in Branch and Bound Algorithms , 2014, NIPS.

[11]  William J. Cook,et al.  Chained Lin-Kernighan for Large Traveling Salesman Problems , 2003, INFORMS Journal on Computing.

[12]  Keld Helsgaun,et al.  An effective implementation of the Lin-Kernighan traveling salesman heuristic , 2000, Eur. J. Oper. Res..

[13]  Yoshua Bengio,et al.  Machine Learning for Combinatorial Optimization: a Methodological Tour d'Horizon , 2018, Eur. J. Oper. Res..

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Zhuwen Li,et al.  Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search , 2018, NeurIPS.

[16]  Qinghua Wu,et al.  A review on algorithms for maximum clique problems , 2015, Eur. J. Oper. Res..

[17]  Yuandong Tian,et al.  Learning to Perform Local Rewriting for Combinatorial Optimization , 2019, NeurIPS.

[18]  Chaitanya Swamy,et al.  Orienteering Algorithms for Generating Travel Itineraries , 2018, WSDM.

[19]  Jens Gottlieb,et al.  Evolutionary Computation in Combinatorial Optimization , 2006, Lecture Notes in Computer Science.

[20]  Toby Walsh,et al.  The Backbone of the Travelling Salesperson , 2005, IJCAI.

[21]  Jano I. van Hemert,et al.  Discovering the suitability of optimisation algorithms by learning from evolved instances , 2011, Annals of Mathematics and Artificial Intelligence.

[22]  Chih-Jen Lin,et al.  Trust Region Newton Method for Logistic Regression , 2008, J. Mach. Learn. Res..

[23]  Gerold Jäger,et al.  Effective Tour Searching for TSP by Contraction of Pseudo Backbone Edges , 2009, AAIM.

[24]  Andrew Lim,et al.  Learning Improvement Heuristics for Solving the Travelling Salesman Problem , 2019, ArXiv.

[25]  Brian W. Kernighan,et al.  An Effective Heuristic Algorithm for the Traveling-Salesman Problem , 1973, Oper. Res..

[26]  Peter Merz,et al.  Reducing the Size of Traveling Salesman Problem Instances by Fixing Edges , 2007, EvoCOP.

[27]  Andrew Lim,et al.  Learning Improvement Heuristics for Solving Routing Problems , 2019 .

[28]  Gerhard Reinelt,et al.  TSPLIB - A Traveling Salesman Problem Library , 1991, INFORMS J. Comput..

[29]  Gerold Jäger,et al.  A backbone based TSP heuristic for large instances , 2014, J. Heuristics.

[30]  R. Jonker,et al.  Transforming asymmetric into symmetric traveling salesman problems , 1983 .

[31]  Max Welling,et al.  Attention, Learn to Solve Routing Problems! , 2018, ICLR.

[32]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[33]  Rong Chen,et al.  An Exact Algorithm for Maximum k-Plexes in Massive Graphs , 2018, IJCAI.

[34]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[35]  Juho Lauri,et al.  Learning Multi-Stage Sparsification for Maximum Clique Enumeration , 2019, ArXiv.

[36]  Le Song,et al.  2 Common Formulation for Greedy Algorithms on Graphs , 2018 .

[37]  Manuel López-Ibáñez,et al.  Construct , Merge , Solve & Adapt : A New General Algorithm For Combinatorial Optimization , 2015 .

[38]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[39]  Le Song,et al.  Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction , 2019, AAAI.

[40]  Stefan Hougardy,et al.  Edge Elimination in TSP Instances , 2014, WG.

[41]  Alexandre Lacoste,et al.  Learning Heuristics for the TSP by Policy Gradient , 2018, CPAIOR.

[42]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..