Finding near-optimal independent sets at scale

The maximum independent set problem is NP-hard and particularly difficult to solve in sparse graphs, which typically take exponential time to solve exactly using the best-known exact algorithms. In this paper, we present two new novel heuristic algorithms for computing large independent sets on huge sparse graphs, which are intractable in practice. First, we develop an advanced evolutionary algorithm that uses fast graph partitioning with local search algorithms to implement efficient combine operations that exchange whole blocks of given independent sets. Though the evolutionary algorithm itself is highly competitive with existing heuristic algorithms on large social networks, we further show that it can be effectively used as an oracle to guess vertices that are likely to be in large independent sets. We then show how to combine these guesses with kernelization techniques in a branch-and-reduce-like algorithm to compute high-quality independent sets quickly in huge complex networks. Our experiments against a recent (and fast) exact algorithm for large sparse graphs show that our technique always computes an optimal solution when the exact solution is known, and it further computes consistent results on even larger instances where the solution is unknown. Ultimately, we show that identifying and removing vertices likely to be in large independent sets opens up the reduction space—which not only speeds up the computation of large independent sets drastically, but also enables us to compute high-quality independent sets on much larger instances than previously reported in the literature.

[1]  Peter Sanders,et al.  Engineering Route Planning Algorithms , 2009, Algorithmics of Large and Complex Networks.

[2]  Panos M. Pardalos,et al.  Finding maximum independent sets in graphs arising from coding theory , 2002, SAC '02.

[3]  Peter Sanders,et al.  Finding Near-Optimal Independent Sets at Scale , 2016, ALENEX.

[4]  Robert E. Tarjan,et al.  Finding a Maximum Independent Set , 1976, SIAM J. Comput..

[5]  Shinya Takahashi,et al.  A Simple and Faster Branch-and-Bound Algorithm for Finding a Maximum Clique , 2010, WALCOM.

[6]  Vangelis Th. Paschos,et al.  Fast Algorithms for max independent set , 2010, Algorithmica.

[7]  Andrew V. Goldberg,et al.  The Shortest Path Problem , 2009 .

[8]  Federico Della Croce,et al.  Combining Swaps and Node Weights in an Adaptive Greedy Approach for the Maximum Clique Problem , 2004, J. Heuristics.

[9]  Wayne J. Pullan,et al.  Simple ingredients leading to very efficient heuristics for the maximum clique problem , 2008, J. Heuristics.

[10]  Fedor V. Fomin,et al.  Exact exponential algorithms , 2013, CACM.

[11]  Roberto Battiti,et al.  Reactive Local Search for the Maximum Clique Problem1 , 2001, Algorithmica.

[12]  Frank Harary,et al.  A Procedure for Clique Detection Using the Group Matrix , 1957 .

[13]  Martin Nöllenburg,et al.  Evaluation of Labeling Strategies for Rotating Maps , 2014, SEA.

[14]  Leslie E. Trotter,et al.  Vertex packings: Structural properties and algorithms , 1975, Math. Program..

[15]  Marco Rosa,et al.  Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks , 2010, WWW.

[16]  Timothy A. Davis,et al.  The university of Florida sparse matrix collection , 2011, TOMS.

[17]  Pablo San Segundo,et al.  An exact bit-parallel algorithm for the maximum clique problem , 2011, Comput. Oper. Res..

[18]  Kengo Katayama,et al.  An effective local search for the maximum clique problem , 2005, Inf. Process. Lett..

[19]  David E. Goldberg,et al.  Genetic Algorithms, Tournament Selection, and the Effects of Noise , 1995, Complex Syst..

[20]  David E. Goldberg,et al.  Genetic Algorithms, Selection Schemes, and the Varying Effects of Noise , 1996, Evolutionary Computation.

[21]  Sergiy Butenko,et al.  Using critical sets to solve the maximum independent set problem , 2007, Oper. Res. Lett..

[22]  Pedro V. Sander,et al.  Efficient traversal of mesh edges using adjacency primitives , 2008, SIGGRAPH 2008.

[23]  Sebastiano Vigna,et al.  The webgraph framework I: compression techniques , 2004, WWW '04.

[24]  Peter Sanders,et al.  Graph Partitioning for Independent Sets , 2015, SEA.

[25]  Mauricio G. C. Resende,et al.  A Greedy Randomized Adaptive Search Procedure for Maximum Independent Set , 1994, Oper. Res..

[26]  Wayne J. Pullan,et al.  Dynamic Local Search for the Maximum Clique Problem , 2011, J. Artif. Intell. Res..

[27]  Thomas Bäck,et al.  An evolutionary heuristic for the maximum independent set problem , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[28]  Pierre Hansen,et al.  Variable neighborhood search for the maximum clique , 2001, Discret. Appl. Math..

[29]  David A. Bader,et al.  Benchmarking for Graph Clustering and Partitioning , 2014, Encyclopedia of Social Network Analysis and Mining.

[30]  Hiroshi Nagamochi,et al.  Confining sets and avoiding bottleneck cases: A simple maximum independent set algorithm in degree-3 graphs , 2013, Theor. Comput. Sci..

[31]  Kenneth de Jong,et al.  Evolutionary computation: a unified approach , 2007, GECCO.

[32]  Hiroshi Nagamochi,et al.  Exact Algorithms for Maximum Independent Set , 2013, ISAAC.

[33]  Chris Walshaw,et al.  A Combined Evolutionary Search and Multilevel Optimisation Approach to Graph-Partitioning , 2004, J. Glob. Optim..

[34]  Diogo Vieira Andrade,et al.  Fast local search for the maximum independent set problem , 2012, J. Heuristics.

[35]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[36]  Panos M. Pardalos,et al.  Improvements to MCS algorithm for the maximum clique problem , 2014, J. Comb. Optim..

[37]  Ke Xu,et al.  Combining MaxSAT Reasoning and Incremental Upper Bound for the Maximum Clique Problem , 2013, ICTAI.

[38]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[39]  Yu Liu,et al.  Towards Maximum Independent Sets on Massive Graphs , 2015, Proc. VLDB Endow..

[40]  Yoichi Iwata,et al.  Linear-Time FPT Algorithms via Network Flow , 2013, SODA.

[41]  Thomas Bäck,et al.  Evolutionary algorithms in theory and practice - evolution strategies, evolutionary programming, genetic algorithms , 1996 .

[42]  Pablo San Segundo,et al.  An improved bit parallel exact maximum clique algorithm , 2013, Optim. Lett..

[43]  Takuya Akiba,et al.  Branch-and-reduce exponential/FPT algorithms in practice: A case study of vertex cover , 2014, Theor. Comput. Sci..

[44]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[45]  Peter Sanders,et al.  Distributed Time-Dependent Contraction Hierarchies , 2010, SEA.

[46]  Qinghua Wu,et al.  A review on algorithms for maximum clique problems , 2015, Eur. J. Oper. Res..

[47]  Pavel A. Borisovsky,et al.  Experimental Comparison of Two Evolutionary Algorithms for the Independent Set Problem , 2003, EvoWorkshops.

[48]  Peter Willett,et al.  Graph-Theoretic Techniques for Macromolecular Docking , 2000, J. Chem. Inf. Comput. Sci..

[49]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[50]  室 章治郎 Michael R.Garey/David S.Johnson 著, "COMPUTERS AND INTRACTABILITY A guide to the Theory of NP-Completeness", FREEMAN, A5判変形判, 338+xii, \5,217, 1979 , 1980 .

[51]  Ashraf Aboulnaga,et al.  Scalable maximum clique computation using MapReduce , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).