A mixed breadth-depth first strategy for the branch and bound tree of Euclidean k-center problems

The k-center problem arises in many applications, such as facility location and data clustering. It is typically solved with a branch-and-bound tree traversed depth first, because depth-first traversal needs only linear space whereas breadth-first traversal needs exponential space. Depth-first traversal gains useful information quickly by reaching some leaves early, which helps prune the tree, but it may explore too many subtrees before reaching the optimal solution, resulting in a large search cost. To reach the optimal solution sooner, we propose a mixed breadth-depth traversal strategy. The main idea is to cycle through the nodes of the same level and, from each, recursively follow its first promising path until a leaf node (a solution) is reached. This yields many solutions with diverse structures, and the minimum among them gives a good upper bound on the optimal solution. In addition, we employ inexpensive lower and upper bounds on the enclosing balls, which often spares us from calling the computationally expensive exact minimum enclosing ball algorithm. Experiments show that the proposed strategy is significantly faster than plain branch and bound, especially as the number of centers and/or the required accuracy increases.
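
To illustrate how inexpensive bounds can avoid an exact minimum enclosing ball (MEB) computation, here is a minimal Python sketch. It is not the paper's method: the specific bounds (half the distance of an approximately farthest pair as the lower bound, and the covering radius around that pair's midpoint as the upper bound) and the names `cheap_meb_bounds` and `classify_cluster` are assumptions chosen for illustration. Both bounds are valid for any point set, since every enclosing ball must contain both points of the pair, and any ball that covers all points is at least as large as the minimum one.

```python
import math

def farthest(points, p):
    """Return the point in `points` farthest from p."""
    return max(points, key=lambda q: math.dist(p, q))

def cheap_meb_bounds(points):
    """Return (lower, upper) bounds on the MEB radius of `points`.

    Illustrative bounds only (an assumption, not taken from the paper).
    """
    a = farthest(points, points[0])
    b = farthest(points, a)
    # Any enclosing ball contains a and b, so its diameter is at least |ab|.
    lower = math.dist(a, b) / 2.0
    # A ball centered at the midpoint of a and b that covers every point
    # is a valid enclosing ball, hence an upper bound on the MEB radius.
    center = tuple((x + y) / 2.0 for x, y in zip(a, b))
    upper = max(math.dist(center, p) for p in points)
    return lower, upper

def classify_cluster(points, best_radius):
    """Decide whether the exact MEB is needed for a candidate cluster.

    `best_radius` is the best (smallest) solution radius found so far.
    """
    lower, upper = cheap_meb_bounds(points)
    if lower > best_radius:
        return "prune"       # cluster cannot fit within the current best
    if upper <= best_radius:
        return "accept"      # cluster certainly fits within the current best
    return "need_exact"      # bounds are inconclusive; call the exact MEB

if __name__ == "__main__":
    cluster = [(0.0, 0.0), (2.0, 0.0), (1.0, 1.7)]
    print(cheap_meb_bounds(cluster))
    print(classify_cluster(cluster, best_radius=1.2))
```

Only the `"need_exact"` outcome would require the expensive exact computation; the tighter the two bounds, the less often that case occurs.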
