A continuous facility location problem and its application to a clustering problem

We consider a new problem, which we denote by Continuous Facility Location (ConFL), and its application to the k-Means Problem. Problem ConFL is a natural extension of the Uncapacitated Facility Location Problem where a facility can be any point in Rq. The proposed algorithms are based on a primal-dual technique for spaces with constant dimensions. For the ConFL Problem, we present algorithms with approximation factors 3 + ε and 1.861 + ε for euclidean distances and 9 + ε for squared euclidean distances. For the k-Means Problem (that is restricted to squared euclidean distance), we present an algorithm with approximation factor 54+ ε. All algorithms have good practical behaviour in small dimensions. Comparisons with known algorithms show that the proposed algorithms have good practical behaviour.

[1]  Alan M. Frieze,et al.  Clustering Large Graphs via the Singular Value Decomposition , 2004, Machine Learning.

[2]  Amit Kumar,et al.  A simple linear time (1 + /spl epsiv/)-approximation algorithm for k-means clustering in any dimensions , 2004, 45th Annual IEEE Symposium on Foundations of Computer Science.

[3]  Dorit S. Hochbaum,et al.  Heuristics for the fixed cost median problem , 1982, Math. Program..

[4]  Éva Tardos,et al.  Approximation algorithms for facility location problems (extended abstract) , 1997, STOC '97.

[5]  Samir Khuller,et al.  Greedy strikes back: improved facility location algorithms , 1998, SODA '98.

[6]  Vijay V. Vazirani,et al.  Approximation algorithms for metric facility location and k-Median problems using the primal-dual schema and Lagrangian relaxation , 2001, JACM.

[7]  Qiang Du,et al.  Centroidal Voronoi Tessellations: Applications and Algorithms , 1999, SIAM Rev..

[8]  Yuying Li,et al.  A Newton Acceleration of the Weiszfeld Algorithm for Minimizing the Sum of Euclidean Distances , 1995, Comput. Optim. Appl..

[9]  Jiawei Zhang,et al.  Approximation algorithms for facility location problems , 2004 .

[10]  M. Shirosaki Another proof of the defect relation for moving targets , 1991 .

[11]  David B. Shmoys,et al.  Approximation algorithms for facility location problems , 2000, APPROX.

[12]  Yinyu Ye,et al.  An Efficient Algorithm for Minimizing a Sum of p-Norms , 1999, SIAM J. Optim..

[13]  David M. Mount,et al.  A local search approximation algorithm for k-means clustering , 2002, SCG '02.

[14]  Satish Rao,et al.  Approximation schemes for Euclidean k-medians and related problems , 1998, STOC '98.

[15]  Amin Saberi,et al.  A new greedy approach for facility location problems , 2002, STOC '02.

[16]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[17]  Mohammad Mahdian,et al.  Approximation Algorithms for Metric Facility Location Problems , 2006, SIAM J. Comput..

[18]  Evangelos Markakis,et al.  Greedy facility location algorithms analyzed using dual fitting with factor-revealing LP , 2002, JACM.

[19]  Jirí Matousek,et al.  On Approximate Geometric k -Clustering , 2000, Discret. Comput. Geom..

[20]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..