Clustering in Hypergraphs to Minimize Average Edge Service Time

We study the problem of clustering the vertices of a weighted hypergraph such that on average the vertices of each edge can be covered by a small number of clusters. This problem has many applications such as for designing medical tests, clustering files on disk servers, and placing network services on servers. The edges of the hypergraph model groups of items that are likely to be needed together, and the optimization criteria which we use can be interpreted as the average delay (or cost) to serve the items of a typical edge. We describe and analyze algorithms for this problem for the case in which the clusters have to be disjoint and for the case where clusters can overlap. The analysis is often subtle and reveals interesting structure and invariants that one can utilize. 1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems

[1]  R. Bru,et al.  C ´ ALCULO DEL VECTOR PAGERANK DE GOOGLE MEDIANTE EL M ´ ETODO ADITIVO DE SCHWARZ , 2005 .

[2]  Daniel B. Szyld,et al.  Additive Schwarz Iterations for Markov Chains , 2005, SIAM J. Matrix Anal. Appl..

[3]  Daniel B. Szyld,et al.  Weighted max norms, splittings, and overlapping additive Schwarz iterations , 1999, Numerische Mathematik.

[4]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[5]  Nikhil Bansal,et al.  Correlation Clustering , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[6]  David F. Gleich,et al.  Algorithms and Models for the Web Graph , 2014, Lecture Notes in Computer Science.

[7]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[8]  Marcello Pelillo,et al.  A Game-Theoretic Approach to Hypergraph Clustering , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  David S. Johnson,et al.  Approximation algorithms for combinatorial problems , 1973, STOC.

[10]  Vasek Chvátal,et al.  A Greedy Heuristic for the Set-Covering Problem , 1979, Math. Oper. Res..

[11]  Cristian Sminchisescu,et al.  Efficient Hypergraph Clustering , 2012, AISTATS.

[12]  Vahab Mirrokni,et al.  Overlapping clusters for distributed computation , 2012, WSDM '12.

[13]  Joseph Naor,et al.  Near optimal placement of virtual network functions , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[14]  Tamir Hazan,et al.  Multi-way Clustering Using Super-Symmetric Non-negative Tensor Factorization , 2006, ECCV.

[15]  Ran Duan A Simpler Scaling Algorithm for Weighted Matching in General Graphs , 2014, ArXiv.

[16]  J. Edmonds Paths, Trees, and Flowers , 1965, Canadian Journal of Mathematics.

[17]  Luca Trevisan,et al.  Non-approximability results for optimization problems on bounded degree instances , 2001, STOC '01.

[18]  Pietro Perona,et al.  Beyond pairwise clustering , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[19]  Harold N. Gabow,et al.  Data structures for weighted matching and nearest common ancestors with linking , 1990, SODA '90.

[20]  Isaac Keslassy,et al.  Minimizing Delay in Network Function Virtualization with Shared Pipelines , 2017, IEEE Transactions on Parallel and Distributed Systems.

[21]  László Lovász,et al.  On the ratio of optimal integral and fractional covers , 1975, Discret. Math..

[22]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[23]  D. Spielman,et al.  Spectral partitioning works: planar graphs and finite element meshes , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[24]  Bernhard Schölkopf,et al.  Learning with Hypergraphs: Clustering, Classification, and Embedding , 2006, NIPS.

[25]  A. M. Api,et al.  Allergic contact dermatitis: epidemiology, molecular mechanisms, in vitro methods and regulatory aspects , 2011, Cellular and Molecular Life Sciences.

[26]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Ameer Ahmed Abbasi,et al.  A survey on clustering algorithms for wireless sensor networks , 2007, Comput. Commun..

[28]  A. J. Cole,et al.  An Improved Algorithm for the Jardine-Sibson Method of Generating Overlapping Clusters , 1970, Computer/law journal.

[29]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[30]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[31]  Robert E. Tarjan,et al.  Clustering Social Networks , 2007, WAW.