Paper Information - Hypergraph Clustering for Finding Diverse and Experienced Groups

Hypergraph Clustering for Finding Diverse and Experienced Groups

When forming a team or group of individuals, we often seek a balance of expertise in a particular task while at the same time maintaining diversity of skills within each group. Here, we view the problem of finding diverse and experienced groups as clustering in hypergraphs with multiple edge types. The input data is a hypergraph with multiple hyperedge types -- representing information about past experiences of groups of individuals -- and the output is groups of nodes. In contrast to related problems on fair or balanced clustering, we model diversity in terms of variety of past experience (instead of, e.g., protected attributes), with a goal of forming groups that have both experience and diversity with respect to participation in edge types. In other words, both diversity and experience are measured from the types of the hyperedges. Our clustering model is based on a regularized version of an edge-based hypergraph clustering objective, and we also show how naive objectives actually have no diversity-experience tradeoff. Although our objective function is NP-hard to optimize, we design an efficient 2-approximation algorithm and also show how to compute bounds for the regularization hyperparameter that lead to meaningful diversity-experience tradeoffs. We demonstrate an application of this framework in online review platforms, where the goal is to curate sets of user reviews for a product type. In this context, "experience" corresponds to users familiar with the type of product, and "diversity" to users that have reviewed related products.
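As a rough illustration of the idea that both experience and diversity are measured from hyperedge types, the sketch below stores a typed hypergraph as a list of (type, node set) pairs and counts a group's participation in one type versus all other types. The toy data, the function names, and these particular counting measures are assumptions made for illustration only; they are not the regularized objective or the 2-approximation algorithm described in the abstract.

```python
from collections import Counter

# Hypothetical toy data: each hyperedge is (edge_type, set_of_nodes),
# e.g. a product category and the users who reviewed a product in it.
hyperedges = [
    ("kitchen", {"ana", "bo", "cy"}),
    ("kitchen", {"ana", "bo"}),
    ("garden", {"bo", "dee"}),
    ("tools", {"cy", "dee", "ana"}),
]


def participation(node, hyperedges):
    """Count how many hyperedges of each type contain the node."""
    return Counter(t for t, nodes in hyperedges if node in nodes)


def experience(group, edge_type, hyperedges):
    """Total participation of group members in hyperedges of edge_type."""
    return sum(participation(v, hyperedges)[edge_type] for v in group)


def diversity(group, edge_type, hyperedges):
    """Total participation of group members in hyperedges of other types."""
    return sum(
        count
        for v in group
        for t, count in participation(v, hyperedges).items()
        if t != edge_type
    )


group = {"ana", "bo"}
print(experience(group, "kitchen", hyperedges))
print(diversity(group, "kitchen", hyperedges))
```

Under these assumed measures, a group assigned to a type scores high on experience when its members appear in many hyperedges of that type, and high on diversity when they also appear in hyperedges of other types.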

Austin R. Benson | Nate Veldt | Ilya Amburg
