Reconciliation k-median: Clustering with Non-polarized Representatives

We propose a new variant of the k-median problem, where the objective function models not only the cost of assigning data points to cluster representatives, but also a penalty term for disagreement among the representatives. We motivate this novel problem by applications where we are interested in clustering data while avoiding selecting representatives that are too far from each other. For example, we may want to summarize a set of news sources, but avoid selecting ideologically-extreme articles in order to reduce polarization. To solve the proposed k-median formulation we adopt the local-search algorithm of Arya et al. [2], We show that the algorithm provides a provable approximation guarantee, which becomes constant under a mild assumption on the minimum number of points for each cluster. We experimentally evaluate our problem formulation and proposed algorithm on datasets inspired by the motivating applications. In particular, we experiment with data extracted from Twitter, the US Congress voting records, and popular news sources. The results show that our objective can lead to choosing less polarized groups of representatives without significant loss in representation fidelity.

[1]  W. Haemers Interlacing eigenvalues and graphs , 1995 .

[2]  Venkata Rama Kiran Garimella,et al.  Balancing information exposure in social networks , 2017, NIPS.

[3]  Shi Li,et al.  Approximating k-Median via Pseudo-Approximation , 2016, SIAM J. Comput..

[4]  R. M. Alvarez Birds of the Same Feather Tweet Together: Bayesian Ideal Point Estimation Using Twitter Data , 2014 .

[5]  Amin Saberi,et al.  A new greedy approach for facility location problems , 2002, STOC '02.

[6]  E. Kreyszig Introductory Functional Analysis With Applications , 1978 .

[7]  Ola Svensson,et al.  Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms , 2020, SIAM J. Comput..

[8]  Aristides Gionis,et al.  Joint Non-negative Matrix Factorization for Learning Ideological Leaning on Twi er , 2017 .

[9]  Vijay V. Vazirani,et al.  Approximation algorithms for metric facility location and k-Median problems using the primal-dual schema and Lagrangian relaxation , 2001, JACM.

[10]  Christos H. Papadimitriou,et al.  Worst-Case and Probabilistic Analysis of a Geometric Location Problem , 1981, SIAM J. Comput..

[11]  Sudipto Guha,et al.  A constant-factor approximation algorithm for the k-median problem (extended abstract) , 1999, STOC '99.

[12]  Sudipto Guha,et al.  Improved combinatorial algorithms for the facility location and k-median problems , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[13]  Kamesh Munagala,et al.  Local Search Heuristics for k-Median and Facility Location Problems , 2004, SIAM J. Comput..

[14]  Jaroslaw Byrka,et al.  Constant-factor approximation for ordered k-median , 2017, STOC.

[15]  Sean A. Munson,et al.  Encouraging Reading of Diverse Political Viewpoints with a Browser Widget , 2013, ICWSM.

[16]  Charalampos E. Tsourakakis,et al.  Minimizing Polarization and Disagreement in Social Networks , 2017, WWW.

[17]  Aravind Srinivasan,et al.  An Improved Approximation for k-Median and Positive Correlation in Budgeted Optimization , 2014, SODA.

[18]  Refael Hassin,et al.  Approximation algorithms for maximum dispersion , 1997, Oper. Res. Lett..

[19]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[20]  Sándor P. Fekete,et al.  Maximum dispersion and geometric maximum weight cliques , 2000, APPROX.

[21]  Aristides Gionis,et al.  Balancing Opposing Views to Reduce Controversy , 2016, ArXiv.

[22]  H. Yamakawa,et al.  Toward Delegated Democracy: Vote by Yourself, or Trust Your Network , 2007 .

[23]  Francesco Bonchi,et al.  Voting in social networks , 2009, CIKM.

[24]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[25]  Jaroslaw Byrka,et al.  Bi-Factor Approximation Algorithms for Hard Capacitated k-Median Problems , 2013, SODA.

[26]  Lada A. Adamic,et al.  Exposure to ideologically diverse news and opinion on Facebook , 2015, Science.