论文信息 - Representativity Fairness in Clustering

Representativity Fairness in Clustering

Incorporating fairness constructs into machine learning algorithms is a topic of much societal importance and recent interest. Clustering, a fundamental task in unsupervised learning that manifests across a number of web data scenarios, has also been subject of attention within fair ML research. In this paper, we develop a novel notion of fairness in clustering, called representativity fairness. Representativity fairness is motivated by the need to alleviate disparity across objects’ proximity to their assigned cluster representatives, to aid fairer decision making. We illustrate the importance of representativity fairness in real-world decision making scenarios involving clustering and provide ways of quantifying objects’ representativity and fairness over it. We develop a new clustering formulation, RFKM, that targets to optimize for representativity fairness along with clustering quality. Inspired by the K-Means framework, RFKM incorporates novel loss terms to formulate an objective function. The RFKM objective and optimization approach guides it towards clustering configurations that yield higher representativity fairness. Through an empirical evaluation over a variety of public datasets, we establish the effectiveness of our method. We illustrate that we are able to significantly improve representativity fairness at only marginal impact to clustering quality.

Savitha Sam Abraham | P Deepak | Deepak P

[1] Savitha Sam Abraham,et al. Fairness in Clustering with Multiple Sensitive Attributes , 2019, EDBT.

[2] Sara Ahmadian,et al. Clustering without Over-Representation , 2019, KDD.

[3] Nisheeth K. Vishnoi,et al. Stable and Fair Classification , 2019, ICML.

[4] Raj Jain,et al. A Quantitative Measure Of Fairness And Discrimination For Resource Allocation In Shared Computer Systems , 1998, ArXiv.

[5] Kamesh Munagala,et al. Proportionally Fair Clustering , 2019, ICML.

[6] Jie Zhao,et al. A review of moving object trajectory clustering algorithms , 2016, Artificial Intelligence Review.

[7] Reuben Binns,et al. On the apparent conflict between individual and group fairness , 2019, FAT*.

[8] Eric Granger,et al. Clustering with Fairness Constraints: A Flexible and Scalable Approach , 2019, ArXiv.

[9] Krishna P. Gummadi,et al. Incremental Fairness in Two-Sided Market Platforms: On Smoothly Updating Recommendations , 2019, AAAI 2020.

[10] P. Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[11] Ricardo Baeza-Yates,et al. FA*IR: A Fair Top-k Ranking Algorithm , 2017, CIKM.