论文信息 - An Interpretable Graph-based Mapping of Trustworthy Machine Learning Research

An Interpretable Graph-based Mapping of Trustworthy Machine Learning Research

There is an increasing interest in ensuring machine learning (ML) frameworks behave in a socially responsible manner and are deemed trustworthy. Although considerable progress has been made in the field of Trustworthy ML (TwML) in the recent past, much of the current characterization of this progress is qualitative. Consequently, decisions about how to address issues of trustworthiness and future research goals are often left to the interested researcher. In this paper, we present the first quantitative approach to characterize the comprehension of TwML research. We build a co-occurrence network of words using a web-scraped corpus of more than 7,000 peer-reviewed recent ML papers—consisting of papers both related and unrelated to TwML. We use community detection to obtain semantic clusters of words in this network that can infer relative positions of TwML topics. We propose an innovative fingerprinting algorithm to obtain probabilistic similarity scores for individual words, then combine them to give a paper-level relevance score. The outcomes of our analysis inform a number of interesting insights on advancing the field of TwML research.

Subhabrata Majumdar | Noemi Derzsy | Rajat Malik

[1] Harald Steck,et al. Calibrated recommendations , 2018, RecSys.

[2] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[3] Sagar Kamarthi,et al. Correction: Novel keyword co-occurrence network-based methods to foster systematic reviews of scientific literature , 2017, PloS one.

[4] Eugene Santos,et al. Explaining Reward Functions in Markov Decision Processes , 2019, FLAIRS.

[5] Kristina Lerman,et al. A Survey on Bias and Fairness in Machine Learning , 2019, ACM Comput. Surv..

[6] Alessandro Vespignani,et al. Mapping the physics research space: a machine learning approach , 2019, EPJ Data Science.

[7] Taoying Li,et al. Co-Occurrence Network of High-Frequency Words in the Bioinformatics Literature: Structural Characteristics and Evolution , 2018, Applied Sciences.

[8] Kush R. Varshney,et al. Socially Responsible AI Algorithms: Issues, Purposes, and Challenges , 2021, Journal of Artificial Intelligence Research.

[9] Ehsan Toreini,et al. The relationship between trust in AI and trustworthy machine learning technologies , 2019, FAT*.

[10] H. Stanley,et al. The science of science: from the perspective of complex systems , 2017 .

[11] Philippe Lamontagne,et al. Towards a Robust and Trustworthy Machine Learning System Development , 2021, ArXiv.