Unpacking a model: An interactive visualization of a text similarity algorithm for legal documents

This paper presents a functional prototype for an interactive web-based interface i_sift developed to foreground the decision-making process of an algorithm that detects similarities in legal texts through word embeddings. Using this as a case study in computational social science, our goal is, first, to highlight the importance of making computational tools and methods transparent to social scientists. Secondly, we suggest an approach that accomplishes this using methods and principles from interactive machine learning and the algorithmic experience framework.

[1]  Weng-Keen Wong,et al.  Principles of Explanatory Debugging to Personalize Interactive Machine Learning , 2015, IUI.

[2]  Daniel A. Keim,et al.  Visual Analytics for Topic Model Optimization based on User-Steerable Speculative Execution , 2019, IEEE Transactions on Visualization and Computer Graphics.

[3]  Perry R. Cook,et al.  Human model evaluation in interactive supervised learning , 2011, CHI.

[4]  Shie-Jue Lee,et al.  A Similarity Measure for Text Classification and Clustering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[5]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[6]  Annika Wærn,et al.  Towards Algorithmic Experience: Initial Efforts for Social Media Contexts , 2018, CHI.

[7]  Li Chen,et al.  Trust building with explanation interfaces , 2006, IUI '06.

[8]  Emilee J. Rader,et al.  Explanations as Mechanisms for Supporting Algorithmic Transparency , 2018, CHI.

[9]  Nicholas Diakopoulos,et al.  Algorithmic Transparency in the News Media , 2017 .

[10]  John Riedl,et al.  Explaining collaborative filtering recommendations , 2000, CSCW '00.

[11]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[12]  Maya Cakmak,et al.  Power to the People: The Role of Humans in Interactive Machine Learning , 2014, AI Mag..

[13]  Ian H. Witten,et al.  Interactive machine learning: letting users build classifiers , 2002, Int. J. Hum. Comput. Stud..

[14]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[15]  J. Nadal,et al.  Manifesto of computational social science , 2012 .