Are We Closing the Loop Yet? Gaps in the Generalizability of VIS4ML Research

Visualization for machine learning (VIS4ML) research aims to help experts apply their prior knowledge to develop, understand, and improve the performance of machine learning models. In conceiving VIS4ML systems, researchers characterize the nature of human knowledge to support human-in-the-loop tasks, design interactive visualizations to make ML components interpretable and elicit knowledge, and evaluate the effectiveness of human-model interchange. We survey recent VIS4ML papers to assess the generalizability of research contributions and claims in enabling human-in-the-loop ML. Our results show potential gaps between the current scope of VIS4ML research and aspirations for its use in practice. We find that while papers argue that VIS4ML systems are applicable beyond the specific conditions studied, their conclusions are often overfitted to non-representative scenarios, are based on interactions with a small set of ML experts and well-understood datasets, fail to acknowledge crucial dependencies, and hinge on decisions that lack justification. We discuss approaches to close the gap between aspirations and research claims and suggest documentation practices to report generality constraints that better acknowledge the exploratory nature of VIS4ML research.
