Tracing and Visualizing Human-ML/AI Collaborative Processes through Artifacts of Data Work

Automated Machine Learning (AutoML) technology can lower barriers in data work yet still requires human intervention to be functional. However, the complex and collaborative process resulting from humans and machines trading off work makes it difficult to trace what was done, by whom (or what), and when. In this research, we construct a taxonomy of data work artifacts that captures AutoML and human processes. We present a rigorous methodology for its creation and discuss its transferability to the visual design process. We operationalize the taxonomy through the development of AutoML Trace a visual interactive sketch showing both the context and temporality of human-ML/AI collaboration in data work. Finally, we demonstrate the utility of our approach via a usage scenario with an enterprise software development team. Collectively, our research process and findings explore challenges and fruitful avenues for developing data visualization tools that interrogate the sociotechnical relationships in automated data work. Availability of Supplemental Materials: https://osf.io/3nmyj/?view_only=19962103d58b45d289b5c83421f48b36

[1]  K. Vrotsou,et al.  A Model for Types and Levels of Automation in Visual Analytics: a Survey, a Taxonomy, and Examples , 2022, IEEE Transactions on Visualization and Computer Graphics.

[2]  Nikolay O. Nikitin,et al.  Automated Evolutionary Approach for the Design of Composite Machine Learning Pipelines , 2021, Future Gener. Comput. Syst..

[3]  Duen Horng Chau,et al.  A Survey of Human‐Centered Evaluations in Human‐Centered Machine Learning , 2021, Comput. Graph. Forum.

[4]  Padhraic Smyth,et al.  Automating data science , 2021, Commun. ACM.

[5]  Praveen K. Paritosh,et al.  “Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI , 2021, CHI.

[6]  Jeffrey M. Rzeszotarski,et al.  Designing Interactive Transfer Learning Tools for ML Non-Experts , 2021, CHI.

[7]  Allan Dafoe,et al.  Ethics and Governance of Artificial Intelligence: Evidence from a Survey of Machine Learning Researchers , 2021, J. Artif. Intell. Res..

[8]  Elena García Barriocanal,et al.  Traceability for Trustworthy AI: A Review of Models and Tools , 2021, Big Data Cogn. Comput..

[9]  Justin D. Weisz,et al.  Model LineUpper: Supporting Interactive Model Comparison at Multiple Levels for AutoML , 2021, IUI.

[10]  Marius Lindauer,et al.  Auto-Pytorch: Multi-Fidelity MetaLearning for Efficient and Robust AutoDL , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Karrie Karahalios,et al.  Deconstructing Categorization in Visualization Recommendation: A Taxonomy and Comparative Study , 2021, IEEE Transactions on Visualization and Computer Graphics.

[12]  Erick Oduor,et al.  AutoDS: Towards Human-Centered Automation of Data Science , 2021, CHI.

[13]  Aditya G. Parameswaran,et al.  Whither AutoML? Understanding the Role of Automation in Machine Learning Workflows , 2021, CHI.

[14]  Anamaria Crisan,et al.  Fits and Starts: Enterprise Use of AutoML and the Role of Humans in the Loop , 2021, CHI.

[15]  Soya Park,et al.  How Much Automation Does a Data Scientist Want? , 2021, ArXiv.

[16]  Chen Chen,et al.  VEGA: Towards an End-to-End Configurable AutoML Pipeline , 2020, ArXiv.

[17]  Theresa-Marie Rhyne,et al.  Challenges in Evaluating Interactive Visual Machine Learning Systems , 2020, IEEE Computer Graphics and Applications.

[18]  Oshin Agarwal,et al.  Large Scale Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training , 2020, ArXiv.

[19]  Shubhra Kanti Karmaker Santu,et al.  AutoML to Date and Beyond: Challenges and Opportunities , 2020, ACM Comput. Surv..

[20]  Melanie Tory,et al.  Passing the Data Baton : A Retrospective Analysis on Data Science Work and Workers , 2020, IEEE Transactions on Visualization and Computer Graphics.

[21]  Tamara Munzner,et al.  Table Scraps: An Actionable Framework for Multi-Table Data Wrangling From An Artifact Study of Computational Journalism , 2020, IEEE Transactions on Visualization and Computer Graphics.

[22]  David Jonker,et al.  Towards Evaluating Exploratory Model Building Process with AutoML Systems , 2020, ArXiv.

[23]  Miriah Meyer,et al.  Insights From Experiments With Rigor in an EvoBio Design Study , 2020, IEEE Transactions on Visualization and Computer Graphics.

[24]  Habib Asseiss Neto,et al.  NASirt: AutoML based learning with instance-level complexity information , 2020, ArXiv.

[25]  K. Gadhave,et al.  Trrack: A Library for Provenance-Tracking in Web-Based Visualizations , 2020, 2020 IEEE Visualization Conference (VIS).

[26]  Jeffrey Heer,et al.  Boba: Authoring and Visualizing Multiverse Analyses , 2020, IEEE Transactions on Visualization and Computer Graphics.

[27]  Marius Lindauer,et al.  Auto-Sklearn 2.0: The Next Generation , 2020, ArXiv.

[28]  Madeleine Udell,et al.  AutoML Pipeline Selection: Efficiently Navigating the Combinatorial Space , 2020, KDD.

[29]  W. Zame,et al.  AutoCP: Automated Pipelines for Accurate Prediction Intervals , 2020 .

[30]  J. Vanschoren,et al.  Adaptation Strategies for Automated Machine Learning on Evolving Data , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Juliana Freire,et al.  PipelineProfiler: A Visual Analytics Tool for the Exploration of AutoML Pipelines , 2020, IEEE Transactions on Visualization and Computer Graphics.

[32]  Sungsoo Ray Hong,et al.  Human Factors in Model Interpretability: Industry Practices, Challenges, and Needs , 2020, Proc. ACM Hum. Comput. Interact..

[33]  Harmanpreet Kaur,et al.  Interpreting Interpretability: Understanding Data Scientists' Use of Interpretability Tools for Machine Learning , 2020, CHI.

[34]  Qian Yang,et al.  Re-examining Whether, Why, and How Human-AI Interaction Is Uniquely Difficult to Design , 2020, CHI.

[35]  Stefano Alletto,et al.  RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning , 2020, ArXiv.

[36]  Amy X. Zhang,et al.  How do Data Science Workers Collaborate? Roles, Workflows, and Tools , 2020, Proc. ACM Hum. Comput. Interact..

[37]  Justin D. Weisz,et al.  Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems , 2020, IUI.

[38]  Alexander G. Gray,et al.  AutoAIViz: opening the blackbox of automated artificial intelligence with conditional parallel coordinates , 2019, IUI.

[39]  Hui Xiong,et al.  A Comprehensive Survey on Transfer Learning , 2019, Proceedings of the IEEE.

[40]  Steve Oney,et al.  How Data Scientists Use Computational Notebooks for Real-Time Collaboration , 2019, Proc. ACM Hum. Comput. Interact..

[41]  Lior Rokach,et al.  DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering , 2019, KDD.

[42]  Tim Althoff,et al.  Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis , 2019, CHI.

[43]  Octavio Loyola-González,et al.  Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View , 2019, IEEE Access.

[44]  Alejandro Barredo Arrieta,et al.  Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI , 2019, Inf. Fusion.

[45]  Marta Mattoso,et al.  Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering , 2019, 2019 IEEE/ACM Workflows in Support of Large-Scale Science (WORKS).

[46]  Parikshit Ram,et al.  Human-AI Collaboration in Data Science , 2019, Proc. ACM Hum. Comput. Interact..

[47]  Kaiyong Zhao,et al.  AutoML: A Survey of the State-of-the-Art , 2019, Knowl. Based Syst..

[48]  Mennatallah El-Assady,et al.  explAIner: A Visual Analytics Framework for Interactive and Explainable Machine Learning , 2019, IEEE Transactions on Visualization and Computer Graphics.

[49]  Alexander M. Rush,et al.  Visual Interaction with Deep Learning Models through Collaborative Semantic Inference , 2019, IEEE Transactions on Visualization and Computer Graphics.

[50]  Eric Horvitz,et al.  Updates in Human-AI Teams: Understanding and Addressing the Performance/Compatibility Tradeoff , 2019, AAAI.

[51]  Juliana Freire,et al.  Visus: An Interactive System for Automatic Machine Learning Model Building and Curation , 2019, HILDA@SIGMOD.

[52]  Andrés Montoyo,et al.  AutoML Strategy Based on Grammatical Evolution: A Case Study about Knowledge Discovery from Text , 2019, ACL.

[53]  Bernd Bischl,et al.  An Open Source AutoML Benchmark , 2019, ArXiv.

[54]  Sherif Sakr,et al.  Automated Machine Learning: State-of-The-Art and Open Challenges , 2019, ArXiv.

[55]  Leilani Battle,et al.  Characterizing Exploratory Visual Analysis: A Literature Review and Evaluation of Analytic Provenance in Tableau , 2019, Comput. Graph. Forum.

[56]  Paul N. Bennett,et al.  Guidelines for Human-AI Interaction , 2019, CHI.

[57]  Marco F. Huber,et al.  Benchmark and Survey of Automated Machine Learning Frameworks , 2019, J. Artif. Intell. Res..

[58]  Laura von Rueden,et al.  Informed Machine Learning – A Taxonomy and Survey of Integrating Prior Knowledge into Learning Systems , 2019, IEEE Transactions on Knowledge and Data Engineering.

[59]  Vidya Setlur,et al.  Inferencing underspecified natural language utterances in visual analysis , 2019, IUI.

[60]  Kalyan Veeramachaneni,et al.  ATMSeer: Increasing Transparency and Controllability in Automated Machine Learning , 2019, CHI.

[61]  Jeffrey Heer Agency plus automation: Designing artificial intelligence into interactive systems , 2019, Proceedings of the National Academy of Sciences.

[62]  Dominik Dellermann,et al.  The Future of Human-AI Collaboration: A Taxonomy of Design Knowledge for Hybrid Intelligence Systems , 2019, HICSS.

[63]  D. Keim,et al.  VIS4ML: An Ontology for Visual Analytics Assisted Machine Learning , 2019, IEEE Transactions on Visualization and Computer Graphics.

[64]  Thomas Zichner,et al.  KnowledgePearls: Provenance-Based Visualization Retrieval , 2019, IEEE Transactions on Visualization and Computer Graphics.

[65]  Chris Russell,et al.  Explaining Explanations in AI , 2018, FAT.

[66]  Isabelle Guyon,et al.  Taking Human out of Learning Applications: A Survey on Automated Machine Learning , 2018, 1810.13306.

[67]  Marina Jirotka,et al.  Ethical governance is essential to building trust in robotics and artificial intelligence systems , 2018, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[68]  Inioluwa Deborah Raji,et al.  Model Cards for Model Reporting , 2018, FAT.

[69]  Tim Kraska,et al.  VizML: A Machine Learning Approach to Visualization Recommendation , 2018, CHI.

[70]  Dae Won Kim,et al.  OBOE: Collaborative Filtering for AutoML Model Selection , 2018, KDD.

[71]  Qingquan Song,et al.  Auto-Keras: An Efficient Neural Architecture Search System , 2018, KDD.

[72]  Joaquin Vanschoren,et al.  ML-Schema: Exposing the Semantics of Machine Learning with Schemas and Ontologies , 2018, ICML 2018.

[73]  Jake VanderPlas,et al.  A Practical Taxonomy of Reproducibility for Machine Learning Research , 2018 .

[74]  Aditya G. Parameswaran,et al.  Accelerating Human-in-the-loop Machine Learning: Challenges and Opportunities , 2018, DEEM@SIGMOD.

[75]  Neil Houlsby,et al.  Transfer Learning with Neural AutoML , 2018, NeurIPS.

[76]  Daniel A. Keim,et al.  What you see is what you can change: Human-centered machine learning by interactive visualization , 2017, Neurocomputing.

[77]  D. Sculley,et al.  Google Vizier: A Service for Black-Box Optimization , 2017, KDD.

[78]  Gina Neff,et al.  Critique and Contribute: A Practice-Based Framework for Improving Critical Data Studies and Data Science , 2017, Big Data.

[79]  Alex Endert,et al.  The State of the Art in Integrating Machine Learning into Visual Analytics , 2017, Comput. Graph. Forum.

[80]  Jun Zhu,et al.  Towards better analysis of machine learning models: A visual analytics perspective , 2017, Vis. Informatics.

[81]  Michael Burch,et al.  A Taxonomy and Survey of Dynamic Graph Visualization , 2017, Comput. Graph. Forum.

[82]  Quoc V. Le,et al.  Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[83]  Stefania Mariano,et al.  Artifacts in knowledge management research: a systematic literature review and future research directions , 2016, J. Knowl. Manag..

[84]  Bogdan Gabrys,et al.  Adapting Multicomponent Predictive Systems using Hybrid Adaptation Strategies with Auto-WEKA in Process Industry , 2016, AutoML@ICML.

[85]  Randal S. Olson,et al.  Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science , 2016, GECCO.

[86]  Kanit Wongsuphasawat,et al.  Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations , 2016, IEEE Transactions on Visualization and Computer Graphics.

[87]  Alex Endert,et al.  Characterizing Provenance in Visualization and Data Analysis: An Organizational Framework of Provenance Types and Purposes , 2016, IEEE Transactions on Visualization and Computer Graphics.

[88]  Jacky Akoka,et al.  A Taxonomy of Evaluation Methods for Information Systems Artifacts , 2015, J. Manag. Inf. Syst..

[89]  James Mussell Raw Data is an Oxymoron , 2014 .

[90]  Pieter Jan Stappers,et al.  Probes, toolkits and prototypes: three approaches to making in codesigning , 2014 .

[91]  T. Munzner,et al.  A Multi-Level Typology of Abstract Visualization Tasks , 2013, IEEE Transactions on Visualization and Computer Graphics.

[92]  Mark Frank,et al.  'There's no such thing as raw data': exploring the socio-technical life of a government dataset , 2013, WebSci.

[93]  Jan Muntermann,et al.  A method for taxonomy development and its application in information systems , 2013, Eur. J. Inf. Syst..

[94]  Carla E. Brodley,et al.  Dis-function: Learning distance functions interactively , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[95]  Kevin Leyton-Brown,et al.  Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.

[96]  Chris North,et al.  Semantic interaction for visual text analytics , 2012, CHI.

[97]  Jason Dykes,et al.  Human-Centered Approaches in Geovisualization Design: Investigating Multiple Methods Through a Long-Term Case Study , 2011, IEEE Transactions on Visualization and Computer Graphics.

[98]  Jonathan C. Roberts,et al.  Visual comparison for information visualization , 2011, Inf. Vis..

[99]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[100]  Jane Huffman Hayes,et al.  Assessing traceability of software engineering artifacts , 2010, Requirements Engineering.

[101]  William Buxton,et al.  Usability evaluation considered harmful (some of the time) , 2008, CHI.

[102]  Charlotte P. Lee,et al.  Boundary Negotiating Artifacts: Unbinding the Routine of Boundary Objects and Embracing Chaos in Collaborative Work , 2007, Computer Supported Cooperative Work (CSCW).

[103]  Eliane Regina de Almeida Valiati,et al.  A taxonomy of tasks for guiding the evaluation of multidimensional visualizations , 2006, BELIV '06.

[104]  Leonard J. Bass,et al.  Identifying gaps between HCI, software engineering, and design, and boundary objects to bridge them , 2004, CHI EA '04.

[105]  Kristian Kreiner,et al.  Tacit knowledge management: the role of artifacts , 2002, J. Knowl. Manag..

[106]  Amrit Tiwana,et al.  A design knowledge management system to support collaborative information product evolution , 2001, Decis. Support Syst..

[107]  Christopher D. Wickens,et al.  A model for types and levels of human interaction with automation , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[108]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[109]  José Pablo Cambronero Sánchez Mining Software Artifacts for use in Automated Machine Learning , 2021 .

[110]  Aditya G. Parameswaran,et al.  Lux: Always-on Visualization Recommendations for Exploratory Data Science , 2021, ArXiv.

[111]  Daniel A. Keim,et al.  Learning and Teaching in Co-Adaptive Guidance for Mixed-Initiative Visual Analytics , 2020, EuroVA@Eurographics/EuroVis.

[112]  Aditya G. Parameswaran,et al.  A Human-in-the-loop Perspective on AutoML: Milestones and the Road Ahead , 2019, IEEE Data Eng. Bull..

[113]  Michael Gleicher,et al.  Considerations for Visualizing Comparison , 2018, IEEE Transactions on Visualization and Computer Graphics.

[114]  Tamara Munzner,et al.  Bridging from Goals to Tasks with Design Study Analysis Reports , 2018, IEEE Transactions on Visualization and Computer Graphics.

[115]  Michael Gleicher,et al.  Examples of model visualization cards Workflow status Examples of problem discovery cards , 2018 .

[116]  Sebastian Schelter,et al.  Automatically Tracking Metadata and Provenance of Machine Learning Experiments , 2017 .

[117]  Janet Salmons Expect Originality! Using Taxonomies to Structure Assignments that Support Original Work , 2008 .