Using Simulation to Aid the Design and Optimization of Intelligent User Interfaces for Quality Assurance Processes in Machine Learning

Many mission-critical applications of machine learning (ML) in the real-world require a quality assurance (QA) process before the decisions or predictions of an ML model can be deployed. Because QA4ML users have to view a non-trivial amount of data and perform many input actions to correct errors made by the ML model, an optimally-designed user interface (UI) can reduce the cost of interactions significantly. A UI’s effectiveness can be affected by many factors, such as the number of data objects processed concurrently, the types of commands for correcting errors, and the availability of algorithms for assisting users. We propose using simulation to aid the design and optimization of intelligent user interfaces for QA4ML processes. In particular, we focus on simulating the combined effects of human intelligence in selecting appropriate commands and algorithms, and machine intelligence in providing a collection of general-purpose algorithms for reordering data objects to be quality-assured.

[1]  Leif Azzopardi,et al.  How query cost affects search behavior , 2013, SIGIR.

[2]  Jakob Nielsen,et al.  Heuristic evaluation of user interfaces , 1990, CHI '90.

[3]  Douglas C. Engelbart,et al.  Display-Selection Techniques for Text Manipulation , 1967 .

[4]  Timo Ropinski,et al.  Classifier‐Guided Visual Correction of Noisy Labels for Image Classification Tasks , 2020, Comput. Graph. Forum.

[5]  Yu Song,et al.  Using GOMS and NASA-TLX to Evaluate Human–Computer Interaction Process in Interactive Segmentation , 2017, Int. J. Hum. Comput. Interact..

[6]  Melanie Tory,et al.  Evaluating Visualizations: Do Expert Reviews Work? , 2005, IEEE Computer Graphics and Applications.

[7]  Gunther Heidemann,et al.  Inter-active learning of ad-hoc classifiers for video visual analytics , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[8]  Cathleen Wharton,et al.  Testing a walkthrough methodology for theory-based design of walk-up-and-use interfaces , 1990, CHI '90.

[9]  Shumin Zhai,et al.  View size and pointing difficulty in multi-scale navigation , 2004, AVI.

[10]  Jerry Alan Fails,et al.  Interactive machine learning , 2003, IUI '03.

[11]  P. Fitts The information capacity of the human motor system in controlling the amplitude of movement. , 1954, Journal of experimental psychology.

[12]  Ben Shneiderman,et al.  Strategies for evaluating information visualization tools: multi-dimensional in-depth long-term case studies , 2006, BELIV '06.

[13]  Krzysztof Z. Gajos,et al.  SUPPLE: automatically generating user interfaces , 2004, IUI '04.

[14]  Antti Oulasvirta,et al.  Computational Support for Functionality Selection in Interaction Design , 2017, ACM Trans. Comput. Hum. Interact..

[15]  I. Scott MacKenzie,et al.  Extending Fitts' law to two-dimensional tasks , 1992, CHI.

[16]  David E. Kieras,et al.  The GOMS family of user interface analysis techniques: comparison and contrast , 1996, TCHI.

[17]  Leysia Palen,et al.  Voice-mail diary studies for naturalistic data capture under mobile conditions , 2002, CSCW '02.

[18]  Shumin Zhai,et al.  Beyond Fitts' law: models for trajectory-based HCI tasks , 1997, CHI Extended Abstracts.

[19]  ChenMin,et al.  MI3: Machine-initiated Intelligent Interaction for Interactive Classification and Data Reconstruction , 2021 .

[20]  David E. Kieras,et al.  Using GOMS for user interface design and evaluation: which technique? , 1996, TCHI.

[21]  Changjian Chen,et al.  An Interactive Method to Improve Crowdsourced Annotations , 2019, IEEE Transactions on Visualization and Computer Graphics.

[22]  Yinan Zhang,et al.  Information Retrieval as Card Playing: A Formal Model for Optimizing Interactive Retrieval Interface , 2015, SIGIR.

[23]  Chris North,et al.  An Evaluation of Microarray Visualization Tools for Biological Insight , 2004, IEEE Symposium on Information Visualization.

[24]  Leif Azzopardi,et al.  The economics in interactive information retrieval , 2011, SIGIR.

[25]  M. Csíkszentmihályi,et al.  The Experience Sampling Method , 2014 .

[26]  Leif Azzopardi,et al.  Modelling interaction with economic models of search , 2014, SIGIR.

[27]  Steve Draper,et al.  Questionnaires as a software evaluation tool , 1983, CHI '83.

[28]  Vittorio Ferrari,et al.  Fluid Annotation: A Human-Machine Collaboration Interface for Full Image Annotation , 2018, ACM Multimedia.

[29]  David S. Ebert,et al.  An Ontological Framework for Supporting the Design and Evaluation of Visual Analytics Systems , 2019, Comput. Graph. Forum.

[30]  Yang Chen,et al.  Interactive Correction of Mislabeled Training Data , 2019, 2019 IEEE Conference on Visual Analytics Science and Technology (VAST).

[31]  Ed H. Chi,et al.  A Position Paper on 'Living Laboratories': Rethinking Ecological Designs and Experimentation in Human-Computer Interaction , 2009, HCI.

[32]  Clayton Lewis,et al.  Learning to use a text processing system: Evidence from “thinking aloud” protocols , 1982, CHI '82.

[33]  Allen Newell,et al.  The keystroke-level model for user performance time with interactive systems , 1980, CACM.