Analyzing a human-in-the-loop's decisions for the detection of data poisoning

Human-in-the-loop (HITL) is the process of combining the power of a machine or computer system and human intelligence to develop human-aware machine learning (ML) models. The HITL process creates a continuous feedback loop between human and machine, enabling the trained model to continuously improve as edge cases present themselves without the need to fine-tune the model from scratch. Several advantages of utilizing HITL ML systems are to avoid bias, ensure consistency and accuracy, improve efficiency, and provide transparency. However, adding human involvement also invites human mistakes. Occasionally, a HITL system may actually degrade the algorithm rather than improve it: mislabeling an object in an object detection algorithm, incorrectly scoring an algorithm’s output, making misclicks and typos, and other human errors cause the HITL to make a mistake based on the facts presented to it. These errors being made by the user, intentional or not, can be considered a form of data poisoning. To understand the effects of a HITL’s choices on an ML model, several pieces of information during the HITL process can be observed, i.e.,: the time taken by the user to provide input on an output or on a specific object class, as well as an evaluation of the consistency of submitted valid inputs, among other factors. Information extracted from the HITL’s decision-making process can provide insights into whether poor choices are being made by the user (i.e., data poisoning) and identify where, when, and why these choices are being made. Many state-of-the-art models can be utilized for this work, such as ResNet-50, DarkNet-53, Xception, among others. However, for this work, we are less focused on the model being used and more focused on the procedure for tracking HITL performance to maximize model improvement. Nevertheless, this work will consider a pretrained model, though the approach will be model agnostic. The dataset used in this research is the publicly available “Flowers Recognition” dataset available on Kaggle.1

[1]  Dianhai Yu,et al.  Multi-Task Learning for Multiple Language Translation , 2015, ACL.

[2]  Carla E. Brodley,et al.  Content-Based Retrieval from Medical Image Databases: A Synergy of Human Interaction, Machine Learning and Computer Vision , 1999, AAAI/IAAI.

[3]  Andreas Holzinger,et al.  Interactive machine learning for health informatics: when do we need the human-in-the-loop? , 2016, Brain Informatics.

[4]  B. Parton Sign language recognition and translation: a multidisciplined approach from the field of artificial intelligence. , 2005, Journal of deaf studies and deaf education.

[5]  Stanton R. Price,et al.  Object detection on aerial imagery to improve situational awareness for ground vehicles , 2020 .

[6]  Tadej Petric,et al.  Teaching robots to cooperate with humans in dynamic manipulation tasks based on multi-modal human-in-the-loop approach , 2014, Auton. Robots.

[7]  C. G. Looney,et al.  Human-in-the-loop control with majority vote neural networks , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[8]  Dumitru Erhan,et al.  Deep Neural Networks for Object Detection , 2013, NIPS.

[9]  Jason Weston,et al.  Dialogue Learning With Human-In-The-Loop , 2016, ICLR.

[10]  D. Treanor,et al.  The human‐in‐the‐loop: an evaluation of pathologists’ interaction with artificial intelligence in clinical practice , 2021, Histopathology.

[11]  Jordan L. Boyd-Graber,et al.  Closing the Loop: User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System , 2018, IUI.