论文信息 - Human-AI Interactive and Continuous Sensemaking: A Case Study of Image Classification using Scribble Attention Maps

Human-AI Interactive and Continuous Sensemaking: A Case Study of Image Classification using Scribble Attention Maps

Advances in Artificial Intelligence (AI), especially the stunning achievements of Deep Learning (DL) in recent years, have shown AI/DL models possess remarkable understanding towards the logic reasoning behind the solved tasks. However, human understanding towards what knowledge is captured by deep neural networks is still elementary and this has a detrimental effect on human’s trust in the decisions made by AI systems. Explainable AI (XAI) is a hot topic in both AI and HCI communities in order to open up the blackbox to elucidate the reasoning processes of AI algorithms in such a way that makes sense to humans. However, XAI is only half of human-AI interaction and research on the other half - human’s feedback on AI explanations together with AI making sense of the feedback - is generally lacking. Human cognition is also a blackbox to AI and effective human-AI interaction requires unveiling both blackboxes to each other for mutual sensemaking. The main contribution of this paper is a conceptual framework for supporting effective human-AI interaction, referred to as interactive and continuous sensemaking (HAICS). We further implement this framework in an image classification application using deep Convolutional Neural Network (CNN) classifiers as a browser-based tool that displays network attention maps to the human for explainability and collects human’s feedback in the form of scribble annotations overlaid onto the maps. Experimental results using a real-world dataset has shown significant improvement of classification accuracy (the AI performance) with the HAICS framework.

[1] Chris North,et al. Interactive Artificial Intelligence: Designing for the "Two Black Boxes" Problem , 2020, Computer.

[2] D. Ring,et al. Is Deep Learning On Par with Human Observers for Detection of Radiographically Visible and Occult Fractures of the Scaphoid? , 2020, Clinical orthopaedics and related research.

[3] Jaelle Scheuerman,et al. On Interactive Machine Learning and the Potential of Cognitive Feedback , 2020, ArXiv.

[4] Chenhao Tan,et al. Harnessing Explanations to Bridge AI and Humans , 2020, ArXiv.

[5] Robert O. Briggs,et al. Machines as teammates: A research agenda on AI in team collaboration , 2020, Inf. Manag..

[6] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[7] Livia C Gouvea,et al. Explanations and sensemaking with AI and HCI , 2019, Proceedings of the IX Latin American Conference on Human Computer Interaction.

[8] Dewar D. Finlay,et al. Human Centered Artificial Intelligence: Weaving UX into Algorithmic Decision Making , 2019, Romanian Conference on Human-Computer Interaction.

[9] Eric Horvitz,et al. Updates in Human-AI Teams: Understanding and Addressing the Performance/Compatibility Tradeoff , 2019, AAAI.

[10] Richard Banks,et al. Emerging Perspectives in Human-Centered Machine Learning , 2019, CHI Extended Abstracts.

[11] Dominik Dellermann,et al. The Future of Human-AI Collaboration: A Taxonomy of Design Knowledge for Hybrid Intelligence Systems , 2019, HICSS.

[12] Rama Chellappa,et al. Sensemaking Research Roadmap , 2018 .

[13] Ismail Ben Ayed,et al. On Regularized Losses for Weakly-supervised CNN Segmentation , 2018, ECCV.

[14] Abhishek Das,et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[15] Zhe L. Lin,et al. Top-Down Neural Attention by Excitation Backprop , 2016, International Journal of Computer Vision.

[16] Ece Kamar,et al. Directions in Hybrid Intelligence: Complementing AI Systems with Human Intelligence , 2016, IJCAI.

[17] Wendy E. Mackay,et al. Human-Centred Machine Learning , 2016, CHI Extended Abstracts.

[18] Jian Sun,et al. ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Christoph H. Lampert,et al. Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation , 2016, ECCV.

[20] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[21] Bolei Zhou,et al. Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Weng-Keen Wong,et al. Principles of Explanatory Debugging to Personalize Interactive Machine Learning , 2015, IUI.

[24] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25] Thomas Brox,et al. Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[26] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Maya Cakmak,et al. Power to the People: The Role of Humans in Interactive Machine Learning , 2014, AI Mag..

[28] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[29] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[30] Andrew Blake,et al. Geodesic star convexity for interactive image segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[32] Leo Grady,et al. Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Harry Shum,et al. Lazy snapping , 2004, ACM Trans. Graph..

[34] Vladimir Kolmogorov,et al. "GrabCut": interactive foreground extraction using iterated graph cuts , 2004, ACM Trans. Graph..

[35] R. Beasley. Beasley's Surgery of the Hand , 2003 .

[36] Jerry Alan Fails,et al. Interactive machine learning , 2003, IUI '03.

[37] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.