From Machine Learning to Explainable AI

The success of statistical machine learning (ML) methods made the field of Artificial Intelligence (AI) so popular again, after the last AI winter. Meanwhile deep learning approaches even exceed human performance in particular tasks. However, such approaches have some disadvantages besides of needing big quality data, much computational power and engineering effort; those approaches are becoming increasingly opaque, and even if we understand the underlying mathematical principles of such models they still lack explicit declarative knowledge. For example, words are mapped to high-dimensional vectors, making them unintelligible to humans. What we need in the future are context-adaptive procedures, i.e. systems that construct contextual explanatory models for classes of real-world phenomena. This is the goal of explainable AI, which is not a new field; rather, the problem of explainability is as old as AI itself. While rule-based approaches of early AI were comprehensible “glass-box” approaches at least in narrow domains, their weakness was in dealing with uncertainties of the real world. Maybe one step further is in linking probabilistic learning methods with large knowledge representations (ontologies) and logical approaches, thus making results re-traceable, explainable and comprehensible on demand.

[1]  Joshua B. Tenenbaum,et al.  Human-level concept learning through probabilistic program induction , 2015, Science.

[2]  Latanya Sweeney,et al.  Achieving k-Anonymity Privacy Protection Using Generalization and Suppression , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[3]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[4]  Antonino Staiano,et al.  A multi-step approach to time series analysis and gene expression clustering , 2006, Bioinform..

[5]  Pierangela Samarati,et al.  Generalizing Data to Provide Anonymity when Disclosing Information , 1998, PODS 1998.

[6]  Andreas Holzinger,et al.  Trends in Interactive Knowledge Discovery for Personalized Medicine: Cognitive Science meets Machine Learning , 2014, IEEE Intell. Informatics Bull..

[7]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[8]  Heiko Paulheim,et al.  Knowledge graph refinement: A survey of approaches and evaluation methods , 2016, Semantic Web.

[9]  Camelia-Mihaela Pintea,et al.  Towards interactive Machine Learning (iML): Applying Ant Colony Algorithms to Solve the Traveling Salesman Problem with the Human-in-the-Loop Approach , 2016, CD-ARES.

[10]  Charu C. Aggarwal,et al.  On k-Anonymity and the Curse of Dimensionality , 2005, VLDB.

[11]  Gunnar Rätsch,et al.  Large Scale Multiple Kernel Learning , 2006, J. Mach. Learn. Res..

[12]  Sebastian Thrun,et al.  Towards fully autonomous driving: Systems and algorithms , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[13]  T. Ormerod,et al.  Human performance on the traveling salesman problem , 1996, Perception & psychophysics.

[14]  Antonino Staiano,et al.  Clustering and visualization approaches for human cell cycle gene expression data analysis , 2008, Int. J. Approx. Reason..

[15]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[16]  William J. Cook,et al.  The Traveling Salesman Problem: A Computational Study , 2007 .

[17]  Song-Chun Zhu,et al.  Learning Hybrid Image Templates (HIT) by Information Projection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Andrew Gordon Wilson,et al.  The Human Kernel , 2015, NIPS.

[19]  H FriedmanJerome On Bias, Variance, 0/1Loss, and the Curse-of-Dimensionality , 1997 .

[20]  Andreas Holzinger,et al.  On Computationally-Enhanced Visual Analysis of Heterogeneous Data and Its Application in Biomedical Informatics , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[21]  Andreas Holzinger,et al.  Interactive machine learning for health informatics: when do we need the human-in-the-loop? , 2016, Brain Informatics.

[22]  John W. Tukey,et al.  A Projection Pursuit Algorithm for Exploratory Data Analysis , 1974, IEEE Transactions on Computers.

[23]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Freddy Lécué,et al.  Explainable AI: The New 42? , 2018, CD-MAKE.

[25]  G. Laporte The traveling salesman problem: An overview of exact and approximate algorithms , 1992 .

[26]  Trisha Greenhalgh,et al.  Complexity science: The challenge of complexity in health care. , 2001, BMJ.

[27]  Sue Blackman,et al.  Beginning 3D Game Development with Unity: All-in-one, multi-platform game development , 2011 .

[28]  Richard M. Karp,et al.  Mapping the genome: some combinatorial problems arising in molecular biology , 1993, STOC.

[29]  Afra Zomorodian,et al.  Computational topology , 2010 .

[30]  Ira Assent,et al.  Morpheus: interactive exploration of subspace clustering , 2008, KDD.

[31]  Jerome H. Friedman,et al.  On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality , 2004, Data Mining and Knowledge Discovery.

[32]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[33]  Andreas Holzinger Biomedical Informatics: Discovering Knowledge in Big Data , 2014 .

[34]  Søren Brunak,et al.  A Travelling Salesman Approach to Protein Conformation , 1989, Complex Syst..

[35]  Jake Porway,et al.  A stochastic graph grammar for compositional object representation and recognition , 2009, Pattern Recognit..

[36]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[37]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[38]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[39]  Andreas Holzinger,et al.  Interactive Machine Learning (iML): a challenge for Game-based approaches , 2016, NIPS 2016.

[40]  Benjamin M. Good,et al.  Crowdsourcing for bioinformatics , 2013, Bioinform..

[41]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[42]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[43]  Vin de Silva,et al.  Persistent Cohomology and Circular Coordinates , 2009, SCG '09.

[44]  Andreas Holzinger,et al.  Introduction to MAchine Learning & Knowledge Extraction (MAKE) , 2017, Mach. Learn. Knowl. Extr..

[45]  Chris Clifton,et al.  δ-Presence without Complete World Knowledge , 2010, IEEE Transactions on Knowledge and Data Engineering.

[46]  Martin Ebner,et al.  Successful implementation of user-centered game based learning in higher education: An example from civil engineering , 2007, Comput. Educ..

[47]  Ninghui Li,et al.  t-Closeness: Privacy Beyond k-Anonymity and l-Diversity , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[48]  Eleanor Jane Budge,et al.  Collective intelligence for translational medicine: Crowdsourcing insights and innovation from an interdisciplinary biomedical research community , 2015, Annals of medicine.

[49]  Adrien Treuille,et al.  Predicting protein structures with a multiplayer online game , 2010, Nature.

[50]  Andreas Holzinger,et al.  Towards the Augmented Pathologist: Challenges of Explainable-AI in Digital Pathology , 2017, ArXiv.

[51]  Lotfi A. Zadeh,et al.  Toward Human Level Machine Intelligence - Is It Achievable? The Need for a Paradigm Shift , 2008, IEEE Computational Intelligence Magazine.

[52]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.

[53]  Song-Chun Zhu,et al.  Learning AND-OR Templates for Object Recognition and Detection , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[54]  Andreas Holzinger,et al.  Human-Computer Interaction and Knowledge Discovery (HCI-KDD): What Is the Benefit of Bringing Those Two Fields to Work Together? , 2013, CD-ARES.

[55]  Carla E. Brodley,et al.  ASSERT: A Physician-in-the-Loop Content-Based Retrieval System for HRCT Image Databases , 1999, Comput. Vis. Image Underst..

[56]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[57]  Andreas Holzinger,et al.  On Knowledge Discovery and Interactive Intelligent Visualization of Biomedical Data - Challenges in Human-Computer Interaction & Biomedical Informatics , 2012, DATA.

[58]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Igor Jurisica,et al.  Knowledge Discovery and Data Mining in Biomedical Informatics: The Future Is in Integrative, Interactive Machine Learning Solutions , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[60]  P. Plsek,et al.  The challenge of complexity in health care , 2001, BMJ : British Medical Journal.

[61]  Daniel A. Keim,et al.  Analysis of Patient Groups and Immunization Results Based on Subspace Clustering , 2015, BIH.

[62]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[63]  Andreas Holzinger,et al.  Discovering Medical Knowledge Using Visual Analytics – a survey on methods for systems biology and ? omics data – , 2015 .

[64]  Andreas Holzinger,et al.  Big Complex Biomedical Data: Towards a Taxonomy of Data , 2012, ICETE.

[65]  R. Lathrop The protein threading problem with sequence amino acid interaction preferences is NP-complete. , 1994, Protein engineering.

[66]  A Min Tjoa,et al.  Current Advances, Trends and Challenges of Machine Learning and Knowledge Extraction: From Machine Learning to Explainable AI , 2018, CD-MAKE.

[67]  Frank Puppe,et al.  Introspective Subgroup Analysis for Interactive Knowledge Refinement , 2006, FLAIRS Conference.

[68]  ASHWIN MACHANAVAJJHALA,et al.  L-diversity: privacy beyond k-anonymity , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[69]  Massimo Ferri,et al.  Why Topology for Machine Learning and Knowledge Extraction? , 2018, Mach. Learn. Knowl. Extr..

[70]  Thomas L. Griffiths,et al.  Modeling human function learning with Gaussian processes , 2008, NIPS.

[71]  Igor Jurisica,et al.  Knowledge Discovery and interactive Data Mining in Bioinformatics - State-of-the-Art, future challenges and research directions , 2014, BMC Bioinformatics.

[72]  Edgar R. Weippl,et al.  Witnesses for the Doctor in the Loop , 2015, BIH.

[73]  Burr Settles,et al.  From Theories to Queries: Active Learning in Practice , 2011 .

[74]  Michael Rabadi,et al.  Kernel Methods for Machine Learning , 2015 .

[75]  Zoubin Ghahramani,et al.  Probabilistic machine learning and artificial intelligence , 2015, Nature.

[76]  Hayit Greenspan,et al.  Content-Based Image Retrieval in Radiology: Current Status and Future Directions , 2010, Journal of Digital Imaging.

[77]  Deniz Erdogmus,et al.  The Future of Human-in-the-Loop Cyber-Physical Systems , 2013, Computer.

[78]  Hirokazu Anai,et al.  Learning Multi-Way Relations via Tensor Decomposition With Neural Networks , 2018, AAAI.

[79]  Gaston H. Gonnet,et al.  Using traveling salesman problem algorithms for evolutionary tree construction , 2000, Bioinform..

[80]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[81]  Andreas Holzinger,et al.  Knowledge Discovery from Complex High Dimensional Data , 2016, Solving Large Scale Learning Tasks.

[82]  Gaston H. Gonnet,et al.  Using traveling salesman problem algorithms for evolutionary tree construction , 1999 .

[83]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.