A Survey on Uncertainty Reasoning and Quantification for Decision Making: Belief Theory Meets Deep Learning

An in-depth understanding of uncertainty is the first step to making effective decisions under uncertainty. Deep/machine learning (ML/DL) has been hugely leveraged to solve complex problems involved with processing high-dimensional data. However, reasoning and quantifying different types of uncertainties to achieve effective decision-making have been much less explored in ML/DL than in other Artificial Intelligence (AI) domains. In particular, belief/evidence theories have been studied in KRR since the 1960s to reason and measure uncertainties to enhance decision-making effectiveness. We found that only a few studies have leveraged the mature uncertainty research in belief/evidence theories in ML/DL to tackle complex problems under different types of uncertainty. In this survey paper, we discuss several popular belief theories and their core ideas dealing with uncertainty causes and types and quantifying them, along with the discussions of their applicability in ML/DL. In addition, we discuss three main approaches that leverage belief theories in Deep Neural Networks (DNNs), including Evidential DNNs, Fuzzy DNNs, and Rough DNNs, in terms of their uncertainty causes, types, and quantification methods along with their applicability in diverse problem domains. Based on our in-depth survey, we discuss insights, lessons learned, limitations of the current state-of-the-art bridging belief theories and ML/DL, and finally, future research directions.

[1]  Hugo Larochelle,et al.  Efficient Learning of Deep Boltzmann Machines , 2010, AISTATS.

[2]  M. Nagy,et al.  Multi agent trust for belief combination on the Semantic Web , 2008, 2008 4th International Conference on Intelligent Computer Communication and Processing.

[3]  Audun Jøsang,et al.  Subjective Logic: A Formalism for Reasoning Under Uncertainty , 2016 .

[4]  Li Fu,et al.  A novel fuzzy deep-learning approach to traffic flow prediction with uncertain spatial–temporal data features , 2018, Future Generation Computer Systems.

[5]  Jin Wang,et al.  Inverse Problemin DSmT and Its Applications in Trust Management , 2007, The First International Symposium on Data, Privacy, and E-Commerce (ISDPE 2007).

[6]  Minho Lee,et al.  A fuzzy convolutional neural network for text sentiment analysis , 2018, J. Intell. Fuzzy Syst..

[7]  Stephen Cole Kleene,et al.  On notation for ordinal numbers , 1938, Journal of Symbolic Logic.

[8]  Ali Elkamel,et al.  Short-term wind speed forecasting framework based on stacked denoising auto-encoders with rough ANN , 2020 .

[9]  Jaouad Boumhidi,et al.  Fuzzy deep learning based urban traffic incident detection , 2017, Cognitive Systems Research.

[10]  Jin-Hee Cho,et al.  Deep Learning for Predicting Dynamic Uncertain Opinions in Network Data , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[11]  Feng Jiang,et al.  Deep Learning and Dempster-Shafer Theory Based Insider Threat Detection , 2020 .

[12]  Yu-Jun Zheng,et al.  A Pythagorean-Type Fuzzy Deep Denoising Autoencoder for Industrial Accident Early Warning , 2017, IEEE Transactions on Fuzzy Systems.

[13]  Ramin Yasdi,et al.  Combining Rough Sets Learning- and Neural Learning-method to deal with uncertain and imprecise information , 1995, Neurocomputing.

[14]  Yu-Jun Zheng,et al.  Airline Passenger Profiling Based on Fuzzy Deep Machine Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Jinho D. Choi,et al.  Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation , 2021, EMNLP.

[16]  Murat Sensoy,et al.  Evidential Deep Learning to Quantify Classification Uncertainty , 2018, NeurIPS.

[17]  D. Rus,et al.  Deep Evidential Regression , 2019, NeurIPS.

[18]  Qianping Wang,et al.  A Fuzzy Logic-Based Trust Model in Grid , 2009, 2009 International Conference on Networks Security, Wireless Communications and Trusted Computing.

[19]  Zhendong Wu,et al.  Damaged fingerprint classification by Deep Learning with fuzzy feature points , 2016, 2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI).

[20]  B. Hammond Ontology , 2004, Lawrence Booth’s Book of Visions.

[21]  Ding Shuai,et al.  Trustworthy Software Evaluation Using Utility Based Evidence Theory , 2009 .

[22]  Hans-Jürgen Zimmermann,et al.  An application-oriented view of modeling uncertainty , 2000, Eur. J. Oper. Res..

[23]  C Pahl-Wostl,et al.  Integrated management of natural resources: dealing with ambiguous issues, multiple actors and diverging frames. , 2005, Water science and technology : a journal of the International Association on Water Pollution Research.

[24]  Jin-Hee Cho,et al.  Deep Learning Based Scalable Inference of Uncertain Opinions , 2018, 2018 IEEE International Conference on Data Mining (ICDM).

[25]  Florentin Smarandache,et al.  Neutrosophic masses & indeterminate models: Applications to information fusion , 2012, 2012 15th International Conference on Information Fusion.

[26]  James L. McClelland,et al.  James L. McClelland, David Rumelhart and the PDP Research Group, Parallel distributed processing: explorations in the microstructure of cognition . Vol. 1. Foundations . Vol. 2. Psychological and biological models . Cambridge MA: M.I.T. Press, 1987. , 1989, Journal of Child Language.

[27]  Debi Prosad Dogra,et al.  Surveillance scene representation and trajectory abnormality detection using aggregation of multiple concepts , 2018, Expert Syst. Appl..

[28]  Audun Jøsang,et al.  Uncertainty Characteristics of Subjective Opinions , 2018, 2018 21st International Conference on Information Fusion (FUSION).

[29]  S. I. Kashkevich,et al.  A two-level automated pattern recognition complex , 1979 .

[30]  Zhenyu Liu,et al.  TBM performance prediction with Bayesian optimization and automated machine learning , 2020 .

[31]  Verónica Dahl,et al.  Quantification in a Three-Valued Logic for Natural Language Question-Answering Systems , 1979, IJCAI.

[32]  Feng Chen,et al.  Multifaceted Uncertainty Estimation for Label-Efficient Deep Learning , 2020, NeurIPS.

[33]  Audun Jøsang,et al.  A Logic for Uncertain Probabilities , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[34]  Roberto Cipolla,et al.  Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding , 2015, BMVC.

[35]  Youyong Kong,et al.  A Hierarchical Fused Fuzzy Deep Neural Network for Data Classification , 2017, IEEE Transactions on Fuzzy Systems.

[36]  Florentin Smarandache,et al.  Advances and Applications of DSmT for Information Fusion , 2004 .

[37]  Okyay Kaynak,et al.  Rough Deep Neural Architecture for Short-Term Wind Speed Forecasting , 2017, IEEE Transactions on Industrial Informatics.

[38]  Nikhil S. Shirwandkar,et al.  Extractive Text Summarization Using Deep Learning , 2018, 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA).

[39]  Florentin Smarandache,et al.  The Effective Use of the DSmT for Multi-Class Classification , 2014 .

[40]  Giorgio Corani,et al.  A tree augmented classifier based on Extreme Imprecise Dirichlet Model , 2010, Int. J. Approx. Reason..

[41]  Catherine K. Murphy Combining belief functions when evidence conflicts , 2000, Decis. Support Syst..

[42]  Xujiang Zhao,et al.  Multidimensional Uncertainty-Aware Evidential Neural Networks , 2020, AAAI.

[43]  Michael E. Tipping Bayesian Inference: An Introduction to Principles and Practice in Machine Learning , 2003, Advanced Lectures on Machine Learning.

[44]  Wang Yaonan,et al.  Fuzzy-rough Neural Network and Its Application to Vowel Recognition , 2006 .

[45]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[46]  Éloi Bossé,et al.  Measuring ambiguity in the evidence theory , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[47]  Peter D. Hoff,et al.  A First Course in Bayesian Statistical Methods , 2009 .

[48]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[49]  Nurali Virani,et al.  Variational Encoder-Based Reliable Classification , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[50]  M. Masson,et al.  Pairwise classifier combination in the transferable belief model , 2005, 2005 7th International Conference on Information Fusion.

[51]  Nacim Ramdani,et al.  Enhanced Multiplex Binary PIR Localization Using the Transferable Belief Model , 2019, IEEE Sensors Journal.

[52]  Thierry Denoeux,et al.  An evidential classifier based on Dempster-Shafer theory and deep learning , 2021, Neurocomputing.

[53]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[54]  Mohua Banerjee,et al.  Kleene Algebras and Logic: Boolean and Rough Set Representations, 3-Valued, Rough Set and Perp Semantics , 2015, Stud Logica.

[55]  M. Lesani,et al.  Fuzzy Trust Inference in Trust Graphs and its Application in Semantic Web Social Networks , 2006, 2006 World Automation Congress.

[56]  Nurali Virani,et al.  Justification-Based Reliability in Machine Learning , 2019, AAAI.

[57]  Deqiang Han,et al.  Comparative study of contradiction measures in the theory of belief functions , 2012, 2012 15th International Conference on Information Fusion.

[58]  L. A. Zadeh,et al.  Fuzzy logic and approximate reasoning , 1975, Synthese.

[59]  Claudia Pahl-Wostl,et al.  Toward a Relational Concept of Uncertainty: about Knowing Too Little, Knowing Too Differently, and Accepting Not to Know , 2008 .

[60]  Lev V. Utkin,et al.  The imprecise Dirichlet model as a basis for a new boosting classification algorithm , 2015, Neurocomputing.

[61]  R. Yager On the dempster-shafer framework and new combination rules , 1987, Inf. Sci..

[62]  Yi Zhang,et al.  Fuzzy trust recommendation based on collaborative filtering for mobile ad-hoc networks , 2008, 2008 33rd IEEE Conference on Local Computer Networks (LCN).

[63]  Christophe Osswald,et al.  Understanding the large family of Dempster-Shafer theory's fusion operators - a decision-based measure , 2006, 2006 9th International Conference on Information Fusion.

[64]  Jordi Casas,et al.  Unsupervised Incident Detection Model in Urban and Freeway Networks , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[65]  L. Zadeh Probability measures of Fuzzy events , 1968 .

[66]  Shu Hu,et al.  Uncertainty Aware Semi-Supervised Learning on Graph Data , 2020, NeurIPS.

[67]  Jianwen Chen,et al.  Dealing with Uncertainty: A Survey of Theories and Practices , 2013, IEEE Transactions on Knowledge and Data Engineering.

[68]  L. A. ZADEH,et al.  The concept of a linguistic variable and its application to approximate reasoning - I , 1975, Inf. Sci..

[69]  W. Walker,et al.  Defining Uncertainty: A Conceptual Basis for Uncertainty Management in Model-Based Decision Support , 2003 .

[70]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[71]  Ujjwal Maulik,et al.  A Survey on Fuzzy Deep Neural Networks , 2020, ACM Comput. Surv..

[72]  Jens Honer,et al.  Motion State Classification for Automotive LIDAR Based on Evidential Grid Maps and Transferable Belief Model , 2018, 2018 21st International Conference on Information Fusion (FUSION).

[73]  R. Govindaraju,et al.  On selection of kernel parametes in relevance vector machines for hydrologic applications , 2007 .

[74]  Francisco Guil,et al.  Associative classification based on the Transferable Belief Model , 2019, Knowl. Based Syst..

[75]  Stephan Günnemann,et al.  Posterior Network: Uncertainty Estimation without OOD Samples via Density-Based Pseudo-Counts , 2020, NeurIPS.

[76]  Erik Blasch,et al.  Overview of Dempster-Shafer and belief function tracking methods , 2013, Defense, Security, and Sensing.

[77]  Ronald R. Yager,et al.  Entropy and Specificity in a Mathematical Theory of Evidence , 2008, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[78]  Lotfi A. Zadeh,et al.  A Simple View of the Dempster-Shafer Theory of Evidence and Its Implication for the Rule of Combination , 1985, AI Mag..

[79]  Francisco Javier García Castellano,et al.  Imprecise Classification with Non-parametric Predictive Inference , 2020, IPMU.

[80]  Feng Chen,et al.  Uncertainty-Aware Opinion Inference Under Adversarial Attacks , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[81]  Feng Chen,et al.  Uncertainty-based Decision Making Using Deep Reinforcement Learning , 2019, 2019 22th International Conference on Information Fusion (FUSION).

[82]  Feng Chen,et al.  Quantifying Classification Uncertainty using Regularized Evidential Neural Networks , 2019, ArXiv.

[83]  Joaquin Quiñonero-Candela,et al.  Learning with Uncertainty: Gaussian Processes and Relevance Vector Machines , 2004 .

[84]  Mark J. F. Gales,et al.  Predictive Uncertainty Estimation via Prior Networks , 2018, NeurIPS.

[85]  P. Lingras Rough Neural Networks , 1996 .

[86]  Audun Jøsang,et al.  An Algebra for Assessing Trust in Certification Chains , 1999, NDSS.

[87]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[88]  C. L. Philip Chen,et al.  Fuzzy Restricted Boltzmann Machine for the Enhancement of Deep Learning , 2015, IEEE Transactions on Fuzzy Systems.

[89]  Murat Sensoy,et al.  Uncertainty-Aware Deep Classifiers Using Generative Models , 2020, AAAI.

[90]  R. Hankin A Generalization of the Dirichlet Distribution , 2010 .

[91]  Henri Prade,et al.  Representation and combination of uncertainty with belief functions and possibility measures , 1988, Comput. Intell..

[92]  Florentin Smarandache,et al.  Advances and applications of DSmT for information fusion - Collected works - Volume 3 , 2009 .

[93]  Yuanjie Zheng,et al.  An evolving recurrent interval type-2 intuitionistic fuzzy neural network for online learning and time series prediction , 2019, Appl. Soft Comput..

[94]  Albert Y. Zomaya,et al.  Recent Trends in Computer Networks and Distributed Systems Security , 2012, Communications in Computer and Information Science.

[95]  Hao Wang,et al.  A Survey on Bayesian Deep Learning , 2016, ACM Comput. Surv..

[96]  P. S. Sastry,et al.  An Overview of Restricted Boltzmann Machines , 2019, Journal of the Indian Institute of Science.

[97]  George J. Klir,et al.  Uncertainty in the dempster-shafer Theory - A Critical Re-examination , 1990 .

[98]  Arief B. Koesdwiady,et al.  Big-data-generated traffic flow prediction using deep learning and dempster-shafer theory , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[99]  Daniel W. Manchala,et al.  Trust metrics, models and protocols for electronic commerce transactions , 1998, Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183).

[100]  Zdzislaw Pawlak,et al.  VAGUENESS AND UNCERTAINTY: A ROUGH SET PERSPECTIVE , 1995, Comput. Intell..

[101]  Willem Waegeman,et al.  Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods , 2019, Machine Learning.

[102]  SEED: Sound Event Early Detection via Evidential Uncertainty , 2022, ArXiv.

[103]  Samia Nefti-Meziani,et al.  A fuzzy trust model for e-commerce , 2005, Seventh IEEE International Conference on E-Commerce Technology (CEC'05).

[104]  Yuichi Motai,et al.  Intra- and Inter-Fractional Variation Prediction of Lung Tumors Using Fuzzy Deep Learning , 2016, IEEE Journal of Translational Engineering in Health and Medicine.

[105]  Florentin Smarandache,et al.  Advances and Applications of DSmT for Information Fusion (Collected Works) , 2004 .

[106]  Christopher P. Reale,et al.  Multivariate Uncertainty in Deep Learning , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[107]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[108]  S. Fienberg When did Bayesian inference become "Bayesian"? , 2006 .

[109]  Marjolein B.A. van Asselt,et al.  Perspectives on uncertainty and risk , 2000 .

[110]  Jerry M. Mendel,et al.  Uncertainty measures for general type-2 fuzzy sets , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[111]  Erik M. Fredericks,et al.  Uncertainty in big data analytics: survey, opportunities, and challenges , 2019, Journal of Big Data.

[112]  Meera Narvekar,et al.  Hybrid auto text summarization using deep neural network and fuzzy logic system , 2017, 2017 International Conference on Inventive Computing and Informatics (ICICI).

[113]  Jerry M. Mendel,et al.  Uncertainty measures for interval type-2 fuzzy sets , 2007, Inf. Sci..

[114]  J. Andrew Bagnell,et al.  Improving robot navigation through self‐supervised online learning , 2006, J. Field Robotics.

[115]  Jie Geng,et al.  Fault Diagnosis Based on Non-Negative Sparse Constrained Deep Neural Networks and Dempster–Shafer Theory , 2020, IEEE Access.

[116]  Jerry M. Mendel,et al.  A comparative study of ranking methods, similarity measures and uncertainty measures for interval type-2 fuzzy sets , 2009, Inf. Sci..

[117]  A. Tversky,et al.  The framing of decisions and the psychology of choice. , 1981, Science.

[118]  Philippe Smets,et al.  The Transferable Belief Model , 1991, Artif. Intell..

[119]  Saeid Nahavandi,et al.  Neural Network-Based Uncertainty Quantification: A Survey of Methodologies and Applications , 2018, IEEE Access.

[120]  Igor Linkov,et al.  Model Uncertainty and Choices Made by Modelers: Lessons Learned from the International Atomic Energy Agency Model Intercomparisons † , 2003, Risk analysis : an official publication of the Society for Risk Analysis.

[121]  Ronald R. Yager,et al.  Pythagorean Membership Grades in Multicriteria Decision Making , 2014, IEEE Transactions on Fuzzy Systems.

[122]  Ren Zhang,et al.  A model with Fuzzy Granulation and Deep Belief Networks for exchange rate forecasting , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[123]  Lotfi A. Zadeh,et al.  On the Validity of Dempster''s Rule of Combination of Evidence , 1979 .

[124]  Yanning Zhang,et al.  Hybrid Genetic and Variational Expectation-Maximization Algorithm for Gaussian-Mixture-Model-Based Brain MR Image Segmentation , 2011, IEEE Transactions on Information Technology in Biomedicine.

[125]  E. F. Codd,et al.  Missing information (applicable and inapplicable) in relational databases , 1986, SGMD.

[126]  Theresa Beaubouef,et al.  Rough Sets , 2019, Lecture Notes in Computer Science.

[127]  L. Zadeh The role of fuzzy logic in the management of uncertainty in expert systems , 1983 .

[128]  P. Smets Data fusion in the transferable belief model , 2000, Proceedings of the Third International Conference on Information Fusion.

[129]  Roberto Cipolla,et al.  Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[130]  P. Walley Inferences from Multinomial Data: Learning About a Bag of Marbles , 1996 .

[131]  A. Kiureghian,et al.  Aleatory or epistemic? Does it matter? , 2009 .

[132]  Jiawei Xiang,et al.  DSmT-based three-layer method using multi-classifier to detect faults in hydraulic systems , 2021 .

[133]  Alex Kendall,et al.  What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.