Connectionist-based Dempster-Shafer evidential reasoning for data fusion

Dempster-Shafer evidence theory (DSET) is a popular paradigm for dealing with uncertainty and imprecision. Its corresponding evidential reasoning framework is theoretically attractive. However, there are outstanding issues that hinder its use in real-life applications. Two prominent issues in this regard are 1) the issue of basic probability assignments (masses) and 2) the issue of dependence among information sources. This paper attempts to deal with these issues by utilizing neural networks in the context of pattern classification application. First, a multilayer perceptron neural network with the mean squared error as a cost function is implemented to calculate, for each information source, posteriori probabilities for all classes. Second, an evidence structure construction scheme is developed for transferring the estimated posteriori probabilities to a set of masses along with the corresponding focal elements, from a Bayesian decision point of view. Third, a network realization of the Dempster-Shafer evidential reasoning is designed and analyzed, and it is further extended to a DSET-based neural network, referred to as DSETNN, to manipulate the evidence structures. In order to tackle the issue of dependence between sources, DSETNN is tuned for optimal performance through a supervised learning process. To demonstrate the effectiveness of the proposed approach, we apply it to three benchmark pattern classification problems. Experiments reveal that the DSETNN outperforms DSET and provide encouraging results in terms of classification accuracy and the speed of learning convergence.

[1]  Fulei Chu,et al.  A study on group decision-making based fault multi-symptom-domain consensus diagnosis , 2001, Reliab. Eng. Syst. Saf..

[2]  Lorenzo Bruzzone,et al.  An experimental comparison of neural and statistical non-parametric algorithms for supervised classification of remote-sensing images , 1996, Pattern Recognit. Lett..

[3]  Isabelle Bloch,et al.  Application of Dempster-Shafer evidence theory to unsupervised classification in multisource remote sensing , 1997, IEEE Trans. Geosci. Remote. Sens..

[4]  Simone G. O. Fiori,et al.  Nonsymmetric PDF estimation by artificial neurons: application to statistical characterization of reinforced composites , 2003, IEEE Trans. Neural Networks.

[5]  Hujun Yin,et al.  Self-organizing mixture networks for probability density estimation , 2001, IEEE Trans. Neural Networks.

[6]  Jesús Cid-Sueiro,et al.  Local estimation of posterior class probabilities to minimize classification errors , 2004, IEEE Transactions on Neural Networks.

[7]  M.E.C. Hull,et al.  Updating belief in Gordon-Shortliffe's hierarchical structure , 1994, Proceedings of ANZIIS '94 - Australian New Zealnd Intelligent Information Systems Conference.

[8]  Jing-Yu Yang,et al.  On the Evidence Inference Theory , 1996, Inf. Sci..

[9]  Marios M. Polycarpou,et al.  Using localizing learning to improve supervised learning algorithms , 2001, IEEE Trans. Neural Networks.

[10]  H. Trussell,et al.  Constructing membership functions using statistical data , 1986 .

[11]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[12]  V. V. S. Sarma,et al.  Estimation of fuzzy memberships from histograms , 1985, Inf. Sci..

[13]  Ronald R. Yager,et al.  Including probabilistic uncertainty in fuzzy logic controller modeling using Dempster-Shafer theory , 1995, IEEE Trans. Syst. Man Cybern..

[14]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[15]  Tomaso A. Poggio,et al.  Extensions of a Theory of Networks for Approximation and Learning , 1990, NIPS.

[16]  Constantinos S. Pattichis,et al.  Classification capacity of a modular neural network implementing neurally inspired architecture and training rules , 2004, IEEE Transactions on Neural Networks.

[17]  H. D. Cheng,et al.  Automatically Determine the Membership Function Based on the Maximum Entropy Principle , 1997, Inf. Sci..

[18]  P. Smets Data fusion in the transferable belief model , 2000, Proceedings of the Third International Conference on Information Fusion.

[19]  Mohamed A. Deriche,et al.  A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence , 2002, J. Artif. Intell. Res..

[20]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[21]  Thierry Denoeux,et al.  A neural network classifier based on Dempster-Shafer theory , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[22]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[23]  Edward J. Powers,et al.  Power quality disturbance waveform recognition using wavelet-based neural classifier. I. Theoretical foundation , 2000 .

[24]  Yue Min Zhu,et al.  Study of Dempster-Shafer theory for image segmentation applications , 2002, Image Vis. Comput..

[25]  Thierry Denoeux,et al.  A k-nearest neighbor classification rule based on Dempster-Shafer theory , 1995, IEEE Trans. Syst. Man Cybern..

[26]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[27]  K. Yamada Probability-possibility transformation based on evidence theory , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[28]  D. Louis Collins,et al.  Design and construction of a realistic digital brain phantom , 1998, IEEE Transactions on Medical Imaging.

[29]  Ignazio Gallo,et al.  A neural model for fuzzy Dempster-Shafer classifiers , 2000, Int. J. Approx. Reason..

[30]  G. Klir,et al.  PROBABILITY-POSSIBILITY TRANSFORMATIONS: A COMPARISON , 1992 .

[31]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[32]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[33]  Otman A. Basir,et al.  A scheme for constructing evidence structures in Dempster-Shafer evidence theory for data fusion , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).

[34]  Hongwei Zhu,et al.  Data fusion for pattern classification via the Dempster-Shafer evidence theory , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[35]  Lorenzo Bruzzone,et al.  Combination of neural and statistical algorithms for supervised classification of remote-sensing image , 2000, Pattern Recognit. Lett..

[36]  Isabelle Bloch,et al.  Introduction of neighborhood information in evidence theory and application to data fusion of radar and optical images with partial cloud cover , 1998, Pattern Recognit..

[37]  Xin Yao,et al.  A constructive algorithm for training cooperative neural network ensembles , 2003, IEEE Trans. Neural Networks.

[38]  Thierry Denoeux,et al.  An evidence-theoretic k-NN rule with parameter optimization , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[39]  Isabelle Bloch,et al.  Some aspects of Dempster-Shafer evidence theory for classification of multi-modality medical images taking partial volume effect into account , 1996, Pattern Recognit. Lett..

[40]  Sebastiano B. Serpico,et al.  Classification of multisensor remote-sensing images by structured neural networks , 1995, IEEE Trans. Geosci. Remote. Sens..

[41]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[42]  D. Dubois,et al.  Unfair coins and necessity measures: Towards a possibilistic interpretation of histograms , 1983 .

[43]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .