Model level fusion of edge histogram descriptors and gabor wavelets for landmine detection with ground penetrating radar

We propose a discriminative method for combining heterogeneous sets of features for the continuous hidden Markov model classifier. We use a model level fusion approach and apply it to the problem of landmine detection using ground penetrating radar (GPR). We hypothesize that each signature (mine or non-mine) can be characterized better by multiple synchronous sequences that can capture different and complementary features. Our work is motivated by the fact that mines and clutter objects can have different characteristics depending on the mine type, soil and weather conditions, and burial depth. Thus, different sets of specialized feature extraction mechanisms, may be needed to achieve high detection and low false alarm rates. In order to fuse the different modalities, a multi-stream continuous HMM that includes a stream relevance weighting component is developed. In particular, we modify the probability density function that characterizes the standard continuousHMMto include state and component dependent stream relevance weights. We generalize the Minimum Classification Error (MCE) objective function to include stream relevance weights and derive the necessary conditions to update all model parameters simultaneously. Results on a large collection of GPR alarms show that the proposed model level fusion outperforms the baseline HMM when each feature is used independently and when both features are combined with equal weights.

[1]  Paul D. Gader,et al.  Landmine detection with ground penetrating radar using hidden Markov models , 2001, IEEE Trans. Geosci. Remote. Sens..

[2]  Paul D. Gader,et al.  Detection and Discrimination of Land Mines in Ground-Penetrating Radar Based on Edge Histogram Descriptors and a Possibilistic K-Nearest , 2009 .

[3]  Juergen Luettin,et al.  Audio-Visual Speech Modeling for Continuous Speech Recognition , 2000, IEEE Trans. Multim..

[4]  P.A. Torrione,et al.  Performance of an adaptive feature-based processor for a wideband ground penetrating radar system , 2006, IEEE Transactions on Aerospace and Electronic Systems.

[5]  Juergen Luettin,et al.  Audio-Visual Automatic Speech Recognition: An Overview , 2004 .

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Shigeru Katagiri,et al.  A derivation of minimum classification error from the theoretical classification risk using Parzen estimation , 2004, Comput. Speech Lang..

[8]  Ara V. Nefian,et al.  A Bayesian Approach to Audio-Visual Speaker Identification , 2003, AVBPA.

[9]  Biing-Hwang Juang,et al.  Minimum classification error rate methods for speech recognition , 1997, IEEE Trans. Speech Audio Process..

[10]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[11]  Paul D. Gader,et al.  Landmine detection using mixture of discrete hidden Markov models , 2009, Defense + Commercial Sensing.

[12]  Shaogang Gong,et al.  Audio- and Video-based Biometric Person Authentication , 1997, Lecture Notes in Computer Science.

[13]  Paul D. Gader,et al.  Detection and Discrimination of Land Mines in Ground-Penetrating Radar Based on Edge Histogram Descriptors and a Possibilistic $K$-Nearest Neighbor Classifier , 2009, IEEE Transactions on Fuzzy Systems.

[14]  Ioannis Pitas,et al.  Multimodal decision-level fusion for person authentication , 1999, IEEE Trans. Syst. Man Cybern. Part A.

[15]  Hichem Frigui,et al.  Optimal feature weighting for the continuous HMM , 2008, 2008 19th International Conference on Pattern Recognition.