An efficient FPGA implementation of Gaussian mixture models based classifier: Application to face recognition

This work aims to propose an efficient hardware/software system fo guassian mixture model (GMM) parts-based topology modeling for face identification and verification. Following its great success in speaker recognition, The GMM approach was extended to face recognition providing a good trade-off in terms of complexity, performance and robustness. Despite its reduced complexity compared to other statistical modeling techniques like hiden markov model (HMM) and its variants. The GMM scoring module still to be computationally intensive algorithm consisting of a series of complex tasks executed in sequential order. This constraint limits its suitability for real-time pattern recognition embedded applications. This paper presents an efficient hardware implementation of embedded GMM based classifier. Reconfigurable system in the form of field programmable gate arrays (FPGA) is deployed to embed the hardware part of the proposed system. Furthermore a design of exponential calculation circuit is proposed for the best compromise between effectiveness and complexity. Approximations are also developed to reduce the hardware complexity. The developed system performs the identification process of an unknown input pattern over 200 models in 2.3 seconds, our performance evaluation indicates that a speedup of around S.IX can be achieved over an optimized software implementation running on a 3.3GHz core i3 processor. A results precision of 10-2 is obtained after performing the GMM calculation using the proposed hardware/software system.

[1]  Masahiko Yoshimoto,et al.  A low memory bandwidth Gaussian mixture model (GMM) processor for 20,000-word real-time speech recognition FPGA system , 2008, 2008 International Conference on Field-Programmable Technology.

[2]  Ki-Hyun Kim,et al.  Design of Embedded Multimodal Biometric Systems , 2007, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System.

[3]  Zoran Salcic,et al.  Speech recognition system for embedded real-time applications , 2009, 2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[4]  R Togneri,et al.  An Overview of Speaker Identification: Accuracy and Robustness Issues , 2011, IEEE Circuits and Systems Magazine.

[5]  Sébastien Marcel,et al.  Cross-Pollination of Normalization Techniques From Speaker to Face Authentication Using Gaussian Mixture Models , 2012, IEEE Transactions on Information Forensics and Security.

[6]  Amine Bermak,et al.  An Efficient Digital VLSI Implementation of , 2006 .

[7]  Wei Wang,et al.  A reconfigurable co-processor for GMM-based classifier , 2009, 2009 IEEE International SOC Conference (SOCC).

[8]  John H. L. Hansen,et al.  Feature compensation employing online GMM adaptation for speech recognition in unknown severely adverse environments , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Samy Bengio,et al.  User authentication via adapted statistical models of face images , 2006, IEEE Transactions on Signal Processing.

[10]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[11]  Mariano Fons,et al.  Real-time embedded systems powered by FPGA dynamic partial self-reconfiguration: a case study oriented to biometric recognition applications , 2010, Journal of Real-Time Image Processing.

[12]  S. Mitra,et al.  Gaussian Mixture Models based on the Phase Spectra for Illumination Invariant Face Identification on the Yale Database , 2007, 2007 First IEEE International Conference on Biometrics: Theory, Applications, and Systems.