Spatial Gaussian Mixture Model for gender recognition

Patch-based approaches have become popular in many computer vision applications over recent years. An intrinsic flaw of this framework, missing of the spatial information, however, restricts its usage in face related applications where the spatial configuration is relatively settled. In this paper, we introduce a new patch feature representation, namely Spatial Gaussian Mixture Models (SGMM), which enhances the traditional GMM approach by taking the spatial information into consideration at both local and global scales. In the meantime, SGMM inherits all the merits of GMM, such as precise appearance description and robustness to image misalignment. The experiments on gender recognition demonstrate that the SGMM representation achieves more than 40% relative error reduction compared with either GMM or SVM-based approaches.

[1]  Biing-Hwang Juang,et al.  A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[2]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Shinichi Tamura,et al.  Male/female identification from 8×6 very low resolution face images by neural network , 1996, Pattern Recognit..

[4]  Florent Perronnin,et al.  A similarity measure between unordered vector sets with application to image categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Shumeet Baluja,et al.  Boosting Sex Identification Performance , 2005, International Journal of Computer Vision.

[6]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[7]  Ming Liu,et al.  Regression from patch-kernel , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Tsuhan Chen,et al.  A GMM parts based face representation for improved verification through relevance adaptation , 2004, CVPR 2004.

[9]  Shuicheng Yan,et al.  SIFT-Bag kernel for video event analysis , 2008, ACM Multimedia.

[10]  Ming-Hsuan Yang,et al.  Learning Gender with Support Faces , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Xun Xu,et al.  SODA-Boosting and Its Application to Gender Recognition , 2007, AMFG.

[12]  Volkan Atalay,et al.  PCA for gender estimation: which eigenvectors contribute? , 2002, Object recognition supported by user interaction for service robots.

[13]  Harry Wechsler,et al.  Mixture of experts for classification of gender, ethnic origin, and pose of human faces , 2000, IEEE Trans. Neural Networks Learn. Syst..

[14]  Bao-Liang Lu,et al.  Gender Recognition Using a Min-Max Modular Support Vector Machine , 2005, ICNC.

[15]  Nuno Vasconcelos,et al.  A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications , 2003, NIPS.