Foveated Vision for Biologically Inspired Continuous Face Authentication

In everyday life whenever people observe, interact or speak to each other, visual attention is mostly directed toward the other person’s face, particularly to the eyes and the nearby periocular regions. This is naturally reflected when the user interacts with their mobile phones in several usual activities, such as web access, payments and video calls. For this reason, the functionality of mobile devices is strongly affected by the design of the user interface. In this chapter, we propose a biologically inspired approach for continuous user authentication based on the analysis of the ocular regions. The proposed system is based on a modified version of the HMAX visual processing module. HMAX is a hierarchical model which has been conceived to mimic the basic neural architecture of the ventral stream of the visual cortex. The original HMAX model consists of four layers: S1, C1, S2 and C2. S1 and C1 represent the responses to a bank of orientation-selective Gabor filters. S2 and C2 represent the responses of simple and complex cells to other textural features. The discrimination power of HMAX in recognizing classes of objects is invariant to rotation and scale. The C1 layer, which is mainly responsible for the scale and rotation invariance, is implemented using a max-pooling operation, which may lose some spatial information. To overcome this problem while preserving the maximal visual acuity and hence the localization accuracy, we propose to augment the model by applying a retinal log-polar mapping. The log-polar mapping is an approximation of the retino-cortical mapping that is performed by the early stages of the primate visual system. Due to the high density of the cones in the fovea, the log-polar approximation of the space-variant distribution model of the photoreceptors can only be applied outside the foveal region. Therefore, the log-polar mapping is added to the HMAX model as a complementary stage to process the peripheral region of the grabbed images. In order to demonstrate the feasibility of the proposed approach to mobile scenarios, experimental results obtained from publicly available databases and image streams grabbed from mobile devices will be presented.

[1]  Kostas Daniilidis,et al.  Attentive Visual Motion Processing: Computations in the Log-Polar Plane , 1994, Theoretical Foundations of Computer Vision.

[2]  Xiaogang Wang,et al.  Deep Learning Face Representation from Predicting 10,000 Classes , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Peter J. Burt,et al.  `Smart Sensing' in machine vision , 1988 .

[4]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Eric L. Schwartz,et al.  Anatomical and physiological correlates of visual computation from striate to infero-temporal cortex , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[6]  Ajita Rattani,et al.  A Survey Of mobile face biometrics , 2018, Comput. Electr. Eng..

[7]  Ali Motie Nasrabadi,et al.  C3 Effective features inspired from Ventral and dorsal stream of visual cortex for view independent face recognition , 2016 .

[8]  Joel Z. Leibo,et al.  Learning invariant representations and applications to face verification , 2013, NIPS.

[9]  Giulio Sandini,et al.  Vision and Space-Variant Sensing , 1992 .

[10]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Giulio Sandini,et al.  "Form-invariant" topological mapping strategy for 2D shape recognition , 1985, Comput. Vis. Graph. Image Process..

[12]  Shengcai Liao,et al.  Partial Face Recognition: Alignment-Free Approach , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Claus Nebauer,et al.  Evaluation of convolutional neural networks for visual recognition , 1998, IEEE Trans. Neural Networks.

[15]  Jiwen Lu,et al.  Robust Point Set Matching for Partial Face Recognition , 2016, IEEE Transactions on Image Processing.

[16]  David G. Lowe,et al.  University of British Columbia. , 1945, Canadian Medical Association journal.

[17]  Xiaolin Hu,et al.  Sparsity-Regularized HMAX for Visual Recognition , 2014, PloS one.

[18]  Andrea Lagorio,et al.  Towards practical space-variant based face recognition and authentication , 2014, 2nd International Workshop on Biometrics and Forensics.

[19]  Rama Chellappa,et al.  Face-based Active Authentication on mobile devices , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).