Novel Real-time Face Recognition from Video Streams

Real-time recognition from video streams is a very challenging task due to background, facial expressions, and lighting differences. Recent studies show that deep learning approaches can achieve impressive performance on this task. Our system solves these problems well using deep learning method. It contains face detection module and face recognition module. Our face detection module is based on MTCNN, which is very fast and accurate, and is robust enough for changes in lighting differences and background. Our face recognition module is based on FaceNet, which directly learns mapping the face images to the points in Euclidean space, where the distance of two points in Euclidean space directly corresponds to how similar two face images are. Once such Euclidean space is created, we transform the face images into the FaceNet embedding, as the feature vectors for face images. Next, we put the feature vectors into an SVM classifier to help us quickly identify the face images. The experimental results show that the system has very high accuracy and low computational complexity, which is the key to real-time face recognition, and has significant value in application.

[1]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Xiaogang Wang,et al.  Deeply learned face representations are sparse, selective, and robust , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Yanyan Huang,et al.  Real-Time Face Detection and Recognition for Video Surveillance Applications , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[4]  Luis Torres,et al.  Automatic face recognition for video indexing applications , 2002, Pattern Recognit..

[5]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[7]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[8]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.