CNNs and Transfer Learning for Lecture Venue Occupancy and Student Attention Monitoring

Lower student success rates in higher education might, in some case, be due to the unprecedented increase of student numbers without a comparable increase in resources and funding. This paper proposes a face-based detection system to monitor occupancy and student attention in crowded classroom using state of the art deep Convolutional Neural Networks (CNN) architectures. The aim of the proposed system is to contribute to the increase of subject success rates by monitoring attendance and attention. The system utilizes a two-phased approach: The first phase determines the number of student faces in an image frame. The Haar Cascade, LBP, HOG, Resnet CNN, TinyFace CNN, and SSD were compared to determine the algorithm best suited to the detection of faces in crowded classroom scenes. In phase two, the orientations of the faces are determined using transfer learning. Faces are classified as “right”, “left”, or at the “center”. This information is displayed on an augmented reality display to provide feedback to lecturers in semi real-time. It is hoped that this will assist lecturers to address problems related to student attention in crowded classrooms.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[3]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Peter Peer,et al.  Strategies for Exploiting Independent Cloud Implementations of Biometric Experts in Multibiometric Scenarios , 2014, Mathematical Problems in Engineering.

[7]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Peter Peer,et al.  Human Skeleton Model Based Dynamic Features for Walking Speed Invariant Gait Recognition , 2014 .

[10]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[12]  Jürgen Schmidhuber,et al.  Multi-column deep neural network for traffic sign classification , 2012, Neural Networks.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15]  Lina J. Karam,et al.  A Study and Comparison of Human and Deep Learning Recognition Performance under Visual Distortions , 2017, 2017 26th International Conference on Computer Communication and Networks (ICCCN).

[16]  Peiyun Hu,et al.  Finding Tiny Faces , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Haidawati Nasir,et al.  A comparative study between LBP and Haar-like features for Face Detection using OpenCV , 2014, 2014 4th International Conference on Engineering Technology and Technopreneuship (ICE2T).

[18]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[20]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.