ATL-BP: A Student Engagement Dataset and Model for Affect Transfer Learning for Behavior Prediction

We propose a video-based transfer learning approach that predicts the problem outcomes of students working with an intelligent tutoring system (ITS) by analyzing their faces and gestures. Predicting such outcomes lets a tutoring system adjust its interventions and, ultimately, improve student learning. We collected and released a labeled dataset of 2,749 problem-solving interaction samples from 54 students working with an intelligent online math tutor. The transfer-learning challenge was to learn a facial expression representation in the source domain of images obtained from the Internet, and to transfer that representation to human behavior prediction in the target domain of webcam videos of students in a classroom environment. We developed a novel facial affect representation and a user-personalized training scheme that unlocks the potential of this representation. We designed several variants of a recurrent neural network that models the temporal structure of video sequences. Our final model, named ATL-BP for Affect Transfer Learning for Behavior Prediction, achieves a 50% relative increase in mean F-score over the state-of-the-art method on this new dataset. We also propose an additional set of annotations for students' engagement while solving a specific problem, and present models that predict such engagement.
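The temporal modeling step can be illustrated with a minimal sketch. It assumes that a frozen, pretrained affect encoder has already turned each video frame into a small feature vector, and it uses a plain tanh recurrent cell with untrained random weights as a stand-in for the recurrent variants mentioned above. The class name `OutcomeRNN`, the feature size, the frame count, and the number of outcome classes are all hypothetical choices for illustration, not values taken from the paper.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def random_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; a trained model would learn these instead."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class OutcomeRNN:
    """Plain tanh RNN over per-frame affect features (hypothetical stand-in)."""

    def __init__(self, feature_dim, hidden_dim, num_outcomes, seed=0):
        rng = random.Random(seed)
        self.w_in = random_matrix(hidden_dim, feature_dim, rng)
        self.w_rec = random_matrix(hidden_dim, hidden_dim, rng)
        self.w_out = random_matrix(num_outcomes, hidden_dim, rng)
        self.hidden_dim = hidden_dim

    def predict(self, frame_features):
        """Return outcome probabilities for one video (a list of feature vectors)."""
        hidden = [0.0] * self.hidden_dim
        for features in frame_features:
            drive = matvec(self.w_in, features)    # contribution of the current frame
            recur = matvec(self.w_rec, hidden)     # contribution of earlier frames
            hidden = [math.tanh(d + r) for d, r in zip(drive, recur)]
        # Classify the whole sequence from the final hidden state.
        return softmax(matvec(self.w_out, hidden))


# Assumed sizes: 8-dimensional affect features, 30 frames, 4 outcome classes.
rng = random.Random(1)
video = [[rng.random() for _ in range(8)] for _ in range(30)]
model = OutcomeRNN(feature_dim=8, hidden_dim=16, num_outcomes=4)
probabilities = model.predict(video)
print(probabilities)
```

In this sketch only the recurrent and output weights would be trained on the tutoring videos, while the affect encoder that produces the per-frame features stays fixed. That split is one common way to realize transfer from a source image domain to a target video domain; the paper's actual training scheme may differ.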
