Toward Automated Classroom Observation: Predicting Positive and Negative Climate

We devised and evaluated a multi-modal machine learning-based system to analyze videos of school classrooms for "positive climate" and "negative climate", which are two dimensions of the Classroom Assessment Scoring System (CLASS) [1]. School classrooms are highly cluttered audiovisual scenes containing many overlapping faces and voices. Due to the difficulty of labeling them (reliable coding requires weeks of training) and their sensitive nature (students and teachers may be in stressful or potentially embarrassing situations), CLASS- labeled classroom video datasets are scarce, and their labels are sparse (just a few labels per 15-minute video dip). Thus, the overarching challenge was how to harness modem deep perceptual architectures despite the paucity of labeled data. Through training low-level CNN-based facial attribute detectors (facial expression & adult/child) as well as a direct audio-to- climate regressor, and by integrating low-level information over time using a Bi-LSTM, we constructed automated detectors of positive and negative classroom climate with accuracy (10- fold cross-validation Pearson correlation on 241 CLASS-labeled videos) of 0.40 and 0.51, respectively. These numbers are superior to what we obtained using shallower architectures. This work represents the first automated system designed to detect specific dimensions of the CLASS.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Charlotte F. Danielson Enhancing Professional Practice: A Framework for Teaching , 1996 .

[3]  Andrew D. Ho,et al.  The Reliability of Classroom Observations by School Personnel. Research Paper. MET Project. , 2013 .

[4]  J. Belsky,et al.  Do effects of early child care extend to age 15 years? Results from the NICHD study of early child care and youth development. , 2010, Child development.

[5]  Frank A. Russo,et al.  Acoustic differences in the speaking and singing voice , 2013 .

[6]  Kasia Muldner,et al.  Emotion Sensors Go To School , 2009, AIED.

[7]  Ryan Shaun Joazeiro de Baker,et al.  Automatic Detection of Learning-Centered Affective States in the Wild , 2015, IUI.

[8]  Ashish Kapoor,et al.  Automatic prediction of frustration , 2007, Int. J. Hum. Comput. Stud..

[9]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10]  Kaiming He,et al.  Exploring the Limits of Weakly Supervised Pretraining , 2018, ECCV.

[11]  Peter A. Beling,et al.  Classroom Video Assessment and Retrieval via Multiple Instance Learning , 2011, AIED.

[12]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Andrew J. Mashburn,et al.  Measures of classroom quality in prekindergarten and children's development of academic, language, and social skills. , 2008, Child development.

[14]  Megan M. McClelland,et al.  A structured observation of behavioral self-regulation and its contribution to kindergarten outcomes. , 2009, Developmental psychology.

[15]  Huaizu Jiang,et al.  Face Detection with the Faster R-CNN , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[16]  Javier R. Movellan,et al.  The Faces of Engagement: Automatic Recognition of Student Engagementfrom Facial Expressions , 2014, IEEE Transactions on Affective Computing.

[17]  Aren Jansen,et al.  Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  S. Kontos,et al.  Teachers' Interactions with Children: Why Are They So Important? Research in Review. , 1997 .

[20]  E. Peisner-Feinberg,et al.  The relation of preschool child-care quality to children's cognitive and social developmental trajectories through second grade. , 2001, Child development.

[21]  Aren Jansen,et al.  CNN architectures for large-scale audio classification , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[23]  Daniel F. McCaffrey,et al.  Have We Identified Effective Teachers? Validating Measures of Effective Teaching Using Random Assignment. Research Paper. MET Project. , 2013 .

[24]  Colin Raffel,et al.  librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[25]  Arthur C. Graesser,et al.  Toward an Affect-Sensitive AutoTutor , 2007, IEEE Intelligent Systems.

[26]  Neil T. Heffernan,et al.  Improving Sensor-Free Affect Detection Using Deep Learning , 2017, AIED.

[27]  Peter Robinson,et al.  OpenFace: An open source facial behavior analysis toolkit , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[28]  Vincent Aleven,et al.  Towards Sensor-Free Affect Detection in Cognitive Tutor Algebra. , 2012, EDM 2012.

[29]  L. Hedges,et al.  How Large Are Teacher Effects? , 2004 .

[30]  Kristy Elizabeth Boyer,et al.  Automatically Recognizing Facial Expression: Predicting Engagement and Frustration , 2013, EDM.

[31]  Hatice Gunes,et al.  SmileNet: Registration-Free Smiling Face Detection In The Wild , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[32]  Xingyu Pan,et al.  Automatic classification of activities in classroom discourse , 2014, Comput. Educ..

[33]  Zachary A. Pardos,et al.  Affective states and state tests: investigating how affect throughout the school year predicts end of year learning outcomes , 2013, LAK '13.

[34]  Emily Mower Provost,et al.  Predicting Emotion Perception Across Domains: A Study of Singing and Speaking , 2015, AAAI.

[35]  Andrew Olney,et al.  Multi-sensor modeling of teacher instructional segments in live classrooms , 2016, ICMI.

[36]  Andrew Olney,et al.  Multimodal Capture of Teacher-Student Interactions for Automated Dialogic Analysis in Live Classrooms , 2015, ICMI.

[37]  B. Hamre,et al.  Classroom assessment scoring system. , 2008 .

[38]  Mohammad H. Mahoor,et al.  AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild , 2017, IEEE Transactions on Affective Computing.

[39]  Development of the UTeach Observation Protocol : A Classroom Observation Instrument to Evaluate Mathematics and Science Teachers from the UTeach Preparation Program , 2022 .