Privacy-Preserving Speech Analytics for Automatic Assessment of Student Collaboration

This work investigates whether nonlexical information from speech can automatically predict the quality of smallgroup collaborations. Audio was collected from students as they collaborated in groups of three to solve math problems. Experts in education annotated 30-second time windows by hand for collaboration quality. Speech activity features (computed at the group level) and spectral, temporal and prosodic features (extracted at the speaker level) were explored. After the latter were transformed from the speaker level to the group level, features were fused. Results using support vector machines and random forests show that feature fusion yields best classification performance. The corresponding unweighted average F1 measure on a 4-class prediction task ranges between 40% and 50%, significantly higher than chance (12%). Speech activity features alone are strong predictors of collaboration quality, achieving an F1 measure between 35% and 43%. Speaker-based acoustic features alone achieve lower classification performance, but offer value in fusion. These findings illustrate that the approach under study offers promise for future monitoring of group dynamics, and should be attractive for many collaboration activity settings in which privacy is desired.

[1]  Carolyn Penstein Rosé,et al.  Using Machine Learning Techniques to Analyze and Support Mediation of Student E-Discussions , 2007, AIED.

[2]  Gary W. Ladd,et al.  Grade-School Children’s Social Collaborative Skills , 2014 .

[3]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[4]  Dirk Heylen,et al.  Dominance Detection in Meetings Using Easily Obtainable Features , 2005, MLMI.

[5]  J. Koenig Assessing 21st Century Skills: Summary of a Workshop , 2011 .

[6]  Steven E. Stemler,et al.  An Overview of Content Analysis. , 2001 .

[7]  Carolyn Penstein Rosé,et al.  Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation , 2013, International Journal of Computer-Supported Collaborative Learning.

[8]  Carolyn Penstein Rosé,et al.  Analyzing collaborative learning processes automatically: Exploiting the advances of computational linguistics in computer-supported collaborative learning , 2008, Int. J. Comput. Support. Collab. Learn..

[9]  Jeroen Janssen,et al.  Automatic coding of dialogue acts in collaboration protocols , 2008, Int. J. Comput. Support. Collab. Learn..

[10]  Athanasios Katsamanis,et al.  Multi-band long-term signal variability features for robust voice activity detection , 2013, INTERSPEECH.

[11]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[12]  Colleen Richey,et al.  Spoken interaction modeling for automatic assessment of collaborative learning , 2016 .

[13]  E. Cohen Restructuring the Classroom: Conditions for Productive Small Groups , 1994 .

[14]  D. Kuhn Thinking Together and Alone , 2015 .

[15]  Shrikanth S. Narayanan,et al.  Robust Voice Activity Detection Using Long-Term Signal Variability , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Colleen Richey,et al.  The SRI Speech-Based Collaborative Learning Corpus , 2016, INTERSPEECH.

[18]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[19]  Marcelo Worsley,et al.  Towards the development of multimodal action based assessment , 2013, LAK '13.

[20]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[21]  Gerald Friedland,et al.  Estimating Dominance in Multi-Party Meetings Using Speaker Diarization , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Pulkit Agrawal,et al.  The automatic assessment of knowledge integration processes in project teams , 2011, CSCL.

[23]  Björn Schuller,et al.  Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.