DiAd: Domain Adaptation for Learning at Scale

Massive online courses occupy an important place in the educational landscape of today. We study an approach to scale predictive analytic models derived from online course discussion fora--specifically that of confusion detection--onto other courses. The primary challenge here is the lack of labeled examples in a new course and this calls for unsupervised domain adaptation (DA). As a first step in exploring DA in the education domain, we propose a simple algorithm, DiAd, which adapts a classifier trained on a course with labeled data by selectively choosing instances from a new course (with no labeled data) that are most dissimilar to the course with labeled data and on which the classifier is very confident of classification. Our algorithm is empirically validated on the confusion detection task across multiple online courses. We find that DiAd outperforms other methods on the target domain, while showing a comparable performance to a popular method that uses labeled data from the target domain.

[1]  Theodore J. Kopcha,et al.  Exploring College Students' Online Help-Seeking Behavior in a Flipped Classroom with a Web-Based Help-Seeking Tool. , 2015 .

[2]  George Siemens,et al.  Towards automated content analysis of discussion transcripts: a cognitive presence case , 2016, LAK.

[3]  Geert-Jan Houben,et al.  Follow the successful crowd: raising MOOC completion rates through social comparison at scale , 2017, LAK.

[4]  Shane Dawson,et al.  Social Presence in Massive Open Online Courses , 2018, The International Review of Research in Open and Distributed Learning.

[5]  Sean Burns,et al.  A generalized classifier to identify online learning tool disengagement at scale , 2018, LAK.

[6]  Dan Goldwasser,et al.  Predicting Instructor’s Intervention in MOOC forums , 2014, ACL.

[7]  Marek Hatala,et al.  Social presence in online discussions as a process predictor of academic performance , 2015, J. Comput. Assist. Learn..

[8]  Eric P. Xing,et al.  Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2014, ACL 2014.

[9]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[10]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[11]  Lise Getoor,et al.  Learning Latent Engagement Patterns of Students in Online Courses , 2014, AAAI.

[12]  Carolyn Penstein Rosé,et al.  Sentiment Analysis in MOOC Discussion Forums: What does it tell us? , 2014, EDM.

[13]  Carolyn Penstein Rosé,et al.  Forum Thread Recommendation for Massive Open Online Courses , 2014, EDM.

[14]  Jihie Kim,et al.  Capturing Difficulty Expressions in Student Online Q&A Discussions , 2014, AAAI.

[15]  Shourya Roy,et al.  An Iterative Similarity based Adaptation Technique for Cross-domain Text Classification , 2015, CoNLL.

[16]  Sherif Halawa,et al.  Attrition and Achievement Gaps in Online Learning , 2015, L@S.

[17]  Shane Dawson,et al.  Are MOOC forums changing? , 2018, LAK.

[18]  Lise Getoor,et al.  Understanding MOOC Discussion Forums using Seeded LDA , 2014, BEA@ACL.

[19]  Bart Rienties,et al.  Linking students' timing of engagement to learning design and academic performance , 2018, LAK.

[20]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[21]  Andreas Paepcke,et al.  YouEDU: Addressing Confusion in MOOC Discussion Forums by Recommending Instructional Video Clips , 2015, EDM.

[22]  Jianfei Yu,et al.  A Hassle-Free Unsupervised Domain Adaptation Method Using Instance Similarity Features , 2015, ACL.

[23]  Snigdha Chaturvedi,et al.  Learner Affect Through the Looking Glass: Characterization and Detection of Confusion in Online Courses , 2017, EDM.

[24]  Carolyn Penstein Rosé,et al.  Investigating How Student's Cognitive Behavior in MOOC Discussion Forum Affect Learning Gains , 2015, EDM.