Transfer Learning to Timed Text Based Video Classification Using CNN

Open educational video resources are gaining popularity with a growing number of massive open online courses (MOOCs). This has created a niche for content providers to adopt effective solutions in automatically organizing and structuring of educational resources for maximum visibility. Recent advances in deep learning techniques are proving useful in managing and classifying resources into appropriate categories. This paper proposes one such convolutional neural network (CNN) model for classifying video lectures in a MOOC setting using a transfer learning approach. The model uses a time-aligned text transcripts corresponding to video lectures from six broader subject categories. Video lectures and their corresponding transcript dataset is gathered from the Coursera MOOC platform. Two different CNN models are proposed: i) CNN based classification using embeddings learned from our MOOC dataset, ii) CNN based classification using transfer learning. Word embeddings generated from two well known state-of-the-art pre-trained models Word2Vec and GloVe, are used in the transfer learning approach for the second case. The proposed CNN models are evaluated using precision, recall, and F1 score and the obtained performance is compared with both conventional and deep learning classifiers. The proposed CNN models have an F1 score improvement of 10-22 percentage points over DNN and conventional classifiers

[1]  Philip J. Guo,et al.  How video production affects student engagement: an empirical study of MOOC videos , 2014, L@S.

[2]  Lule Ahmedi,et al.  Rethinking the conventional learning paradigm towards MOOC based flipped classroom learning , 2017, 2017 16th International Conference on Information Technology Based Higher Education and Training (ITHET).

[3]  Ilja Kuzborskij,et al.  Stability and Hypothesis Transfer Learning , 2013, ICML.

[4]  Jiang Zhou,et al.  Automatic MOOC Video Classification using Transcript Features and Convolutional Neural Networks , 2017, MultiEdTech@MM.

[5]  Diego Reforgiato Recupero,et al.  Bridging learning analytics and Cognitive Computing for Big Data classification in micro-learning video collections , 2019, Comput. Hum. Behav..

[6]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[7]  Sule Yildirim Yayilgan,et al.  A General Framework for Text Document Classification Using SEMCON and ACVSR , 2015, HCI.

[8]  Stewart Kowalski,et al.  Automatic annotation of lecture videos for multimedia driven pedagogical platforms , 2016 .

[9]  Hsinchun Chen,et al.  Text-based video content classification for online video-sharing sites , 2010 .

[10]  Jaber El Bouhdidi,et al.  Education data mining: Mining MOOCs videos using metadata based approach , 2016, 2016 4th IEEE International Colloquium on Information Science and Technology (CiSt).

[11]  Zenun Kastrati,et al.  Document image classification using SEMCON , 2015, 2015 20th Symposium on Signal Processing, Images and Computer Vision (STSIVA).

[12]  Zenun Kastrati,et al.  Pedagogical Document Classification and Organization Using Domain Ontology , 2016, HCI.

[13]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[14]  Guodong Zhao,et al.  Microlessons in Chinese Universities: Concepts, Technology, and Case Analyses , 2016, ICBL.

[15]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[16]  Mostafa Keikha,et al.  Rich document representation and classification: An analysis , 2009, Knowl. Based Syst..

[17]  James Bret Michael,et al.  A Nonlinear Perspective on Higher Education , 2010, Computer.

[18]  Abderrahim Ghadi,et al.  Mining Moocs Videos Metadata Using Classification Techniques , 2017, BDCA'17.