VISA: A Supervised Approach to Indexing Video Lectures with Semantic Annotations

Many universities adopt educational systems where the teacher lecture is video recorded and the video lecture is made available to students with minimum post-processing effort. These cost-effective solutions suffer from the limited amount of annotations associated with the video content, which strongly limits the usability of the service when students need to retrieve specific portions of video, e.g., to revise unclear aspects covered in the past lectures. This paper presents, as a real case study, the system developed and implemented in our university for video lecture annotation and indexing. The original video recordings, which last around 1.5 hour, are first partitioned into smaller segments and then annotated by mapping their content with the entities in a multilingual knowledge base. To this purpose, the proposed approach analyzes both the transcription of the teacher's speech and the text appearing in the video (e.g., the slide content, the note written on the whiteboard) by means of an ad hoc Named Entity Recognition and Disambiguation (NERD) step. NERD relies on a supervised classification approach tailored to the domain under analysis. More specifically, to identify the most salient entities of the knowledge base matching the video content it considers not only text similarity measures but also the semantic pertinence of the candidate entities to the main subject of the video lectures. The performance of the proposed system was validated on a ground truth against the techniques available in the general entity annotation system GERBIL. The preliminary results demonstrate the effectiveness of the proposed approach.

[1]  Axel-Cyrille Ngonga Ngomo,et al.  GERBIL - Benchmarking Named Entity Recognition and Linking consistently , 2017, Semantic Web.

[2]  Igor Malioutov,et al.  Minimum Cut Model for Spoken Lecture Segmentation , 2006, ACL.

[3]  James R. Glass,et al.  Recent progress in the MIT spoken lecture processing project , 2007, INTERSPEECH.

[4]  Takako Akakura,et al.  Video Bookmarking for Learner Support in Blended Learning: Selection of Appropriate Keywords for Efficient Review of Lecture Video , 2011, 2011 IEEE 11th International Conference on Advanced Learning Technologies.

[5]  Maryam Habibi,et al.  Multi-factor segmentation for topic visualization and recommendation: the MUST-VIS system , 2013, ACM Multimedia.

[6]  Christoph Meinel,et al.  Enhance learning in a video lecture archive with annotations , 2018, 2018 IEEE Global Engineering Education Conference (EDUCON).

[7]  M. Muhamad,et al.  Online video lecture series for digital logic fundamental courses blended learning , 2017, 2017 IEEE 9th International Conference on Engineering Education (ICEED).

[8]  Edmundo Tovar Caro,et al.  From Higher Education to Open Education: Challenges in the Transformation of an Online Traditional Course , 2017, IEEE Trans. Educ..

[9]  Greg C. Lee,et al.  Supporting In-Class Learning with Asynchronous and Autonomous Viewing of Near Real-Time Lecture Videos , 2014, 2014 International Conference on Teaching and Learning in Computing and Engineering.

[10]  Takayuki Nagai,et al.  Implementation of high-definition lecture recording system for daily use , 2013, 2013 IEEE Global Engineering Education Conference (EDUCON).

[11]  René F. Kizilcec,et al.  Showing face in video instruction: effects on information retention, visual attention, and affect , 2014, CHI.

[12]  Kamal Bijlani,et al.  Pedagogy Experiments with Recorded Video Lectures , 2014, 2014 IEEE Sixth International Conference on Technology for Education.

[13]  Qiang Ji,et al.  Video Affective Content Analysis: A Survey of State-of-the-Art Methods , 2015, IEEE Transactions on Affective Computing.

[14]  Mounir Zrigui,et al.  A Framework for Semantic Video Content Indexing Using Textual Information , 2018, 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP).

[15]  Wei JiuHong,et al.  Advantages and Deficiencies of the Automated Multimedia Lecture Recording System in Lecture Video Production , 2009, 2009 International Forum on Computer Science-Technology and Applications.

[16]  Inad Aljarrah,et al.  Video content analysis using convolutional neural networks , 2018, 2018 9th International Conference on Information and Communication Systems (ICICS).

[17]  Silvia Mirri,et al.  Topic-based playlist to improve video lecture accessibility , 2018, 2018 15th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[18]  Chung-Lin Huang,et al.  Content-based multi-functional video retrieval system , 2005, 2005 Digest of Technical Papers. International Conference on Consumer Electronics, 2005. ICCE..

[19]  Yi Yu,et al.  TRACE: Linguistic-Based Approach for Automatic Lecture Video Segmentation Leveraging Wikipedia Texts , 2015, 2015 IEEE International Symposium on Multimedia (ISM).

[20]  Amrit Priyadarshi,et al.  An approach for automated video indexing and video search in large lecture video archives , 2015, 2015 International Conference on Pervasive Computing (ICPC).

[21]  Sridhar Iyer,et al.  Automated Tagging to Enable Fine-Grained Browsing of Lecture Videos , 2011, 2011 IEEE International Conference on Technology for Education.

[22]  Judy Kay,et al.  MOOCs: So Many Learners, So Much Potential ... , 2013, IEEE Intelligent Systems.

[23]  Luca Cagliero,et al.  Experimental Validation of a Massive Educational Service in a Blended Learning Environment , 2017, 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC).

[24]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[25]  Kenzi Watanabe,et al.  Development of Lecture Videos Delivery System using HTML5 Video Element , 2013, 2013 Eighth International Conference on Broadband and Wireless Computing, Communication and Applications.

[26]  Christoph Meinel,et al.  Automatic Lecture Video Indexing Using Video OCR Technology , 2011, 2011 IEEE International Symposium on Multimedia.

[27]  James R. Glass,et al.  Automatic processing of audio lectures for information retrieval: vocabulary selection and language modeling , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[28]  Jaspal Subhlok,et al.  Indexing and keyword search to ease navigation in lecture videos , 2011, 2011 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).

[29]  Christoph Meinel,et al.  Improving access to online lecture videos , 2018, 2018 IEEE Global Engineering Education Conference (EDUCON).

[30]  A. Damodar,et al.  Automatic keyphrase extraction and segmentation of video lectures , 2012, 2012 IEEE International Conference on Technology Enhanced Education (ICTEE).

[31]  Maxime Pedrotti,et al.  Online Lecture Videos in Higher Education: Acceptance and Motivation Effects on Students' System Use , 2014, 2014 IEEE 14th International Conference on Advanced Learning Technologies.

[32]  Huang-Chia Shih,et al.  A Survey of Content-Aware Video Analysis for Sports , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Christoph Meinel,et al.  Content Based Lecture Video Retrieval Using Speech and Video Text Information , 2014, IEEE Transactions on Learning Technologies.

[34]  Sanjay Goel,et al.  LectureKhoj: Automatic tagging and semantic segmentation of online lecture videos , 2014, 2014 Seventh International Conference on Contemporary Computing (IC3).