Speakers' Language Characteristics Analysis of Online Educational Videos

Research on educational videos, together with the contribution of data mining to education, can affect instructors’ approach to learning. This study focuses on online educational videos and, more specifically, on their speakers. First, a survey of the popularity of educational videos on YouTube is conducted, and the videos are divided into two categories: the more popular and the less popular. Language-related characteristics are then extracted from the speakers' transcripts and, after a clustering procedure, the differences between the two categories are reported. The speakers of the popular videos show distinctive language characteristics: their pace of speaking is faster and the complexity of their sentences is higher than in the less popular videos.
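The abstract does not state how pace or sentence complexity are measured, so the following Python sketch is only an illustration under assumptions. It takes pace to be words per minute and uses words per sentence as a crude stand-in for sentence complexity. The function names, the dictionary keys, and the `popular` flag are hypothetical, and the clustering step of the study is not reproduced here; the sketch only averages the two features within each popularity category.

```python
import re
from statistics import mean


def language_features(transcript: str, duration_seconds: float) -> dict:
    """Compute two assumed language features for one video transcript."""
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", transcript) if s.strip()]
    # Count runs of letters (and apostrophes) as words.
    words = re.findall(r"[A-Za-z']+", transcript)
    if not sentences or duration_seconds <= 0:
        raise ValueError("need a non-empty transcript and a positive duration")
    return {
        # Assumed definition of speaking pace.
        "words_per_minute": len(words) / (duration_seconds / 60.0),
        # Assumed proxy for sentence complexity.
        "words_per_sentence": len(words) / len(sentences),
    }


def compare_groups(videos: list) -> dict:
    """Average each feature separately for popular and less popular videos.

    Each video is a dict with the hypothetical keys
    'popular' (bool), 'transcript' (str) and 'duration_seconds' (float).
    """
    result = {}
    for flag in (True, False):
        feats = [
            language_features(v["transcript"], v["duration_seconds"])
            for v in videos
            if v["popular"] is flag
        ]
        if feats:
            result[flag] = {
                key: mean(f[key] for f in feats) for key in feats[0]
            }
    return result
```

Calling `compare_groups` on a labeled collection of transcripts would give one pair of averages per category, which could then be compared in the way the abstract describes.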
