Learning analytics techniques and visualisation with textual data for determining causes of academic failure

ABSTRACT The primary goal of higher education institutions is to support all students in the pursuit of academic success, which requires timely assistance for ‘at risk’ students. The adoption of learning management systems results in a large amount of data that can be collected, processed and utilised to improve the students’ learning experiences. This research examines the potential applications of analytics techniques for extracting insights from student-generated content in an academic setting. It showcases how different text analytics techniques, from descriptive content analysis, semantic network analysis, to topic modelling support the discovery of new insights from unstructured, user-generated data. We looked at 968 letters written by ‘at risk’ students in an Australian-based university in Southeast Asia to examine the difficulties the students faced, which led to their academic failure. The results show that time management, family, learning, assessment, and subjects were the leading causes of poor performance, but in a more nuanced way than was expected. Students often faced multiple challenges, one led to another, which resulted in the failing grades. Our study contributes a set of effective text analytics techniques for extracting insights from student data, providing the preliminary guidelines for an information system to detect early at risk students.

[1]  Eva H. Telzer,et al.  A Longitudinal Daily Diary Study of Family Assistance and Academic Achievement Among Adolescents from Mexican, Chinese, and European Backgrounds , 2009, Journal of youth and adolescence.

[2]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[3]  Glauco Devita,et al.  Inclusive approaches to effective communication and active participation in the multicultural classroom , 2000 .

[4]  James A. Danowski,et al.  The Faces of Facebookers: Investigating Social Enhancement and Social Compensation Hypotheses; Predicting FacebookTM and Offline Popularity from Sociability and Self-Esteem, and Mapping the Meanings of Popularity with Semantic Networks , 2008, J. Comput. Mediat. Commun..

[5]  M. Narasimha Murty,et al.  On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations , 2010, PAKDD.

[6]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[7]  George Siemens,et al.  Let’s not forget: Learning analytics are about learning , 2015 .

[8]  J. Siegfried,et al.  Research on Teaching Economics to Undergraduates , 2015 .

[9]  Colin Seymour-Ure,et al.  Content Analysis in Communication Research. , 1972 .

[10]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[11]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[12]  Kim Hua Tan,et al.  Strategy visualisation: knowing, understanding, and formulating , 2004 .

[13]  Ben Kei Daniel Overview of Big Data and Analytics in Higher Education , 2017 .

[14]  Teresa Moore,et al.  Identification of ‘at risk’ students using learning analytics: the ethical dilemmas of intervention strategies in a higher education institution , 2016 .

[15]  G. Altmann,et al.  Quantitative text analysis , 1993 .

[16]  Kurt Hornik,et al.  topicmodels : An R Package for Fitting Topic Models , 2016 .

[17]  H. Klüver,et al.  Measuring Interest Group Influence Using Quantitative Text Analysis , 2009 .

[18]  Sheng Tang,et al.  A density-based method for adaptive LDA model selection , 2009, Neurocomputing.

[19]  C. Suárez-Orozco,et al.  Transformations: Immigration, Family Life, and Achievement Motivation Among Latino Adolescents , 1995 .

[20]  A. Bakker,et al.  The job demands-resources model : state of the art , 2007 .

[21]  V. Dhanalakshmi,et al.  Opinion mining from student feedback data using supervised learning algorithms , 2016, 2016 3rd MEC International Conference on Big Data and Smart City (ICBDSC).

[22]  Detlef Schoder,et al.  Web Science 2.0: Identifying Trends through Semantic Social Network Analysis , 2008, 2009 International Conference on Computational Science and Engineering.

[23]  Nitesh V. Chawla,et al.  Qualitatively exploring electronic portfolios: a text mining approach to measuring student emotion as an early warning indicator , 2015, LAK.

[24]  Patricia Scott,et al.  Knowledge workers: social, task and semantic network analysis , 2005 .

[25]  Patrice Bellot,et al.  Accurate and effective latent concept modeling for ad hoc information retrieval , 2014, Document Numérique.

[26]  Dorothy E. Leidner,et al.  Knowledge Management Systems: Issues, Challenges, and Benefits , 1999, Commun. Assoc. Inf. Syst..

[27]  Carlos Delgado Kloos,et al.  Monitoring student progress using virtual appliances: A case study , 2012, Comput. Educ..

[28]  Eugenijus Kurilovas Advanced machine learning approaches to personalise learning: learning analytics and decision making , 2019, Behav. Inf. Technol..

[29]  Lindsay Miller,et al.  Student Perceptions, Problems and Strategies in Second Language Lecture Comprehension , 1992 .

[30]  Martin Ebner,et al.  What is Learning Analytics about? A Survey of Different Methods Used in 2013-2015 , 2016, ArXiv.

[31]  George Siemens,et al.  Understand students' self-reflections through learning analytics , 2018, LAK.

[32]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Krishna Madhavan,et al.  Problems in Big Data Analytics in Learning , 2016 .

[35]  M. Born,et al.  The Family-Study Interface and Academic Outcomes: Testing a Structural Model , 2011 .

[36]  Chia-Lun Lo,et al.  Developing early warning systems to predict students' online learning performance , 2014, Comput. Hum. Behav..

[37]  Jerry D. Pratton,et al.  The Effects of Active Participation on Student Learning , 1986 .

[38]  Gökhan Akçapinar,et al.  How automated feedback through text mining changes plagiaristic behavior in online assignments , 2015, Comput. Educ..

[39]  Jinan Fiaidhi,et al.  The Next Step for Learning Analytics , 2014, IT Prof..

[40]  Philipp Drieger,et al.  Semantic Network Analysis as a Method for Visual Text Analytics , 2013 .

[41]  Daniel R. Marburger,et al.  Absenteeism and Undergraduate Exam Performance , 2001 .

[42]  Bernd W. Becker,et al.  Learning Analytics: Insights Into the Natural Learning Behavior of Our Students , 2013 .

[43]  Riccardo Mazza,et al.  Introduction to Information Visualisation , 2004 .

[44]  Rebecca Ferguson,et al.  Learning analytics: drivers, developments and challenges , 2012 .

[45]  Jeffrey H. Greenhaus,et al.  Sources of Conflict Between Work and Family Roles , 1985 .

[46]  Tsui-Fang Lin,et al.  Cumulative class attendance and exam performance , 2006 .

[47]  Harold D. Lasswell,et al.  The comparative study of symbols : an introduction , 1952 .

[48]  Camilo Vieira,et al.  Visual learning analytics of educational data: A systematic literature review and research agenda , 2018, Comput. Educ..

[49]  Ben Daniel,et al.  Big Data and analytics in higher education: Opportunities and challenges , 2015, Br. J. Educ. Technol..

[50]  John D. Jackson,et al.  College Grades and Labor Market Rewards , 1990 .

[51]  Martin Ponweiser,et al.  Latent Dirichlet Allocation in R , 2012 .

[52]  Lara Khansa,et al.  Using network-based text analysis to analyze trends in Microsoft's security innovations , 2013, Comput. Secur..

[53]  L. Morris,et al.  Tracking student behavior, persistence, and achievement in online courses , 2005, Internet High. Educ..

[54]  Shane Dawson,et al.  Mining LMS data to develop an "early warning system" for educators: A proof of concept , 2010, Comput. Educ..

[55]  Po-Yao Chao,et al.  Improving early prediction of academic failure using sentiment analysis on self-evaluated comments , 2018, J. Comput. Assist. Learn..

[56]  Sarah Moore,et al.  Lecture absenteeism among students in higher education: a valuable route to understanding student motivation , 2008 .

[57]  David P. Tegarden,et al.  Business Information Visualization , 1999, Commun. Assoc. Inf. Syst..

[58]  A. Fuligni,et al.  Dimensions of family connectedness among adolescents with Mexican, Chinese, and European backgrounds. , 2006, Developmental psychology.