Integrating multiple data sources for learning analytics—review of literature

Learning analytics (LA) promises understanding and optimization of learning and learning environments. To enable richer insights regarding questions related to learning and education, LA solutions should be able to integrate data coming from many different data sources, which may be stored in different formats and have varying levels of structure. Data integration also plays a role for the scalability of LA, an important challenge in itself. The objective of this review is to assess the current state of LA in terms of data integration in the context of higher education. The initial search of six academic databases and common venues for publishing LA research resulted in 115 publications, out of which 20 were included in the final analysis. The results show that a few data sources (e.g., LMS) appear repeatedly in the research studies; the number of data sources used in LA studies in higher education tends to be limited; when data are integrated, similar data formats are often combined (a low-hanging fruit in terms of technical challenges); the research literature tends to lack details about data integration in the implemented systems; and, despite being a good starting point for data integration, educational data specifications (e.g., xAPI) seem to seldom be used. In addition, the results indicate a lack of stakeholder (e.g., teachers/instructors, technology vendors) involvement in the research studies. The review concludes by offering recommendations to address limitations and gaps in the research reported in the literature.

[1]  Theresa Kwong,et al.  Learning on mobile augmented reality trails of integrity and ethics , 2018, Res. Pract. Technol. Enhanc. Learn..

[2]  Dean Allemang,et al.  Semantic Web for the Working Ontologist - Effective Modeling in RDFS and OWL, Second Edition , 2011 .

[3]  Bikram Sengupta,et al.  Seamless blended learning using the Cognitive Learning Companion: A systemic view , 2015, IBM J. Res. Dev..

[4]  María Jesús Rodríguez-Triana,et al.  The teacher in the loop: customizing multimodal learning analytics for blended learning , 2018, LAK.

[5]  Eitel J. M. Lauría,et al.  Early Alert of Academically At-Risk Students: An Open Source Analytics Initiative , 2014, J. Learn. Anal..

[6]  Rosa Alarcón,et al.  Centralized student performance prediction in large courses based on low-cost variables in an institutional context , 2018, Internet High. Educ..

[7]  Rajeev Agrawal,et al.  Challenges of data integration and interoperability in big data , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[8]  Thomas Keays,et al.  Semantic Web for the Working Ontologist , 2008 .

[9]  D. Moher,et al.  Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement , 2009, BMJ : British Medical Journal.

[10]  Penny Wheeler,et al.  Discovering indicators of successful collaboration using tense: Automated extraction of patterns in discourse , 2014, Br. J. Educ. Technol..

[11]  M. Hofmann,et al.  Learning Factor Models of Students at Risk of Failing in the Early Stage of Tertiary Education , 2016, J. Learn. Anal..

[12]  Brendan FLANAGANa,et al.  Integration of Learning Analytics Research and Production Systems While Protecting Privacy , 2018 .

[13]  Zachary A. Pardos,et al.  moocRP: An Open-source Analytics Platform , 2015, L@S.

[14]  Dieter Fensel,et al.  Knowledge Engineering: Principles and Methods , 1998, Data Knowl. Eng..

[15]  Jina Kang,et al.  Using Data to Understand How to Better Design Adaptive Learning , 2017, Technol. Knowl. Learn..

[16]  Stefaan Ternier,et al.  Learning pulse: a machine learning approach for predicting performance in self-regulated learning using multimodal data , 2017, LAK.

[17]  M. Zheng,et al.  Faculty professional development in emergent pedagogies for instructional innovation in dental education , 2017, European journal of dental education : official journal of the Association for Dental Education in Europe.

[18]  Xavier Ochoa,et al.  The RAP system: automatic feedback of oral presentation skills using multimodal analysis and low-cost sensors , 2018, LAK.

[19]  Il-Hyun Jo,et al.  Clustering blended learning courses by online behavior data: A case study in a Korean higher education institute , 2016, Internet High. Educ..

[20]  Fabio A. González,et al.  A Model to Predict Low Academic Performance at a Specific Enrollment Using Data Mining , 2015, IEEE Revista Iberoamericana de Tecnologias del Aprendizaje.

[21]  Barbara Wasson,et al.  State of the Field Report on Learning Analytics , 2017 .

[22]  George S. Chen,et al.  Linking early alert systems and student retention: a survival analysis approach , 2018 .

[23]  Luc Paquette,et al.  A Longitudinal Study on Learner Career Advancement in MOOCs , 2014, J. Learn. Anal..

[24]  Pierre Dillenbourg,et al.  Sleepers' lag - study on motion and attention , 2014, LAK.

[25]  Michail N. Giannakos,et al.  Multimodal data as a means to understand the learning experience , 2019, Int. J. Inf. Manag..

[26]  Kui Xie,et al.  The role of self-regulated learning in students' success in flipped undergraduate math courses , 2018, Internet High. Educ..

[27]  Wahidah Husain,et al.  A Review on Predicting Student's Performance Using Data Mining Techniques , 2015 .

[28]  Ulrik Schroeder,et al.  Toward an Open Learning Analytics Ecosystem , 2017 .

[29]  Boban Vesin,et al.  Cross-Platform Analytics: A step towards Personalization and Adaptation in Education , 2019, LAK.

[30]  Dean Allemang,et al.  Chapter 2 – Semantic modeling , 2011 .

[31]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[32]  M.-H. Chang,et al.  An analysis of collaborative problem-solving activities mediated by individual-based and collaborative computer simulations , 2017, J. Comput. Assist. Learn..

[33]  Dragan Gasevic,et al.  Rethinking learning analytics adoption through complexity leadership theory , 2018, LAK.

[34]  Richard T. Scott,et al.  Data sharing requirements: perspectives from three authors. , 2018, Fertility and sterility.

[35]  Angela Stewart,et al.  Gaze-based Detection of Mind Wandering during Lecture Viewing , 2017, EDM.