Pedagogical Data Analysis Via Federated Learning Toward Education 4.0

Pedagogical data analysis has been recognized as one of the most important features in pursuing Education 4.0. The recent rapid development of ICT technologies benefits and revolutionizes pedagogical data analysis via the provisioning of many advanced technologies such as big data analysis and machine learning. Meanwhile, the privacy of the students become another concern and this makes the educational institutions reluctant to share their students' data, forming isolated data islands and hindering the realization of big educational data analysis. To tackle such challenge, in this paper, we propose a federated learning based education data analysis framework FEEDAN, via which education data analysis federations can be formed by a number of institutions. None of them needs to direct exchange their students' data with each other and they always keep the data in their own place to guarantee their students' privacy. We apply our framework to analyze two real education datasets via two different federated learning paradigms. The experiment results show that it not only guarantees the students' privacy but also indeed breaks the borders of data island by achieving a higher analysis quality. Our framework can much approach the performance of centralized analysis which needs to collect the data in a common place with the risk of privacy exposure.

[1]  Heribert Popp,et al.  Education 4.0 — Fostering student's performance with machine learning methods , 2017, 2017 IEEE 23rd International Symposium for Design and Technology in Electronic Packaging (SIITME).

[2]  Wenhao Zhu,et al.  Identifying Beneficial Sessions in an e-learning System Using Machine Learning Techniques , 2018, 2018 IEEE Conference on Big Data and Analytics (ICBDA).

[3]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[4]  Arslan Shaukat,et al.  Towards the Selection of Best Machine Learning Model for Student Performance Analysis and Prediction , 2019, 2019 6th International Conference on Soft Computing & Machine Intelligence (ISCMI).

[5]  Ford Lumban Gaol,et al.  Using Machine Learning Techniques to Earlier Predict Student's Performance , 2018, 2018 Indonesian Association for Pattern Recognition International Conference (INAPR).

[6]  Toshiharu Hatanaka,et al.  Early Detection of At-Risk Students Using Machine Learning Based on LMS Log Data , 2017, 2017 6th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI).

[7]  Shou-De Lin,et al.  Feature Engineering and Classifier Ensemble for KDD Cup 2010 , 2010, KDD 2010.

[8]  Mihaela van der Schaar,et al.  A Machine Learning Approach for Tracking and Predicting Student Performance in Degree Programs , 2017, IEEE Journal of Selected Topics in Signal Processing.

[9]  Wattana Punlumjeak,et al.  Big Data Analytics: Student Performance Prediction Using Feature Selection and Machine Learning on Microsoft Azure Platform , 2017 .

[10]  Niels Pinkwart,et al.  Predicting MOOC Dropout over Weeks Using Machine Learning Methods , 2014, EMNLP 2014.

[11]  H. Vincent Poor,et al.  On Safeguarding Privacy and Security in the Framework of Federated Learning , 2020, IEEE Network.

[12]  Andre B. de Carvalho,et al.  Supervised Learning in the Context of Educational Data Mining to Avoid University Students Dropout , 2019, 2019 IEEE 19th International Conference on Advanced Learning Technologies (ICALT).

[13]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[14]  Steven C. Harris,et al.  Identifying Student Difficulty in a Digital Learning Environment , 2018, 2018 IEEE 18th International Conference on Advanced Learning Technologies (ICALT).

[15]  Song Guo,et al.  Pedagogical Data Federation toward Education 4.0 , 2020, ICFET.