DaST: An Online Platform for Automated Exercise Generation and Solving in the Data Science Domain

Over the last few years data science has emerged both as a new research field and as an educational domain that attracted a large number of researchers and data practitioners. Although data science research is developing at a high pace, the educational process in the field has been left behind in terms of educational tools and practices, despite the high number of data science courses offered and the number of involved stakeholders (professors, tutors, and students). The present work aims to cover the gap of educational data science tools by proposing a novel platform; the platform, coined Data Science Tutor (DaST), is a free online tool that offers automated step-by-step exercise solving in a variety of data science algorithms/techniques aiming at giving insight to the particularities of each algorithm. The solutions of the exercises are accompanied with in-context explanations that refer to the operation of the respective algorithm/technique, and are compatible with the terminology and the methodology in popular textbooks. The tool aims at students, lecturers, and data practitioners in many diverse fields (ranging from data analysts to transport engineers to logistics managers) that want to learn the particularities of data science algorithms in a stepwise, interactive manner. Through the proposed platform (a) students in the data science field or in related courses (e.g., machine learning, information retrieval) may get solutions for different types of exercises and focus on the details of each algorithm, (b) tutors (lecturers, lab assistants) may easily produce a wide variety of exercises with the accompanying solutions and use them in classroom or as an auxiliary tool for test correction, and (c) data practitioners may get valuable insights on popular data science algorithms. To the best of our knowledge, the proposed platform is the first educational tool that aims at the data science field, and has so far been warmly accepted by departments worldwide.

[1]  Michael Lvov,et al.  Mathematical Models of Supporting the Solution of the Algebra Tasks in Systems of Computer Mathematics for Educational Purposes , 2018, ICTERI Workshops.

[2]  Juan M. Fernández-Luna,et al.  Teaching and learning in information retrieval , 2009, Information Retrieval.

[3]  Johan Jeuring,et al.  Strategy-based feedback in a programming tutor , 2014, CSERC.

[4]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[5]  K. Viswanathan Iyer A dynamic intranet-based online-portal support for Computer Science teaching , 2015, Education and Information Technologies.

[6]  Sylvia Stuurman,et al.  Beautiful JavaScript: how to guide students to create good and elegant code , 2014, CSERC.

[7]  Rosa Maria Vicari,et al.  A Personalized Information Retrieval Service for an Educational Environment , 2004, Intelligent Tutoring Systems.

[8]  Vasant Dhar,et al.  Data science and prediction , 2012, CACM.

[9]  Eero Sormunen,et al.  Learning Information Retrieval through an Educational Game. Is Gaming sufficient for learning? , 2000, Educ. Inf..

[10]  Anand Rajaraman,et al.  Mining of Massive Datasets , 2011 .

[11]  Anthony J. G. Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery [Point of View] , 2011 .

[12]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[13]  Ioannis Hatzilygeroudis,et al.  An Educational System for Learning Search Algorithms and Automatically Assessing Student Performance , 2017, International Journal of Artificial Intelligence in Education.

[14]  David Arnau,et al.  Fundamentals of the design and the operation of an intelligent tutoring system for the learning of the arithmetical and algebraic way of solving word problems , 2013, Comput. Educ..

[15]  Jules J. Berman,et al.  Principles of Big Data: Preparing, Sharing, and Analyzing Complex Information , 2013 .

[16]  Rafael A. Irizarry,et al.  A Guide to Teaching Data Science , 2016, The American statistician.

[17]  Tracy Anne Hammond,et al.  Maestoso: An Intelligent Educational Sketching Tool for Learning Music Theory , 2015, AAAI.

[18]  Frances Johnson Shifting contexts: relating the user, search and system in teaching information retrieval , 2011 .

[19]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[20]  Cristina Conati,et al.  Exploring the Need for Explainable Artificial Intelligence (XAI) in Intelligent Tutoring Systems (ITS) , 2019, IUI Workshops.

[21]  Haifeng Zhang,et al.  Teaching AI Algorithms with Games Including Mahjong and FightTheLandlord on the Botzone Online Platform , 2019, CompEd.

[22]  William S. Cleveland,et al.  Data science: An action plan for expanding the technical areas of the field of statistics , 2001, Stat. Anal. Data Min..

[23]  Jimmy J. Lin,et al.  Book Reviews: Data-Intensive Text Processing with MapReduce by Jimmy Lin and Chris Dyer , 2010, CL.

[24]  Edward J. Kim,et al.  Teaching Data Science , 2016, ICCS.

[25]  J. Marcos Moreno-Vega,et al.  Problem MetaHeuristic Solver: An educational tool aimed at studying heuristic optimization methods , 2015, Comput. Appl. Eng. Educ..

[26]  Tony Hey,et al.  The Fourth Paradigm: Data-Intensive Scientific Discovery , 2009 .

[27]  Shizuka Shirai,et al.  Intelligent Editor for Authoring Educational Materials in Mathematics e-Learning Systems , 2018, ICMS.

[28]  Swantje Willms Visualizing a User Model for Educational Adaptive Information Retrieval , 2003, User Modeling.