Towards Compiling Textbooks from Wikipedia

In this paper, we explore the challenges of compiling a pedagogic resource, such as a textbook, on a given topic from relevant Wikipedia articles, and present an approach to assist humans in this task. We present an algorithm that attempts to suggest a textbook structure from Wikipedia, based on a set of seed concepts (chapters) provided by the user. We also conceptualize a decision support system in which users can interact with the proposed structure and the corresponding Wikipedia content to improve its pedagogic value. The proposed algorithm is implemented and evaluated against the outlines of online textbooks on five different subjects. We also propose a measure to quantify the pedagogic value of the suggested textbook structure.
