Using Prerequisites to Extract Concept Maps fromTextbooks

We present a framework for constructing a specific type of knowledge graph, a concept map from textbooks. Using Wikipedia, we derive prerequisite relations among these concepts. A traditional approach for concept map extraction consists of two sub-problems: key concept extraction and concept relationship identification. Previous work for the most part had considered these two sub-problems independently. We propose a framework that jointly optimizes these sub-problems and investigates methods that identify concept relationships. Experiments on concept maps that are manually extracted in six educational areas (computer networks, macroeconomics, precalculus, databases, physics, and geometry) show that our model outperforms supervised learning baselines that solve the two sub-problems separately. Moreover, we observe that incorporating textbook information helps with concept map extraction.

[1]  Distant Supervision for Relation Extraction with Matrix Completion , 2014, ACL.

[2]  Taha Yasseri,et al.  The distorted mirror of Wikipedia: a quantitative analysis of Wikipedia coverage of academics , 2013, EPJ Data Science.

[3]  Ramesh Nallapati,et al.  Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[4]  Benjamin Bräutigam,et al.  Concept Hierarchy Extraction from Textbooks , 2015, DocEng.

[5]  Yiming Yang,et al.  Concept Graph Learning from Educational Data , 2015, WSDM.

[6]  Gerald S. Craig,et al.  Learning with science , 1961 .

[7]  Richard T. Gill,et al.  Conceptual Graph Analysis: Knowledge Acquisition for Instructional System Design , 1993 .

[8]  Henry A. Kautz,et al.  Hardening soft information sources , 2000, KDD '00.

[9]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[10]  Ian H. Witten,et al.  Topic indexing with Wikipedia , 2008 .

[11]  Shian-Shyong Tseng,et al.  A new approach for constructing the concept map , 2004, IEEE International Conference on Advanced Learning Technologies, 2004. Proceedings..

[12]  Luke S. Zettlemoyer,et al.  Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[13]  Wenyi Huang,et al.  Measuring Prerequisite Relations Among Concepts , 2015, EMNLP.

[14]  Qinghua Zheng,et al.  MOTIF-RE: Motif-Based Hypernym/Hyponym Relation Extraction from Wikipedia Links , 2012, ICONIP.

[15]  Xianpei Han,et al.  Named entity disambiguation by leveraging wikipedia semantic knowledge , 2009, CIKM.

[16]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[17]  Yinghui Wu,et al.  Fast top-k search in knowledge graphs , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[18]  References , 1971 .

[19]  R. Kavitha,et al.  An augmented prerequisite concept relation map design to improve adaptivity in e-learning , 2012, International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012).

[20]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[21]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[22]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[23]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[24]  Sarah J. Tracy,et al.  Wikipedia as Public Scholarship: Communicating Our Impact Online , 2010 .

[25]  Razvan C. Bunescu,et al.  Using Encyclopedic Knowledge for Named entity Disambiguation , 2006, EACL.

[26]  Chau Q. Nguyen,et al.  An Ontology-Based Approach for Key Phrase Extraction , 2009, ACL/IJCNLP.

[27]  Lise Getoor,et al.  Large-Scale Knowledge Graph Identification Using PSL , 2013, AAAI Fall Symposia.

[28]  Ganesh Ramakrishnan,et al.  Collective annotation of Wikipedia entities in web text , 2009, KDD.

[29]  William W. Cohen,et al.  Crowdsourced Comprehension: Predicting Prerequisite Structure in Wikipedia , 2012, BEA@NAACL-HLT.

[30]  Jason Weston,et al.  Learning Structured Embeddings of Knowledge Bases , 2011, AAAI.

[31]  Finn Årup Nielsen,et al.  Wikipedia in the eyes of its beholders: A systematic review of scholarly research on Wikipedia readers and readership , 2014, J. Assoc. Inf. Sci. Technol..

[32]  Andrew McCallum,et al.  Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[33]  N. J. Schweitzer Wikipedia and Psychology: Coverage of Concepts and Its Use by Undergraduate Students , 2008 .

[34]  H. Suen,et al.  Concept Map Assessment of Classroom Learning: Reliability, Validity, and Logistical Practicality , 1999 .

[35]  Dejing Dou,et al.  Learning to Refine an Automatically Extracted Knowledge Base Using Markov Logic , 2012, 2012 IEEE 12th International Conference on Data Mining.

[36]  Lise Getoor,et al.  Knowledge Graph Identification , 2013, SEMWEB.

[37]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[38]  Lei Liu,et al.  Prerequisite Concept Maps Extraction for AutomaticAssessment , 2016, WWW.

[39]  Zhifang Sui,et al.  Towards Accurate Distant Supervision for Relational Facts Extraction , 2013, ACL.

[40]  Ian H. Witten,et al.  Human-competitive tagging using automatic keyphrase extraction , 2009, EMNLP.

[41]  Ian H. Witten,et al.  An effective, low-cost measure of semantic relatedness obtained from Wikipedia links , 2008 .

[42]  Li Guo,et al.  Knowledge Base Completion Using Embeddings and Rules , 2015, IJCAI.

[43]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[44]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[45]  Doug Downey,et al.  Local and Global Algorithms for Disambiguation to Wikipedia , 2011, ACL.