A method for automatic construction of learning contents in semantic web by a text mining approach

E-learning has gradually been accepted as an alternative for traditional lecture-based learning. One key factor for the success of e-learning is the possibility of understanding the semantics of learning contents autonomously by machines. The semantic web could naturally fit in according to its ability on information interchange and sharing between machines. Such ability is made possible when the semantic web pages are properly annotated. To transform the existing semantics-lacking learning contents to semantics-enriched ones, we propose a machine learning approach to automatically generate semantic markups for traditional learning contents, which are usually presented in web pages. The proposed method applies the self-organising map algorithm to cluster training web pages and conducts a text mining process to discover the anchor texts to be tagged and their semantic descriptions. Preliminary experiments show that our method may successfully generate semantic markups for the web pages that could be used for e-learning in the semantic web environment.

[1]  Marja-Riitta Koivunen,et al.  Metadata Based Annotation Infrastructure Offers Flexibility and Extensibility for Collaborative Applications and Beyond , 2001, Semannot@K-CAP 2001.

[2]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[3]  Steffen Staab,et al.  S-CREAM: Semiautomatic CREAtion of Metadata , 2002, SAAKM@ECAI.

[4]  Alexiei Dingli,et al.  Automatic semantic annotation using unsupervised information extraction and integration , 2003 .

[5]  Lei Zhang,et al.  Learning to Generate Semantic Annotation for Domain Specific Sentences , 2001, Semannot@K-CAP 2001.

[6]  Enrico Motta,et al.  Knowledge Extraction by Using an Ontology Based Annotation Tool , 2001, Semannot@K-CAP 2001.

[7]  Steffen Staab,et al.  eLearning based on the semantic web , 2001 .

[8]  Vladan Devedzic,et al.  Education and the Semantic Web , 2005, Int. J. Artif. Intell. Educ..

[9]  Arthur Stutt,et al.  MnM: Ontology Driven Semi-automatic and Automatic Support for Semantic Markup , 2002, EKAW.

[10]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[11]  Mikael Nilsson,et al.  Semantic Web Meta-data for e-Learning : Some Architectural Guidelines , 2002, WWW 2002.

[12]  Myra Spiliopoulou,et al.  Extraction of Semantic XML DTDs from Texts Using Data Mining Techniques , 2001, Semannot@K-CAP 2001.

[13]  Carole A. Goble,et al.  Towards Annotation Using DAML+OIL , 2001, Semannot@K-CAP 2001.

[14]  Steffen Staab,et al.  From Manual to Semi-Automatic Semantic Annotation: About Ontology-Based Text Annotation Tools , 2000, SAIC@COLING.

[15]  Steffen Staab,et al.  Authoring and annotation of web pages in CREAM , 2002, WWW.

[16]  Atanas Kiryakov,et al.  Semantic annotation, indexing, and retrieval , 2004, J. Web Semant..

[17]  T. Anderson,et al.  The Educational Semantic Web: Visioning and Practicing the Future of Education , 2004 .

[18]  Jeff Heflin,et al.  Searching the Web with SHOE , 2000 .

[19]  Philippe Martin,et al.  Embedding Knowledge in Web Documents , 1999, Comput. Networks.

[20]  Gerd Stumme,et al.  Semantic resource management for the web: an e-learning application , 2004, WWW Alt. '04.

[21]  Stefan Decker,et al.  Creating Semantic Web Contents with Protégé-2000 , 2001, IEEE Intell. Syst..

[22]  Karsten Winkler,et al.  Semantic Tagging of Domain-Specific Text Documents with DIAsDEM , 2001 .

[23]  Ramanathan V. Guha,et al.  A case for automated large-scale semantic annotation , 2003, J. Web Semant..

[24]  Laura Farinetti,et al.  Semantic annotation and search at the document substructure level , 2003 .

[25]  Eric Prud'hommeaux,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2002, Comput. Networks.