Improving term extraction by utilizing user annotations

Automated acquisition of relevant domain terms from educational documents available in social educational systems can benefit from processing a growing number of user-created annotations assigned to the content. Annotations provide us potentially useful information about documents and can improve the results of base Automatic Term Recognition (ATR) algorithms. We propose a method for relevant domain terms extraction based on user-created annotations processing. We consider three basic annotation types: tags, comments and highlights. The final term weight is computed by combining relevant domain terms weights obtained from the individual annotation types and those obtained from the text. The method was evaluated using data from Principles of Software Engineering course in adaptive educational system ALEF and showed that enhancements based on annotation processing yield significant improvement of results.

[1]  Mária Bieliková,et al.  Automated Educational Course Metadata Generation Based on Semantics Discovery , 2009, EC-TEL.

[2]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[3]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[4]  Stephen Downes E-learning 2.0 , 2005, ELERN.

[5]  Ziqi Zhang,et al.  A Comparative Evaluation of Term Recognition Algorithms , 2008, LREC.

[6]  Zdenek Zdrahal,et al.  Towards a framework for comparing automatic term recognition methods , 2009 .

[7]  Róbert Móro,et al.  Personalized Text Summarization Based on Important Terms Identification , 2012, 2012 23rd International Workshop on Database and Expert Systems Applications.

[8]  Wolfgang Nejdl,et al.  How do you feel about "dancing queen"?: deriving mood & theme annotations from user tags , 2009, JCDL '09.

[9]  Mária Bieliková,et al.  Utilizing Microblogs for Web Page Relevant Term Acquisition , 2013, SOFSEM.

[10]  Mária Bieliková,et al.  ALEF: A Framework for Adaptive Web-Based Learning 2.0 , 2010, Key Competencies in the Knowledge Society.

[11]  Yong Yu,et al.  Exploring social annotations for the semantic web , 2006, WWW '06.

[12]  Jakub Simko,et al.  Little search game: term network acquisition via a human computation game , 2011, HT '11.

[13]  Mohammed Bennamoun,et al.  Ontology learning from text: A look back and into the future , 2012, CSUR.

[14]  Marián Simko,et al.  Improving Relevance of Keyword Extraction from the Web Utilizing Visual Style Information , 2013, SOFSEM.

[15]  Mária Bieliková,et al.  Exercises recommending for limited time learning , 2010, RecSysTEL@RecSys.