A case study of duplications detection for educational domain thorough ad hoc search and identification NLP-based method

During the organization and planning of lecture courses for a discipline, its content may be overlapped and partially delivered in more than one course. Sometimes this action causes time loss through unnecessary repeating. This paper introduces an automated tool for duplications detections adapting methods of natural language processing used for Web search. The experiment for unstructured electronic document repositories clustering for thematic duplicate identification in different documents in the case of educational domain is presented. A prototype of this Web service-based software search engine is being designed and discussed. The experiment aimed to identify thematic duplicates of various courses within one of the teaching disciplines is also presented.

[1]  Rolf Schwitter,et al.  Meaningful web annotations for humans and machines using controlled natural language , 2008, Expert Syst. J. Knowl. Eng..

[2]  Eduard H. Hovy,et al.  Intelligent Approaches to Mining the Primary Research Literature: Techniques, Systems, and Examples , 2008, Computational Intelligence in Medical Informatics.

[3]  D. Benos,et al.  The ups and downs of peer review. , 2007, Advances in physiology education.

[4]  Olga C. Santos,et al.  User‐centred design and educational data mining support during the recommendations elicitation process in social online learning environments , 2015, Expert Syst. J. Knowl. Eng..

[5]  Ah-Hwee Tan,et al.  Knowledge discovery from texts: a concept frame graph approach , 2002, CIKM '02.

[6]  Danco Davcev,et al.  Distributed multimedia information retrieval system , 1992, Comput. Commun..

[7]  Nikos Papadakis,et al.  A tool for access to relational databases in natural language , 2011, Expert Syst. Appl..

[8]  Christian Severin Sauer,et al.  Extracting knowledge from web communities and linked data for case-based reasoning systems , 2014, Expert Syst. J. Knowl. Eng..

[9]  Xiangji Huang,et al.  Hierarchical grouping of association rules and its application to a real-world domain , 2006, Int. J. Syst. Sci..

[10]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[11]  Sang Chan Park,et al.  Web mining for distance education , 2000, Proceedings of the 2000 IEEE International Conference on Management of Innovation and Technology. ICMIT 2000. 'Management in the 21st Century' (Cat. No.00EX457).

[12]  Chi Fai Cheung,et al.  Mining knowledge from natural language texts using fuzzy associated concept mapping , 2008, Inf. Process. Manag..

[13]  William F. Punch,et al.  Mining interesting contrast rules for a web-based educational system , 2004, 2004 International Conference on Machine Learning and Applications, 2004. Proceedings..

[14]  Sanja Vranes,et al.  NIMFA - Natural language Implicit Meaning Formalization and Abstraction , 2010, Expert Syst. Appl..

[15]  Antonio Fernández-Caballero,et al.  A Meta-ontological Framework for Multi-agent Systems Design , 2007, IWINAC.

[16]  Fabian M. Suchanek,et al.  AMIE: association rule mining under incomplete evidence in ontological knowledge bases , 2013, WWW.