Text Mining Approach Using TF-IDF and Naive Bayes for Classification of Exam Questions Based on Cognitive Level of Bloom's Taxonomy

Bloom's Taxonomy is a unity of three domains, which are divided into lower orders and high orders based on the Bloom Taxonomic Cognitive Domain, the level is used to classify learning objectives and serve as benchmarks for evaluating student achievement. Basically, an evaluation of student achievement can be done by giving questions on exam activities. The questions given are then classified according to the level in the Cognitive Domain. However, because the number of questions is too many and the classification is still manual, it causes the classification results are not accurate and inconsistent. Therefore, the employing of the Naive Bayes Classifier in classifying exam questions based on levels in the Cognitive Domain can be a solution. This study uses real-world dataset collected from mid-terms and final exams questions taken from Department of Information Systems, Telkom University from the academic year 2012/2013 to 2018/2019. In particular, we examined Words, Characters, and N-gram as indexing terms. The results showed that the classification using Naïve Bayes and TF-IDF with N-gram as indexing terms achieved precision of 85% and recall of 80%.

[1]  D. Krathwohl A Revision of Bloom's Taxonomy: An Overview , 2002 .

[2]  Nazlia Omar,et al.  Automated Analysis of Exam Questions According to Bloom's Taxonomy , 2012 .

[3]  Baker Lawley Bloom’s Taxonomy , 2020 .

[4]  Ted Kwartler What is Text Mining , 2017 .

[5]  Xxyyzz,et al.  Bloom’s Taxonomy , 2020 .

[6]  Renato De Mori,et al.  The Application of Semantic Classification Trees to Natural Language Understanding , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Iwan Tri Riyadi Yanto,et al.  Attribute selection on student performance dataset using maximum dependency attribute , 2017, 2017 5th International Conference on Electrical, Electronics and Information Engineering (ICEEIE).

[8]  Tutut Herawan,et al.  A Soft Set-based Co-occurrence for Clustering Web User Transactions , 2017 .

[9]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[10]  Sergei Nirenburg,et al.  Semantic Classification for Practical Natural Language Processing , 1995 .

[11]  Judit Ács,et al.  A Two-level Classifier for Discriminating Similar Languages , 2015 .

[12]  K. Bretonnel Cohen,et al.  Getting Started in Text Mining , 2008, PLoS Comput. Biol..

[13]  Sankaran Mahadevan,et al.  An improved method to construct basic probability assignment based on the confusion matrix for classification problem , 2016, Inf. Sci..

[14]  Geoff Holmes,et al.  Multinomial Naive Bayes for Text Categorization Revisited , 2004, Australian Conference on Artificial Intelligence.

[15]  Dan Braha Data mining for design and manufacturing: methods and applications , 2001 .

[16]  Susan Feldman,et al.  NLP Meets the Jabberwocky: Natural Language Processing in Information Retrieval. , 1999 .

[17]  Iwan Tri Riyadi Yanto,et al.  Application of adaptive neuro-fuzzy inference system and chicken swarm optimization for classifying river water quality , 2017, 2017 5th International Conference on Electrical, Electronics and Information Engineering (ICEEIE).

[18]  Arthur James Swart,et al.  Evaluation of Final Examination Papers in Engineering: A Case Study Using Bloom's Taxonomy , 2010, IEEE Transactions on Education.

[19]  Ömer Gökhan Ulum,et al.  Language assessment through Bloom’s Taxonomy , 2018 .

[20]  Milton S. Boyd,et al.  Designing a neural network for forecasting financial and economic time series , 1996, Neurocomputing.

[21]  Iwan Tri Riyadi Yanto,et al.  A Relative Tolerance Relation of Rough Set for Incomplete Information Systems , 2018, SCDM.

[22]  Hajo Hippner,et al.  Text Mining , 2006, Informatik-Spektrum.

[23]  Paolo Giudici,et al.  Applied Data Mining: Statistical Methods for Business and Industry , 2003 .

[24]  Haruna Chiroma,et al.  A Framework for Clustering of Web Users Transaction Based on Soft Set Theory , 2015, DaEng.

[25]  E. Furst Bloom’s Taxonomy of Educational Objectives for the Cognitive Domain: Philosophical and Educational Issues , 1981 .