Automated Essay Scoring using Word2vec and Support Vector Machine

Essay scoring is one of the most important tools for evaluating and assessing the level of achievement of educational goals. It aims to innovate performance, arrange, integrate ideas, and connect them by using the vocabulary of the particular subjects. Human essay scoring consumes a lot of time and effort, this leads to mistakes. Automated Essay Scoring (AES) solve to great extent problems. A new approach for AES is presented. It is based on Natural Language Processing (NLP) which is used to unify linguistic answers, word2vec model which converts words into features and synonyms in semantic space, Support Vector Machine(SVM) is used to classify students answers and estimate score levels. The system stages consist of preprocessing, feature extraction, classification and similarity algorithm. The results of proposed method reaches high precision (94%) relative to human resident scores.

[1]  Anjali Ganesh Jivani,et al.  A Comparative Study of Stemming Algorithms , 2011 .

[2]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[3]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[4]  Alabhya Farkiya,et al.  Natural Language Processing using NLTK and WordNet , 2015 .

[5]  Isabelle Guyon,et al.  An Introduction to Feature Extraction , 2006, Feature Extraction.

[6]  Robert Williams,et al.  Automated essay grading systems applied to a first year university subject: how can we do it better? , 2002 .

[7]  Yun Zhu,et al.  Support vector machines and Word2vec for text classification with semantic features , 2015, 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC).

[8]  Parag A. Guruji,et al.  EVALUATION OF SUBJECTIVE ANSWERS USING GLSA ENHANCED WITH CONTEXTUAL SYNONYMY , 2015 .

[9]  Grigori Sidorov,et al.  Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model , 2014, Computación y Sistemas.

[10]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[11]  Beata Beigman Klebanov,et al.  Automated Essay Scoring , 2021, Synthesis Lectures on Human Language Technologies.

[12]  A. R. Weerasinghe,et al.  A dynamic semantic space modelling approach for short essay grading , 2015, 2015 Fifteenth International Conference on Advances in ICT for Emerging Regions (ICTer).

[13]  Semire Dikli,et al.  An Overview of Automated Scoring of Essays. , 2006 .

[14]  Mark,et al.  Apples Grading based on SVM Classifier , 2020 .

[15]  Arshad Arafat,et al.  Automated essay grading with recommendation , 2016 .

[16]  Klaus Zechner,et al.  Automated Essay Scoring: Writing Assessment and Instruction , 2010 .

[17]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[19]  Md. Monjurul Islam,et al.  Automated essay scoring using Generalized Latent Semantic Analysis , 2010, 2010 13th International Conference on Computer and Information Technology (ICCIT).

[20]  Jatinderkumar R. Saini,et al.  Stop-Word Removal Algorithm and its Implementation for Sanskrit Language , 2016 .

[21]  Ben He,et al.  Automated Essay Scoring by Maximizing Human-Machine Agreement , 2013, EMNLP.

[22]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[23]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[24]  Knowledge Analysis Technologies , .

[25]  Ruchika Malhotra,et al.  Techniques for text classification: Literature review and current trends , 2015, Webology.

[26]  Yanqing Zhang,et al.  Using Word2Vec to process big text data , 2015, 2015 IEEE International Conference on Big Data (Big Data).