Intelligent Agent for Information Extraction from Arabic Text without Machine Translation

The process of classifying text into two opposing opinions is known as sentiment polarity classification. It has been shown in the literature that this problem cannot reach accuracy higher than 80-85%. This paper shows that a higher accuracy (96%) can be achieved without the need to translate text into English language. More specifically, our case study is: Islamic Hadith Narration. The problem is to tell whether a person is trustworthy or not based on his biographical data. With such high accuracy, the agent can be used to create new books in the area of Hadith automatically instead of manual classification done before. The results of our experiments encourage the use of an intelligent agent for information extraction using supervised learning, domain knowledge and number of natural language processing techniques.

[1]  Jeffrey M. Bradshaw,et al.  An Agent-Oriented Service Model for a Personal Information Manager , 2009, SOCASE.

[2]  Wei Cao,et al.  Image classification based on Bayes point machines , 2009, 2009 IEEE International Workshop on Imaging Systems and Techniques.

[3]  Edward Y. Chang,et al.  BPMs versus SVMs for image classification , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[4]  Rada Mihalcea,et al.  Multilingual Subjectivity Analysis Using Machine Translation , 2008, EMNLP.

[5]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[6]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[9]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.