An associative rule-based classifier for Arabic medical text

Text classification is one of the methods used for managing, organising and retrieving the needed data among the huge available text. Several methods have been proposed to manipulate the text classification problem. In recent years, some studies proposed the use of Associative Classification AC approach. This paper examines an associative classification approach for the categorisation of text typed in Arabic language and related to medical domain. The approach discovers a set of association rules to build a classification model where three steps were applied to build the model: generating association rules, rule ordering and pruning, and then validation. The results of the experiments showed that the ordered decision list approach outperforms other approaches with accuracy reaching 90.6%. In general, the results of the experiments showed that association rule mining is a suitable method for building good classification models to categorise Arabic medical text.

[1]  Azuraliza Abu Bakar,et al.  Text associative classification approach for mining Arabic data set , 2012, 2012 4th Conference on Data Mining and Optimization (DMO).

[2]  Abdulmohsen Al-Thubaity,et al.  Automatic Arabic Text Classification , 2008 .

[3]  Shweta Taneja,et al.  A Lexical Approach for Text Categorization of Medical Documents , 2015 .

[4]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[5]  Jun Zhang,et al.  Associative Classification in Text Categorization , 2005, ICIC.

[6]  Zakaria Elberrichi,et al.  Multilingual Medical Documents Classification Based on MesH Domain Ontology , 2012, ArXiv.

[7]  Abdullah S. Ghareb,et al.  An Approach for Arabic Text Categorization Using Association Rule Mining , 2011, Int. J. Comput. Process. Orient. Lang..

[8]  Mohammed N. Al-Kabi,et al.  Comparative Assessment of the Performance of Three WEKA Text Classifiers Applied to Arabic Text , 2012 .

[9]  Uzma Ansari,et al.  Text Categorization based on Associative Classification , 2010 .

[10]  Peter I. Cowling,et al.  MMAC: a new multi-class, multi-label associative classification approach , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[11]  Suzan Wedyan,et al.  Review and Comparison of Associative Classification Data Mining Approaches , 2014 .

[12]  Zakaria Elberrichi,et al.  Arabic text categorization: a comparative study of different representation modes , 2012, Int. Arab J. Inf. Technol..

[13]  Saleh Alsaleem,et al.  Automated Arabic Text Categorization Using SVM and NB , 2011, Int. Arab. J. e Technol..

[14]  Mofleh Al-Diabat,et al.  Arabic Text Categorization Using Classification Rule Mining , 2012 .

[15]  Anette Hulth,et al.  General-Purpose Text Categorization Applied to the Medical Domain. , 2007 .

[16]  Alaa M. El-Halees,et al.  Arabic Text Classification Using Maximum Entropy , 2015 .

[17]  Wa'el Musa Hadi,et al.  Performance of NB and SVM classifiers in Islamic Arabic data , 2010, ISWSA '10.

[18]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[19]  Julio Villena-Román,et al.  MIDAS: An Information-Extraction Approach to Medical Text Classification , 2008, Proces. del Leng. Natural.

[20]  John D. Kelleher,et al.  Medical Language Processing for Patient Diagnosis Using Text Classification and Negation Labelling , 2008 .

[21]  S. M. Kamruzzaman,et al.  Text Classification using Association Rule with a Hybrid Concept of Naive Bayes Classifier and Genetic Algorithm , 2010, ArXiv.

[22]  Fawaz A. Al Zaghoul,et al.  Arabic Text Classification Based on Features Reduction Using Artificial Neural Networks , 2013, 2013 UKSim 15th International Conference on Computer Modelling and Simulation.

[23]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[24]  Peter I. Cowling,et al.  A greedy classification algorithm based on association rule , 2007, Appl. Soft Comput..

[25]  Jaber Alwedyan,et al.  Categorize arabic data sets using multi-class classification based on association rule approach , 2011, ISWSA '11.

[26]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[27]  Anirban Chakrabarty A Framework for Medical Text Mining using a Novel Categorical Clustering Algorithm , 2013 .

[28]  Nazlia Omar,et al.  An automated arabic text categorization based on the frequency ratio accumulation , 2014, Int. Arab J. Inf. Technol..

[29]  Ashwin Ram,et al.  Adapting associative classification to text categorization , 2007, DocEng '07.

[30]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[31]  Fadi A. Thabtah,et al.  A review of associative classification mining , 2007, The Knowledge Engineering Review.

[32]  Laila Khreisat,et al.  Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study , 2006, DMIN.

[33]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.