Optimizing XCSR for Text Classification

XCS, an evolutionary computing technique, can classify data using both bit strings and real valued representations. "Real valued XCS" (XCSR) commonly uses the min max interval based representation (MMR) for continuous valued data sets. Text data sets can be represented using bag of words based real valued representation, e.g. term frequency inverse document frequency of features. In this work we classify social media short informal text messages using XCSR, for the first time, from two major domains, i.e. spam detection and sentiment analysis. We perform spam detection of SMS and Email messages, and sentiment analysis of reviews and tweets. Feature vectors extracted from short text messages are very sparse and XCSR with MMR representation can not handle sparse data sets very well. We proposed XCSR# that uses MMR representation with explicit "don't care" intervals to handle sparse social media data sets. The experimental results indicate that introduction of the explicit "don't care" intervals improved the performance and created a statistically significant impact, specifically in the spam detection data sets. Further, it is observed that XCSR# produced more accurate and general rules than XCSR.

[1]  Stewart W. Wilson Get Real! XCS with Continuous-Valued Inputs , 1999, Learning Classifier Systems.

[2]  Martin V. Butz,et al.  Toward a theory of generalization and learning in XCS , 2004, IEEE Transactions on Evolutionary Computation.

[3]  Stewart W. Wilson Mining Oblique Data with XCS , 2000, IWLCS.

[4]  Olgierd Unold,et al.  Context-free grammar induction with grammar-based classifier system , 2005 .

[5]  Stewart W. Wilson Generalization in the XCS Classifier System , 1998 .

[6]  Martin V. Butz,et al.  An Algorithmic Description of XCS , 2000, IWLCS.

[7]  Andrea Bonarini,et al.  Fuzzy and Crisp Representations of Real-Valued Input for Learning Classifier Systems , 1999, Learning Classifier Systems.

[8]  Mengjie Zhang,et al.  Reusing Building Blocks of Extracted Knowledge to Solve Complex, Large-Scale Boolean Problems , 2014, IEEE Transactions on Evolutionary Computation.

[9]  Mengjie Zhang,et al.  Learning feature fusion strategies for various image types to detect salient objects , 2016, Pattern Recognit..

[10]  Mengjie Zhang,et al.  XCSR with Computed Continuous Action , 2012, Australasian Conference on Artificial Intelligence.

[11]  Pier Luca Lanzi,et al.  An Analysis of Generalization in the XCS Classifier System , 1999, Evolutionary Computation.

[12]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[13]  Christopher Stone,et al.  For Real! XCS with Continuous-Valued Inputs , 2003, Evolutionary Computation.

[14]  Mengjie Zhang,et al.  Learning complex, overlapping and niche imbalance Boolean problems using XCS-based classifier systems , 2013, Evol. Intell..

[15]  Stewart W. Wilson Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[16]  Will N. Browne,et al.  Extending XCS with Cyclic Graphs for Scalability on Complex Boolean Problems , 2017, Evolutionary Computation.

[17]  Edmund K. Burke,et al.  Improving the scalability of rule-based evolutionary learning , 2009, Memetic Comput..

[18]  Xavier Llorà,et al.  XCS and GALE: A Comparative Study of Two Learning Classifier Systems on Data Mining , 2001, IWLCS.

[19]  Harith Alani,et al.  Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold , 2013, ESSEM@AI*IA.

[20]  Martin V. Butz,et al.  An algorithmic description of XCS , 2000, Soft Comput..

[21]  Tin Kam Ho,et al.  Domain of competence of XCS classifier system in complexity measurement space , 2005, IEEE Transactions on Evolutionary Computation.

[22]  Min Zhu,et al.  An XCS-based approach to classification learning on continuous-valued attributes , 2012, Proceedings of 2012 2nd International Conference on Computer Science and Network Technology.

[23]  Martin V. Butz Kernel-based, ellipsoidal conditions in the real-valued XCS classifier system , 2005, GECCO '05.

[24]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..