Generated rules for AIDS and e-learning classifier using rough set approach

The emergence and growth of internet usage has produced an extensive amount of data. These data contain a wealth of undiscovered, valuable information, but incomplete data sets may lead to observation error. This research explored a technique for analyzing data that transforms meaningless data into meaningful information. The work focused on Rough Set (RS) theory to deal with incomplete data and to derive rules. Rules generated by RS with high and low left-hand-side (LHS) support values were used as query statements to form clusters of data. The model was tested on an AIDS blog data set consisting of 146 bloggers and an E-Learning@UTM (EL) log data set comprising 23105 URLs. 5-fold and 10-fold cross validation were used to split the data. To compare the results, the Naive and Boolean algorithms were employed as discretization techniques, and Johnson’s algorithm (Johnson) and the Genetic algorithm (GA) as reduction techniques. 5-fold cross validation tended to suit the AIDS data well, while 10-fold cross validation was best for the EL data set. Johnson and GA yielded the same number of rules for both data sets. These findings provide evidence of the accuracy achieved using the proposed model.
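As an illustration of how a rule can act as a query statement, the following minimal sketch counts the LHS support of one IF-THEN rule (the number of records matching its antecedent) and returns the matching records as a cluster. It is not the authors' implementation: the attribute names, the toy records, and the helper functions `matching_rows` and `rule_stats` are hypothetical and chosen only for the example.

```python
from typing import Dict, List

Row = Dict[str, str]


def matching_rows(rows: List[Row], antecedent: Dict[str, str]) -> List[Row]:
    """Return the rows that satisfy every (attribute, value) condition
    in the rule's IF-part. These rows form the cluster for the rule."""
    return [
        row for row in rows
        if all(row.get(attr) == value for attr, value in antecedent.items())
    ]


def rule_stats(rows: List[Row], antecedent: Dict[str, str],
               decision_attr: str, decision_value: str) -> Dict[str, float]:
    """Compute LHS support, RHS support, and rule accuracy.

    LHS support: rows matching the antecedent.
    RHS support: matching rows that also carry the rule's decision value.
    Accuracy:    RHS support divided by LHS support.
    """
    matched = matching_rows(rows, antecedent)
    lhs = len(matched)
    rhs = sum(1 for row in matched if row[decision_attr] == decision_value)
    accuracy = rhs / lhs if lhs else 0.0
    return {"lhs_support": lhs, "rhs_support": rhs, "accuracy": accuracy}


# Hypothetical, already-discretized log records (not taken from the paper).
records = [
    {"visits": "high", "section": "forum", "active": "yes"},
    {"visits": "high", "section": "forum", "active": "yes"},
    {"visits": "high", "section": "quiz", "active": "no"},
    {"visits": "low", "section": "forum", "active": "no"},
    {"visits": "high", "section": "forum", "active": "no"},
]

# Hypothetical rule: IF visits = high AND section = forum THEN active = yes
rule_if = {"visits": "high", "section": "forum"}

cluster = matching_rows(records, rule_if)
stats = rule_stats(records, rule_if, "active", "yes")
print(stats["lhs_support"], stats["rhs_support"], round(stats["accuracy"], 3))
```

Under these assumptions, a rule with high LHS support selects a large cluster of records and a rule with low LHS support selects a small one, which is the sense in which rules serve as queries here.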
