A relative tolerance relation of rough set in incomplete information

University is an educational institution that has objectives to increase student retention and also to make sure students graduate on time. Student learning performance can be predicted using data mining techniques e.g. the application of finding essential association rules on student learning base on demographic data by the university in order to achieve these objectives. However, the complete data i.e. the dataset without missing values to generate interesting rules for the detection system, is the key requirement for any mining technique. Furthermore, it is problematic to capture complete information from the nature of student data, due to high computational time to scan the datasets. To overcome these problems, this paper introduces a relative tolerance relation of rough set (RTRS). The novelty of RTRS is that, unlike previous rough set approaches that use tolerance relation, non-symmetric similarity relation, and limited tolerance relation, it is based on limited tolerance relation by taking account into consideration the relatively precision between two objects and therefore this is the first work that uses relatively precision. Moreover, this paper presents the mathematical properties of the RTRS approach and compares the performance and the existing approaches by using real-world student dataset for classifying university’s student performance. The results show that the proposed approach outperformed the existing approaches in terms of computational time and accuracy.

[1]  Tutut Herawan,et al.  A Soft Set-based Co-occurrence for Clustering Web User Transactions , 2017 .

[2]  Iwan Tri Riyadi Yanto,et al.  Clustering Based on Classification Quality (CCQ) , 2016, SCDM.

[3]  Alexis Tsoukiàs,et al.  Incomplete Information Tables and Rough Classification , 2001, Comput. Intell..

[4]  Wang Guo,et al.  EXTENSION OF ROUGH SET UNDER INCOMPLETE INFORMATION SYSTEMS , 2002 .

[5]  Jing Zhang,et al.  EDUCATIONAL DATA MINING , 2016 .

[6]  Mohd Arfian Ismail,et al.  Coastal Ecosystem Responses to Human and Climatic Changes throughout Asia , 2018, Journal of Coastal Research.

[7]  Iwan Tri Riyadi Yanto,et al.  Attribute selection on student performance dataset using maximum dependency attribute , 2017, 2017 5th International Conference on Electrical, Electronics and Information Engineering (ICEEIE).

[8]  Koichi Yamada,et al.  Extended Tolerance Relation to Define a New Rough Set Model in Incomplete Information Systems , 2013, Adv. Fuzzy Syst..

[9]  Saurabh Pal,et al.  Mining Educational Data to Reduce Dropout Rates of Engineering Students , 2012 .

[10]  W. F. Punch,et al.  Predicting student performance: an application of data mining methods with an educational Web-based system , 2003, 33rd Annual Frontiers in Education, 2003. FIE 2003..

[11]  Hairulnizam Mahdin,et al.  Soft Set Approach for Clustering Graduated Dataset , 2016, SCDM.

[12]  Haruna Chiroma,et al.  Analysis of Parameterization Value Reduction of Soft Sets and its Algorithm , 2016 .

[13]  Haruna Chiroma,et al.  A Framework for Clustering of Web Users Transaction Based on Soft Set Theory , 2015, DaEng.

[14]  Usama M. Fayyad,et al.  Data Mining and Knowledge Discovery: Making Sense Out of Data , 1996, IEEE Expert.

[15]  Surjeet Kumar Yadav,et al.  Mining Education Data to Predict Student's Retention: A comparative Study , 2012, ArXiv.

[16]  Barbara A. Wasik,et al.  Preventing Early School Failure: Research, Policy, and Practice , 1993 .

[17]  Surjeet Kumar Yadav,et al.  Data Mining: A Prediction for Performance Improvement of Engineering Students using Classification , 2012, ArXiv.

[18]  K. Rajeswari,et al.  Predicting Students Academic Performance Using Education Data Mining , 2013 .

[19]  Carlos Márquez-Vera,et al.  Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced data , 2013, Applied Intelligence.

[20]  Marzena Kryszkiewicz,et al.  Rules in Incomplete Information Systems , 1999, Inf. Sci..

[21]  Alexis Tsoukiàs,et al.  On the Extension of Rough Sets under Incomplete Information , 1999, RSFDGrC.

[22]  Xibei Yang,et al.  Rough Set Model Based on Hybrid Tolerance Relation , 2012, RSKT.

[23]  Iwan Tri Riyadi Yanto,et al.  A Comparative Analysis of Rough Sets for Incomplete Information System in Student Dataset , 2017 .

[24]  Qing Zhou Research on Tolerance-Based Rough Set Models , 2010, 2010 International Conference on System Science, Engineering Design and Manufacturing Informatization.

[25]  Saima Anwar Lashari,et al.  A Numerical Classification Technique Based on Fuzzy Soft Set Using Hamming Distance , 2018, SCDM.

[26]  Qingshun Guo,et al.  An extension model of rough set in incomplete information system , 2010, 2010 2nd International Conference on Future Computer and Communication.

[27]  Gary Adamson,et al.  A Monte Carlo Examination of an MTMM Model With Planned Incomplete Data Structures , 2002 .

[28]  Sotiris B. Kotsiantis,et al.  PREDICTING STUDENTS' PERFORMANCE IN DISTANCE LEARNING USING MACHINE LEARNING TECHNIQUES , 2004, Appl. Artif. Intell..

[29]  O OgundeA.,et al.  A Data Mining System for Predicting University Students' Graduation Grades Using ID3 Decision Tree Algorithm , 2014 .

[30]  Marzena Kryszkiewicz,et al.  Rough Set Approach to Incomplete Information Systems , 1998, Inf. Sci..

[31]  Jerzy W. Grzymala-Busse,et al.  The Rule Induction System LERS-a version for personal computers in Foun-dations of Computing and Dec , 1993 .

[32]  Haruna Chiroma,et al.  An Intelligent Modeling of Oil Consumption , 2014, ISI.

[33]  Xiaoping Yang An Improved Model of Rough Sets on Incomplete Information Systems , 2009, 2009 International Conference on Management of e-Commerce and e-Government.

[34]  Edi Sutoyo,et al.  Fuzzy Soft Set for Rock Igneous Clasification , 2018, 2018 International Symposium on Advanced Intelligent Informatics (SAIN).

[35]  Xibei Yang,et al.  Generalisation of rough set for rule induction in incomplete system , 2011, Int. J. Granul. Comput. Rough Sets Intell. Syst..

[36]  Marina Dobrota,et al.  Data Mining Models for Prediction of Customers’ Satisfaction: The CART Analysis , 2014 .