Outliers Detection on Educational Data using Fuzzy Association Rule Mining

Most of the mining techniques have only concerned with interesting patterns. However, in the recent years, there is an increasing demand in mining Unexpected Items or Outliers or Rare Items. Several application domains have realized the direct mapping between outliers in data and real world anomalies that are of great interest to an analyst. Outliers represents semantically correct but infrequent situationin a database. Detecting outliers allows extracting useful and actionable knowledge to the domain experts. In Educational Data, outliers are those students who have secured scores deviated so much from the average scores of other students. The educational data are Quantitative in nature. Any mining technique on quantitative data will partition the quantitative attributes with unnatural boundaries which lead to overestimate or underestimate the boundary values. Fuzzy logic handles this in a more realistic way. Knowing the threshold values apriori is not possible, hence our method uses dynamically calculated Support and Rank measures rather than predefined values. Our method uses a modified Fuzzy Apriori Rare Item sets Mining (FARIM) algorithm to detect the outliers (weak student). This will help the teachers in giving extra coaching for the weak students.

[1]  Jie Chen,et al.  Mining Unexpected Temporal Associations: Applications in Detecting Adverse Drug Reactions , 2008, IEEE Transactions on Information Technology in Biomedicine.

[2]  R. Agarwal Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[3]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[4]  C. T. Dhanya,et al.  Data Mining for Evolving Fuzzy Association Rules for Predicting Monsoon Rainfall of India , 2009 .

[5]  Giulia Bruno,et al.  TOD: Temporal outlier detection by using quasi-functional temporal dependencies , 2010, Data Knowl. Eng..

[6]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[7]  Tzung-Pei Hong,et al.  A GA-based Fuzzy Mining Approach to Achieve a Trade-off Between Number of Rules and Suitability of Membership Functions , 2006, Soft Comput..

[8]  J. Kalita,et al.  Outlier Identification using Symmetric Neighborhoods , 2012 .

[9]  W. S. Chan,et al.  Diagnosing shocks in stock markets of southeast Asia, Australia, and New Zealand , 2002, Math. Comput. Simul..

[10]  Tzung-Pei Hong,et al.  Trade-off Between Computation Time and Number of Rules for Fuzzy Mining from Quantitative Data , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[11]  Cheng-Hsiung Weng,et al.  Mining fuzzy specific rare itemsets for education data , 2011, Knowl. Based Syst..

[12]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[13]  V. Radha,et al.  Enhanced Outlier Detection Method Using Association Rule Mining Technique , 2012 .

[14]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[15]  Carla E. Brodley,et al.  Anomaly Detection Using an Ensemble of Feature Models , 2010, 2010 IEEE International Conference on Data Mining.

[16]  Soung Hie Kim,et al.  Mining the change of customer behavior in an internet shopping mall , 2001, Expert Syst. Appl..

[17]  Yun Sing Koh,et al.  Rare Association Rule Mining via Transaction Clustering , 2008, AusDM.

[18]  Jie Chen,et al.  Signaling Potential Adverse Drug Reactions from Administrative Health Databases , 2010, IEEE Transactions on Knowledge and Data Engineering.

[19]  Deisy Chelliah,et al.  Temporal outlier detection on quantitative data using unexpectedness measure , 2012, 2012 12th International Conference on Intelligent Systems Design and Applications (ISDA).

[20]  Jesús Alcalá-Fdez,et al.  A Fuzzy Association Rule-Based Classification Model for High-Dimensional Problems With Genetic Rule Selection and Lateral Tuning , 2011, IEEE Transactions on Fuzzy Systems.

[21]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[22]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[23]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.