Efficient Clustering Algorithms in Educational Data Mining

Higher education institutions compete for excellence, and in doing so they use information technologies to gather the information needed to achieve it. Institutes are placing greater emphasis on meeting students’ academic needs, improving the quality of service provided to students, securing better placements, and strengthening their courses. Modern information technologies make it possible to store huge volumes of data, but data mining techniques are required to extract useful information and knowledge from that data. By applying techniques such as classification, association learning, and clustering, higher education institutes can, for example, uncover the correlation between specialization and chosen employment path; identify subjects, courses, and labs with a high degree of difficulty; and identify the interesting subjects, courses, labs, and facilities that might attract new students. This chapter explores efficient clustering algorithms in educational data mining.
