Data Mining: a Magic Technology for College Recruitment

This paper introduces a case study using data mining techniques to assist higher education institutions in achieving enrollment goals. The introduction includes the general life cycle of a data mining project: business understanding, data understanding, data preparation, modeling, assessment, and deployment. The model fit statistics show that data mining techniques, neural network, decision tree, logistic regression, and ensemble models, are successful in this project. Ensemble and neural network are better. The study concludes that data mining is an effective technology for college recruitment and also for institutional research and analysis. Data mining, defined as “the process of sampling, exploring, modifying, modeling, and assessing large amounts of data to uncover previously unknown patterns” (SAS, 2005, p. 1-3), has been used widely in various areas such as science, engineering, business, banking, and even combating terrorism for a long time. Its success and effectiveness in discovering actionable information from large sets of data has been well established. Recently, there has been an increasing trend that data mining techniques were used in institutional research and analysis including, but not limited to, college admissions yield prediction (Chang, 2006), retention and graduation prediction (Herzop, 2006; Sujitparapitaya, 2006; Baily, 2006), time-to-degree analysis (Eykamp, 2006; Herzog, 2006), enrollment management (Aksenova, Zhang, & Lu, 2006; Luan, 2006), course scheduling and online course offerings (Luan, 2006; Dai, Yeh, & Lu, 2007), student performance assessment (Dede & Clarke, 2007; Heathcote & Dawson, 2005; Minaei-Bidgoli, 2004; Ogor, 2007), and survey analysis (Yu, et. al, 2007). These applications have showed great success of various mining approaches in extracting information from the enormous higher education data. The findings help institutions of higher education make more effective decisions as to improve the quality of instruction and services. These studies also provide the evidence proving that data mining techniques have many advantages compared to the conventional analytical approaches in predicting and scoring behaviors of individual students (Luan & Zhao, 2006). However, the agreement about the significance of its practical application and evidence of its advantages still need to be further developed theoretically as well as practically. This paper, therefore, introduces a data mining project about college recruitment. With the evidence showing that attaining some level of education beyond high school is

[1]  Chien-Yun Dai,et al.  Applying data mining technology to analyze user behavior in course website , 2007 .

[2]  Shane Dawson,et al.  Data Mining for Evaluation, Benchmarking and Reflective Practice in a LMS , 2005 .

[3]  Du Zhang,et al.  Enrollment Prediction through Data Mining , 2006, 2006 IEEE International Conference on Information Reuse & Integration.

[4]  Lin Chang,et al.  Applying Data Mining to Predict College Admissions Yield: A Case Study. , 2006 .

[5]  Serge Herzog,et al.  Estimating Student Retention and Degree-Completion Time: Decision Trees and Neural Networks Vis-a-Vis Regression. , 2006 .

[6]  Jing Luan Using Academic Behavior Index (AB-Index) to Develop a Learner Typology for Managing Enrollment and Course Offerings--A Data Mining Approach. IR Applications. Volume 10. , 2006 .

[7]  Brenda L. Bailey Let the Data Talk: Developing Models to Explain IPEDS Graduation Rates. , 2006 .

[8]  E.N. Ogor Student Academic Performance Monitoring and Evaluation Using Data Mining Techniques , 2007, Electronics, Robotics and Automotive Mechanics Conference (CERMA 2007).

[9]  William F. Punch,et al.  Data mining for a web-based educational system , 2005 .

[10]  Sutee Sujitparapitaya,et al.  Considering student mobility in retention outcomes , 2006 .

[11]  Jing Luan,et al.  Practicing Data Mining for Enrollment Management and Beyond. , 2006 .

[12]  H. K. D. H. Bhadeshia,et al.  Neural Networks in Materials Science , 1999 .

[13]  Patrick J. Kelly,et al.  As America Becomes More Diverse: The Impact of State Higher Education Inequality , 2005 .

[14]  Chong Ho Yu,et al.  A data visualization and data mining approach to response and non-response analysis in survey research , 2007 .