An Improved Random Forest Algorithm for Predicting Employee Turnover

Employee turnover is considered a major problem for many organizations and enterprises. The problem is critical because it affects not only the sustainability of work but also the continuity of enterprise planning and culture. Therefore, human resource departments are paying greater attention to employee turnover seeking to improve their understanding of the underlying reasons and main factors. To address this need, this study aims to enhance the ability to forecast employee turnover and introduce a new method based on an improved random forest algorithm. The proposed weighted quadratic random forest algorithm is applied to employee turnover data with high-dimensional unbalanced characteristics. First, the random forest algorithm is used to order feature importance and reduce dimensions. Second, the selected features are used with the random forest algorithm and the F-measure values are calculated for each decision tree as weights to build the prediction model for employee turnover. In the area of employee turnover forecasting, compared with the random forest, C4.5, Logistic, BP, and other algorithms, the proposed algorithm shows significant improvement in terms of various performance indicators, specifically recall and F-measure. In the experiment using the employee dataset of a branch of a communications company in China, the key factors influencing employee turnover were identified as monthly income, overtime, age, distance from home, years at the company, and percent of salary increase. Among them, monthly income and overtime were the two most important factors. The study offers a new analytic method that can help human resource departments predict employee turnover more accurately and its experimental results provide further insights to reduce employee turnover intention.

[1]  L. Labrague,et al.  Factors influencing turnover intention among registered nurses in Samar Philippines. , 2018, Applied nursing research : ANR.

[2]  Xiaohui Peng,et al.  A novel random forests based class incremental learning method for activity recognition , 2018, Pattern Recognit..

[3]  Serafín Moral,et al.  Increasing diversity in random forest learning algorithm via imprecise probabilities , 2018, Expert Syst. Appl..

[4]  Ronei Jesus Poppi,et al.  Visible and near infrared spectroscopy coupled to random forest to quantify some soil quality parameters. , 2018, Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy.

[5]  George A. Barnett,et al.  Predicting Employee Turnover from Friendship Networks , 2008 .

[6]  Hyo Sun Jung,et al.  Error management culture and turnover intent among food and beverage employees in deluxe hotels: the mediating effect of job satisfaction , 2017 .

[7]  Jing Xia,et al.  Class Weights Random Forest Algorithm for Processing Class Imbalanced Medical Data , 2018, IEEE Access.

[8]  Frank McMahon,et al.  Labour turnover in London hotels and the cost effectiveness of preventative measures , 1992 .

[9]  Daryl R. Smith,et al.  The Relative Strength of Job Attitudes and Job Embeddedness in Predicting Turnover in a U.S. Military Academy , 2014 .

[10]  Steven L. Salzberg,et al.  Book Review: C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993 , 1994, Machine Learning.

[11]  Hadi Tabatabaee Malazi,et al.  Combining emerging patterns with random forest for complex activity recognition in smart homes , 2018, Applied Intelligence.

[12]  Xing Wu,et al.  Panoramic Crack Detection for Steel Beam Based on Structured Random Forests , 2018, IEEE Access.

[13]  Pozi Milow,et al.  Random forest and Self Organizing Maps application for analysis of pediatric fracture healing time of the lower limb , 2018, Neurocomputing.

[14]  R. Anitha,et al.  Development of computer‐aided approach for brain tumor detection using random forest classifier , 2018, Int. J. Imaging Syst. Technol..

[15]  Jean-Michel Poggi,et al.  Random Forests for Big Data , 2015, Big Data Res..

[16]  Talel Abdessalem,et al.  Adaptive random forests for evolving data stream classification , 2017, Machine Learning.

[17]  Andrea Fronzetti Colladon,et al.  Forecasting managerial turnover through e-mail based social network analysis , 2017, Comput. Hum. Behav..

[18]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.