Prediction models for high risk of suicide in Korean adolescents using machine learning techniques

Objective Suicide in adolescents is a major problem worldwide and previous history of suicide ideation and attempt represents the strongest predictors of future suicidal behavior. The aim of this study was to develop prediction model to identify Korean adolescents of high risk suicide (= who have history of suicide ideation/attempt in previous year) using machine learning techniques. Methods A nationally representative dataset of Korea Youth Risk Behavior Web-based Survey (KYRBWS) was used (n = 59,984 of middle and high school students in 2017). The classification process was performed using machine learning techniques such as logistic regression (LR), random forest (RF), support vector machine (SVM), artificial neural network (ANN), and extreme gradient boosting (XGB). Results A total of 7,443 adolescents (12.4%) had a previous history of suicidal ideation/attempt. In the multivariable analysis, sadness (odds ratio [OR], 6.41; 95% confidence interval [95% CI], 6.08–6.87), violence (OR, 2.32; 95% CI, 2.01–2.67), substance use (OR, 1.93; 95% CI, 1.52–2.45), and stress (OR, 1.63; 95% CI, 1.40–1.86) were associated factors. Taking into account 26 variables as predictors, the accuracy of models of machine learning techniques to predict the high-risk suicidal was comparable with that of LR; the accuracy was best in XGB (79.0%), followed by SVM (78.7%), LR (77.9%), RF (77.8%), and ANN (77.5%). Conclusions The machine leaning techniques showed comparable performance with LR to classify adolescents who have previous history of suicidal ideation/attempt. This model will hopefully serve as a foundation for decreasing future suicides as it enables early identification of adolescents at risk of suicide and modification of risk factors.

[1]  Soo Beom Choi,et al.  Ten-year prediction of suicide death using Cox regression and machine learning in a nationwide retrospective cohort study in South Korea. , 2018, Journal of affective disorders.

[2]  Ross Jacobucci,et al.  The use of machine learning in the study of suicidal and non-suicidal self-injurious thoughts and behaviors: A systematic review. , 2019, Journal of affective disorders.

[3]  Evan M. Kleiman,et al.  Risk Factors for Suicidal Thoughts and Behaviors: A Meta-Analysis of 50 Years of Research , 2017, Psychological bulletin.

[4]  Soon-Woo Park,et al.  Test-retest reliability of a questionnaire for the Korea Youth Risk Behavior Web-based Survey. , 2010, Journal of preventive medicine and public health = Yebang Uihakhoe chi.

[5]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[6]  Soo Beom Choi,et al.  Risk factors of suicide attempt among people with suicidal ideation in South Korea: a cross-sectional study , 2017, BMC Public Health.

[7]  Gilbert Ritschard,et al.  Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences , 2013 .

[8]  B. Mwangi,et al.  Identifying a clinical signature of suicidality among patients with mood disorders: A pilot study using a machine learning approach. , 2016, Journal of affective disorders.

[9]  S. Jee,et al.  Relationship between binge drinking experience and suicide attempts in Korean adolescents: based on the 2013 Korean Youth Risk Behavior Web-based Survey , 2018, Epidemiology and health.

[10]  K. Yun,et al.  Classification of Suicide Attempts through a Machine Learning Algorithm Based on Multiple Systemic Psychiatric Scales , 2017, Front. Psychiatry.

[11]  S. Moon,et al.  Risk Factors and Mediators of Suicidal Ideation Among Korean Adolescents , 2018, Crisis.

[12]  B. Löwe,et al.  Predicting suicidal ideation in primary care: An approach to identify easily assessable key variables. , 2018, General hospital psychiatry.

[13]  Kevin J. Grimm,et al.  A Simulation Study of the Ability of Growth Mixture Models to Uncover Growth Heterogeneity , 2013 .