Applied machine learning to identify differential risk groups underlying externalizing and internalizing problem behaviors trajectories: A case study using a cohort of Asian American children

Background Internalizing and externalizing problems account for over 75% of the mental health burden in children and adolescents in the US, with higher burden among minority children. While complex interactions of multilevel factors are associated with these outcomes and may enable early identification of children in higher risk, prior research has been limited by data and application of traditional analysis methods. In this case example focused on Asian American children, we address the gap by applying data-driven statistical and machine learning methods to study clusters of mental health trajectories among children, investigate optimal predictions of children at high-risk cluster, and identify key early predictors. Methods Data from the US Early Childhood Longitudinal Study 2010–2011 were used. Multilevel information provided by children, families, teachers, schools, and care-providers were considered as predictors. Unsupervised machine learning algorithm was applied to identify groups of internalizing and externalizing problems trajectories. For prediction of high-risk group, ensemble algorithm, Superlearner, was implemented by combining several supervised machine learning algorithms. Performance of Superlearner and candidate algorithms, including logistic regression, was assessed using discrimination and calibration metrics via crossvalidation. Variable importance measures along with partial dependence plots were utilized to rank and visualize key predictors. Findings We found two clusters suggesting high- and low-risk groups for both externalizing and internalizing problems trajectories. While Superlearner had overall best discrimination performance, logistic regression had comparable performance for externalizing problems but worse for internalizing problems. Predictions from logistic regression were not well calibrated compared to those from Superlearner, however they were still better than few candidate algorithms. Important predictors identified were combination of test scores, child factors, teacher rated scores, and contextual factors, which showed non-linear associations with predicted probabilities. Conclusions We demonstrated the application of data-driven analytical approach to predict mental health outcomes among Asian American children. Findings from the cluster analysis can inform critical age for early intervention, while prediction analysis has potential to inform intervention programing prioritization decisions. However, to better understand external validity, replicability, and value of machine learning in broader mental health research, more studies applying similar analytical approach is needed.

[1]  G. Perrault Bureau , 2021, La boussole du confiné.

[2]  S. Woltering,et al.  Trajectories of pure and co-occurring internalizing and externalizing problems from early childhood to adolescence: Associations with early childhood individual and contextual antecedents. , 2020, Developmental psychology.

[3]  Steven W. Evans,et al.  Community-Based Prevalence of Externalizing and Internalizing Disorders among School-Aged Children and Adolescents in Four Geographically Dispersed School Districts in the United States , 2020, Child psychiatry and human development.

[4]  P. Lichtenstein,et al.  Predicting mental health problems in adolescence using machine learning techniques , 2020, PloS one.

[5]  Vichet Chhuon,et al.  Making Visible the Invisible: Multistudy Investigation of Disproportionate Special Education Identification of U.S. Asian American and Pacific Islander Students , 2020 .

[6]  J. Alvidrez,et al.  The National Institute on Minority Health and Health Disparities Research Framework , 2019, American journal of public health.

[7]  M. Bornstein,et al.  Parenting, culture, and the development of externalizing behaviors from age 7 to 14 in nine countries , 2018, Development and Psychopathology.

[8]  Xiaxi Li,et al.  Psychological assessment of children and adolescents with obesity , 2017, The Journal of international medical research.

[9]  R. G. Barajas-Gonzalez,et al.  Cultural Adaptation, Parenting and Child Mental Health Among English Speaking Asian American Immigrant Families , 2017, Child psychiatry and human development.

[10]  Cassandra S Diep,et al.  Acculturation and weight change in Asian-American children: Evidence from the ECLS-K:2011. , 2017, Preventive medicine.

[11]  Ivonne P. M. Derks,et al.  The association between overweight and internalizing and externalizing behavior in early childhood. , 2016, Social science & medicine.

[12]  Michelle Najarian,et al.  Early Childhood Longitudinal Study, Kindergarten Class of 2010-11 (ECLS-K:2011). User's Manual for the ECLS-K:2011 Kindergarten Data File and Electronic Codebook, Public Version. NCES 2015-074. , 2015 .

[13]  B. Ferguson,et al.  Health for the world's adolescents: a second chance in the second decade. , 2015, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[14]  E J Costello,et al.  A heavy burden on young minds: the global burden of mental and substance use disorders in children and youth , 2014, Psychological Medicine.

[15]  Sabrina Cheng,et al.  Applying Public Health Frameworks to Advance the Promotion of Mental Health Among Asian American Children. , 2014, Asian American journal of psychology.

[16]  Bernadette A. Thomas,et al.  Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[17]  Sabrina Cheng,et al.  Symptoms of Anxiety and Associated Risk and Protective Factors in Young Asian American Children , 2012, Child psychiatry and human development.

[18]  Sabrina Cheng,et al.  Physical and mental health disparities among young children of Asian immigrants. , 2012, The Journal of pediatrics.

[19]  T. Mirzoev,et al.  Mental health policy process: a comparative study of Ghana, South Africa, Uganda and Zambia , 2010, International journal of mental health systems.

[20]  Mark J. van der Laan,et al.  Super Learner In Prediction , 2010 .

[21]  G. Farkas,et al.  Kindergarten Predictors of Recurring Externalizing and Internalizing Psychopathology in the Third and Fifth Grades , 2009 .

[22]  Ronald C Kessler,et al.  National comorbidity survey replication adolescent supplement (NCS-A): III. Concordance of DSM-IV/CIDI diagnoses with clinical reassessments. , 2009, Journal of the American Academy of Child and Adolescent Psychiatry.

[23]  M. Stoolmiller,et al.  The Nature and Predictors of Undercontrolled and Internalizing Problem Trajectories Across Early Childhood , 2009, Journal of abnormal child psychology.

[24]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[25]  R. Engels,et al.  Body mass index and body weight perception as risk factors for internalizing and externalizing problem behavior among adolescents. , 2006, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[26]  M. Deković,et al.  Relations between parenting and externalizing and internalizing problem behaviour in early adolescence: child behaviour as moderator and predictor. , 2006, Journal of adolescence.

[27]  A. Sourander,et al.  Childhood predictors of externalizing and internalizing problems in adolescence , 2005, European Child & Adolescent Psychiatry.

[28]  Cécile Proust-Lima,et al.  Estimation of linear mixed models with a mixture of distribution for the random effects , 2005, Comput. Methods Programs Biomed..

[29]  Elvira Germino Hausken,et al.  Early Childhood Longitudinal Study, Kindergarten Class of 1998-99 (ECLS-K) Psychometric Report for the Third Grade , 2005 .

[30]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[31]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[32]  H. Koot,et al.  The normative development of child and adolescent problem behavior. , 2003, Journal of abnormal psychology.

[33]  D. Fergusson,et al.  Life course outcomes of young people with anxiety disorders in adolescence. , 2001, Journal of the American Academy of Child and Adolescent Psychiatry.

[34]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[35]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[36]  R. Dahl,et al.  Unipolar depression in adolescents: clinical outcome in adulthood. , 1995, Journal of the American Academy of Child and Adolescent Psychiatry.

[37]  S. Hinshaw,et al.  Internalizing and Externalizing Behavior Problems in Preschool Children: Correspondence Among Parent and Teacher Ratings and Behavior Observations , 1992 .

[38]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.