Predicting Fluid Intelligence in Adolescent Brain MRI Data: An Ensemble Approach

Decoding fluid intelligence from brain MRI data in adolescents is a highly challenging task. In this study, we took part in the ABCD Neurocognitive Prediction (NP) Challenge 2019, in which a large set of T1-weighted magnetic resonance imaging (MRI) data and pre-residualized fluid intelligence scores (corrected for brain volume, data collection site and sociodemographic variables) of children between 9–11 years were provided (\(N=3739\) for training, \(N=415\) for validation and \(N=4516\) for testing). We propose here the Caruana Ensemble Search method to choose best performing models over a large and diverse set of candidate models. These candidate models include convolutional neural networks (CNNs) applied to brain areas considered to be relevant in fluid intelligence (e.g. frontal and parietal areas) and high-performing standard machine learning methods (namely support vector regression, random forests, gradient boosting and XGBoost) applied to region-based scores including volume, mean intensity and count of gray matter voxels. To further create diversity and increase robustness, a wide set of hyperparameter configurations for each of the models was used. On the validation and the hold out test data, we obtained a mean squared error (MSE) of 71.15 and 93.68 respectively (rank 12 out of 24, MSE range 92.13–102.25). Among most selected models were XGBoost together with the three region-based scores, the other regression models together with volume or CNNs based on the middle frontal gyrus. We discuss these results in light of previous research findings on fluid intelligence.

[1]  R. Haier,et al.  The Parieto-Frontal Integration Theory (P-FIT) of intelligence: Converging neuroimaging evidence , 2007, Behavioral and Brain Sciences.

[2]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[3]  U. Goswami Analogical reasoning in children , 1993 .

[4]  Adolf Pfefferbaum,et al.  Altered Brain Developmental Trajectories in Adolescents After Initiating Drinking. , 2017, The American journal of psychiatry.

[5]  Gerald Schaefer,et al.  Cost-sensitive decision tree ensembles for effective imbalanced classification , 2014, Appl. Soft Comput..

[6]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[7]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[8]  Jerzy Stefanowski,et al.  Neighbourhood sampling in bagging for imbalanced data , 2015, Neurocomputing.

[9]  L. Gottfredson Why g matters: The complexity of everyday life , 1997 .

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  Anders Krogh,et al.  Learning with ensembles: How overfitting can be useful , 1995, NIPS.

[12]  O. Bertrand,et al.  Cross-Frequency Coupling in Parieto-Frontal Oscillatory Networks During Motor Imagery Revealed by Magnetoencephalography , 2009, Front. Neurosci..

[13]  Konstantinos Kamnitsas,et al.  Ensembles of Multiple Models and Architectures for Robust Brain Tumour Segmentation , 2017, BrainLes@MICCAI.

[14]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[15]  Bartosz Krawczyk,et al.  Learning from imbalanced data: open challenges and future directions , 2016, Progress in Artificial Intelligence.

[16]  Gaurav Pandey,et al.  A Comparative Analysis of Ensemble Classifiers: Case Studies in Genomics , 2013, 2013 IEEE 13th International Conference on Data Mining.

[17]  Jerry Slotkin,et al.  VIII. NIH Toolbox Cognition Battery (CB): composite scores of crystallized, fluid, and overall cognition. , 2013, Monographs of the Society for Research in Child Development.

[18]  Anders M. Dale,et al.  The Adolescent Brain Cognitive Development (ABCD) study: Imaging acquisition across 21 sites , 2018, Developmental Cognitive Neuroscience.

[19]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[20]  Bart Baesens,et al.  Benchmarking Classification Models for Software Defect Prediction: A Proposed Framework and Novel Findings , 2008, IEEE Transactions on Software Engineering.

[21]  R. Cattell Intelligence : its structure, growth and action , 1987 .

[22]  Ulrike Basten,et al.  Where smart brains are different: A quantitative meta-analysis of functional and structural brain imaging studies on intelligence , 2015 .

[23]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[25]  Alan C. Evans,et al.  Intellectual ability and cortical development in children and adolescents , 2006, Nature.

[26]  Emilio Ferrer,et al.  Fluid Reasoning and the Developing Brain , 2009, Frontiers in neuroscience.

[27]  Yalin Wang,et al.  Hippocampal Structure and Human Cognition: Key Role Of , 2022 .

[28]  Susanne M. Jaeggi,et al.  Improving fluid intelligence with training on working memory: a meta-analysis , 2008, Psychonomic Bulletin & Review.

[29]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[30]  Arthur W. Toga,et al.  Construction of a 3D probabilistic atlas of human cortical structures , 2008, NeuroImage.

[31]  Rich Caruana,et al.  Getting the Most Out of Ensemble Selection , 2006, Sixth International Conference on Data Mining (ICDM'06).