The AI Neuropsychologist: Automatic scoring of memory deficits with deep learning

Memory deficits are a hallmark of many different neurological and psychiatric conditions. The Rey-Osterrieth complex neuropsychologists across the globe to assess the degree of non-verbal visual memory deterioration. To obtain a score, a trained clinician inspects a patient’s ROCF drawing and quantifies deviations from the original figure. This manual procedure is time-consuming, slow and scores vary depending on the clinician’s experience, motivation and tiredness. Here, we leverage novel deep learning architectures to automatize the rating of memory deficits. For this, a multi-head convolutional neural network was trained on 20225 ROCF drawings. Unbiased ground truth ROCF scores were obtained from crowdsourced human intelligence. The neural network outperforms both online raters and clinicians. Our AI-powered scoring system provides healthcare institutions worldwide with a digital tool to assess objectively, reliably and time-efficiently the performance in the ROCF test from hand-drawn images. In this study, we developed an AI-based scoring system for a nonverbal visuo-spatial memory test that is being utilized in clinics around the world on a daily basis. For this, we trained two variations of deep learning systems and used a rich data set of 20’225 hand-drawn ROCF images covering the entire life span that was collected from six different research and clinical environments. By leveraging human crowdsourcing we obtained unbiased high-precision training labels. Our best model results from combining a multihead convolutional neural network for regression with a network for multilabel classification. Data and test time augmentation were used to improve the accuracy and made our model more robust against geometric transformations like rotations and changes in perspective.

[1]  Eric W.T. Ngai,et al.  Deep learning in computer vision: A critical review of emerging techniques and application scenarios , 2021 .

[2]  Supplemental Material for Validation of an Automated Scoring Program for a Digital Complex Figure Copy Task Within Healthy Aging and Stroke , 2021, Neuropsychology.

[3]  M. Rabuffetti,et al.  Automated scoring for a Tablet-based Rey Figure copy task differentiates constructional, organisational, and motor abilities , 2021, Scientific Reports.

[4]  D. Shen,et al.  A deep learning system for detecting diabetic retinopathy across the disease spectrum , 2021, Nature Communications.

[5]  Hassan Rivaz,et al.  Two-stage ultrasound image segmentation using U-Net and test time augmentation , 2020, International Journal of Computer Assisted Radiology and Surgery.

[6]  C. Gillebert,et al.  Non-Spatial Impairments Affect False-Positive Neglect Diagnosis Based on Cancellation Tasks , 2020, Journal of the International Neuropsychological Society.

[7]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[8]  Reka Hollandi,et al.  Test-time augmentation for deep learning-based cell segmentation on microscopy images , 2019, Scientific Reports.

[9]  R. Dotsch,et al.  Automated scoring of the Rey-Osterrieth Complex Figure Test using a deep-learning algorithm , 2019, Archives of Clinical Neuropsychology.

[10]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[11]  Khurram Khurshid,et al.  Classification of Graphomotor Impressions Using Convolutional Neural Networks: An Application to Automated Neuro-Psychological Screening Tests , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[12]  Suresh Venkatasubramanian,et al.  Runaway Feedback Loops in Predictive Policing , 2017, FAT.

[13]  P. Lakhani,et al.  Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks. , 2017, Radiology.

[14]  Marley W. Watkins,et al.  The reliability of multidimensional neuropsychological measures: from alpha to omega , 2017, The Clinical neuropsychologist.

[15]  Rossitza Setchi,et al.  Clock Drawing Test Digit Recognition Using Static and Dynamic Features , 2016, KES.

[16]  J. Rehm,et al.  The economic costs of mental disorders , 2016, EMBO reports.

[17]  L. Rabin,et al.  Stability in Test-Usage Practices of Clinical Neuropsychologists in the United States and Canada Over a 10-Year Period: A Follow-Up Survey of INS and NAN Members. , 2016, Archives of clinical neuropsychology : the official journal of the National Academy of Neuropsychologists.

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  E. Hing,et al.  Trends in electronic health record system use among office-based physicians: United States, 2007-2012. , 2014, National health statistics reports.

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Ellen Yi-Luen Do,et al.  Computational clock drawing analysis for cognitive impairment screening , 2010, TEI.

[22]  Jun Soo Kwon,et al.  Clinical and empirical applications of the Rey–Osterrieth Complex Figure Test , 2006, Nature Protocols.

[23]  J. Hodges,et al.  Mild cognitive impairment: applicability of research criteria in a memory clinic and characterization of cognitive profile , 2006, Psychological Medicine.

[24]  M. Gagnon,et al.  Explicit and Objective Scoring Criteria for the Taylor Complex Figure Test , 2004, Journal of clinical and experimental neuropsychology.

[25]  J. Becker,et al.  Normative Data for Determining Significance of Test–Retest Differences on Eight Common Neuropsychological Instruments , 2004, The Clinical neuropsychologist.

[26]  Manfred Spitzer,et al.  Neuropsychological Assessment in Clinical Practice: A Guide to Test Interpretation and Integration , 2002 .

[27]  R. Stern,et al.  The Boston Qualitative Scoring System as a Measure of Executive Functioning in Rey-Osterrieth Complex Figure Performance , 2000, Journal of clinical and experimental neuropsychology.

[28]  Andrew M. Tyrrell,et al.  Automated scoring of a neuropsychological test: the Rey Osterrieth complex figure , 2000, Proceedings of the 26th Euromicro Conference. EUROMICRO 2000. Informatics: Inventing the Future.

[29]  N. Denburg,et al.  Application of psychometric standards to scoring system evaluation: is "new" necessarily "improved"? , 1996, Journal of clinical and experimental neuropsychology.

[30]  J. Meyers,et al.  Rey complex figure: memory error patterns and functional abilities. , 1996, Applied neuropsychology.

[31]  L. Tupler,et al.  Reliability of the Rey-Osterrieth Complex Figure in use with memory-impaired patients. , 1995, Journal of clinical and experimental neuropsychology.

[32]  G. Wells,et al.  Clock Drawing: A Neuropsychological Analysis , 1995 .

[33]  F. Schmitt,et al.  Rey-Osterrieth complex figure: Psychometric characteristics in a geriatric sample , 1991 .

[34]  M. Franzen Reliability and Validity in Neuropsychological Assessment , 1989, Critical Issues in Neuropsychology.

[35]  Marianne Regard,et al.  Children's Production on Verbal and Non-Verbal Fluency Tasks , 1982, Perceptual and motor skills.

[36]  Lipo Wang,et al.  Deep Learning Applications in Medical Image Analysis , 2018, IEEE Access.

[37]  L. Trojano,et al.  Drawing Disorders in Alzheimer's Disease and Other Forms of Dementia. , 2016, Journal of Alzheimer's disease : JAD.

[38]  L. Rabin,et al.  Assessment practices of clinical neuropsychologists in the United States and Canada: a survey of INS, NAN, and APA Division 40 members. , 2005, Archives of clinical neuropsychology : the official journal of the National Academy of Neuropsychologists.

[39]  G. Groth-Marnat Neuropsychological assessment in clinical practice: A guide to test interpretation and integration. , 2000 .

[40]  M. Franzen Validity as Applied to Neuropsychological Assessment , 2000 .

[41]  J. Meyers,et al.  Detection of malingerers using the Rey Complex Figure and Recognition Trial. , 1999, Applied neuropsychology.

[42]  P. Osterrieth,et al.  Le test de copie d'une figure complexe; contribution à l'étude de la perception et de la mémoire. , 1944 .