Supporting diagnosis of attention-deficit hyperactive disorder with novelty detection

OBJECTIVE Computerized continuous performance test (CPT) is a widely used diagnostic tool for attention-deficit hyperactivity disorder (ADHD). It measures the number of correctly detected stimuli as well as response times. Typically, when calculating a cut-off score for discriminating between normal and abnormal, only the normal children's data are collected. Then the average and standard deviation of each measure or variable is computed. If any of variables is larger than 2 sigma above the average, that child is diagnosed as abnormal. We will call this approach as "T-score 70" classifier. However, its performance has a lot to be desired due to a high false negative error. In order to improve the classification accuracy we propose to use novelty detection approaches for supporting ADHD diagnosis. METHODS AND MATERIAL Novelty detection is a model building framework where a classifier is constructed using only one class of training data and a new input pattern is classified according to its similarity to the training data. A total of eight novelty detectors are introduced and applied to our ADHD datasets collected from two modes of tests, visual and auditory. They are evaluated and compared with the T-score model on validation datasets in terms of false positive and negative error rates, and area under receiver operating characteristics curve (AuROC). RESULTS Experimental results show that the cut-off score of 70 is suboptimal which leads to a low false positive error but a very high false negative error. A few novelty detectors such as Parzen density estimators yield much more balanced classification performances. Moreover, most novelty detectors outperform the T-score method for most age groups statistically with a significance level of 1% in terms of AuROC. In particular, we recommend the Parzen and Gaussian density estimators, kernel principal component analysis, one-class support vector machine, and K-means clustering novelty detector which can improve upon the T-score method on average by at least 30% for the visual test and 40% for the auditory test. In addition, their performances are relatively stable over various parameter values as long as they are within reasonable ranges. CONCLUSION The proposed novelty detection approaches can replace the T-score method which has been considered the "gold standard" for supporting ADHD diagnosis. Furthermore, they can be applied to other psychological tests where only normal data are available.

[1]  Jeffrey M. Halperin,et al.  Assessment of the Continuous Performance Test: Reliability and validity in a nonreferred sample. , 1991 .

[2]  Christopher M. Bishop,et al.  Novelty detection and neural network validation , 1994 .

[3]  J. Biederman,et al.  Attention-deficit/hyperactivity disorder: a life-span perspective. , 1998, The Journal of clinical psychiatry.

[4]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[5]  M S Buchsbaum,et al.  Effects of amphetamine on vigilance performance in normal and hyperactive children , 1980, Journal of abnormal child psychology.

[6]  Eric Taylor,et al.  Child and adolescent psychiatry : modern approaches , 1985 .

[7]  M H Boyle,et al.  Correlates, associated impairments and patterns of service utilization of children with attention deficit disorder: findings from the Ontario Child Health Study. , 1989, Journal of child psychology and psychiatry, and allied disciplines.

[8]  D. Cantwell,et al.  Comorbidity in ADHD: implications for research, practice, and DSM-V. , 1997, Journal of the American Academy of Child and Adolescent Psychiatry.

[9]  Heiko Hoffmann,et al.  Kernel PCA for novelty detection , 2007, Pattern Recognit..

[10]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[11]  Sung-Zoon Cho,et al.  A STUDY OF THE DEVELOPMENT AND STANDARDIZATION OF ADHD DIAGNOSTIC SYSTEM , 2000 .

[12]  Joseph Biederman,et al.  Intellectual performance and school failure in children with attention deficit hyperactivity disorder and in their siblings. , 1993, Journal of abnormal psychology.

[13]  Paolo Frasconi,et al.  Learning in multilayered networks used as autoassociators , 1995, IEEE Trans. Neural Networks.

[14]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[15]  Michael Brady,et al.  Novelty detection for the identification of masses in mammograms , 1995 .

[16]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[17]  A. Anastasi Psychological testing, 6th ed. , 1988 .

[18]  T. Achenbach,et al.  "Comorbidity" in Child and Adolescent Psychiatry: Categorical and Quantitative Perspectives , 1990 .

[19]  I. Jolliffe Principal Component Analysis , 2002 .

[20]  Gunter Ritter,et al.  Outliers in statistical pattern recognition and an application to automatic chromosome classification , 1997, Pattern Recognit. Lett..

[21]  Sameer Singh,et al.  Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..

[22]  Jong-Heun Kim,et al.  DIAGNOSTIC CLASSIFICATION AND ASSESSMENT OF PSYCHIATRICALLY REFERRED CHILDREN WITH INATTENTION OR HYPERACTIVITY , 1996 .

[23]  A. C. Israel,et al.  Behavior Disorders of Childhood , 1984 .

[24]  Sameer Singh,et al.  Novelty detection: a review - part 2: : neural network based approaches , 2003, Signal Process..

[25]  N J Pizzi,et al.  EvIdent(TM): a functional magnetic resonance image analysis system , 2001, Artif. Intell. Medicine.

[26]  Caroline Sabin,et al.  Medical Statistics at a Glance , 2000 .

[27]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[28]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[29]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[30]  H. Bourlard,et al.  Auto-association by multilayer perceptrons and singular value decomposition , 1988, Biological Cybernetics.

[31]  M. Kramer Nonlinear principal component analysis using autoassociative neural networks , 1991 .

[32]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[33]  David M. J. Tax,et al.  One-class classification , 2001 .

[34]  Sungzoon Cho,et al.  Application of LVQ to novelty detection using outlier training data , 2006, Pattern Recognit. Lett..

[35]  J. Biederman,et al.  Comorbidity of attention deficit hyperactivity disorder with conduct, depressive, anxiety, and other disorders. , 1991, The American journal of psychiatry.

[36]  H E ROSVOLD,et al.  A continuous performance test of brain damage. , 1956, Journal of consulting psychology.