A Comprehensive Database Based on Multiple Data Sources to Facilitate Diagnosis of ASD

Autism spectrum disorder (ASD) is a neurodevelopmental disorder which has an increasing prevalence in children. ASD is clinically highly heterogeneous and lacks objective diagnostic criteria. In recent years, magnetic resonance imaging and genomics have been widely used in the diagnosis of ASD, and some valuable biomarkers have been found, which has improved people’s understanding of the neural and molecular development mechanism of ASD. However, most studies focus on limited data sources with lack of integration research, thus leading to inconsistent or biased results. In this paper, we design a compute-aided diagnosis framework based on multiple ASD-relevant data sources for purpose of distinguishing the ASD patients more accurately. We first establish a multiple data collection procedure from initial diagnosis to regular follow-up visits. Various medical big data including structured and unstructured forms are collected from different devices or protocols and then they are deposited and accessed based on Hadoop platform. Furthermore, we design a classification framework to identify ASD patients by integrating the complementary information from multiple data sources. Deep learning is used to extract features from each data source automatically, and then all extracted features are integrated by the multiple kernel learning method for improving the diagnostic accuracy of ASD.

[1]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[2]  Abraham Z. Snyder,et al.  Opposing Brain Differences in 16p11.2 Deletion and Duplication Carriers , 2014, The Journal of Neuroscience.

[3]  N. Minshew,et al.  Evidence for dysregulation of axonal growth and guidance in the etiology of ASD , 2013, Front. Hum. Neurosci..

[4]  T. Insel,et al.  Wesleyan University From the SelectedWorks of Charles A . Sanislow , Ph . D . 2010 Research Domain Criteria ( RDoC ) : Toward a New Classification Framework for Research on Mental Disorders , 2018 .

[5]  Siyu Han,et al.  Classifying Breast Cancer Subtypes Using Multiple Kernel Learning Based on Omics Data , 2019, Genes.

[6]  Yan Guo,et al.  Architectures and accuracy of artificial neural network for disease classification from omics data , 2019, BMC Genomics.

[7]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[8]  Daniel P. Kennedy,et al.  The Autism Brain Imaging Data Exchange: Towards Large-Scale Evaluation of the Intrinsic Brain Architecture in Autism , 2013, Molecular Psychiatry.

[9]  G. Baird,et al.  Autism spectrum disorders at 20 and 42 months of age: stability of clinical and ADI-R diagnosis. , 1999, Journal of child psychology and psychiatry, and allied disciplines.

[10]  M. Just,et al.  Cortical activation and synchronization during sentence comprehension in high-functioning autism: evidence of underconnectivity. , 2004, Brain : a journal of neurology.

[11]  Mehmet Gönen,et al.  Discriminating early- and late-stage cancers using multiple kernel learning on gene sets , 2018, Bioinform..

[12]  G. Baird,et al.  Psychiatric disorders in children with autism spectrum disorders: prevalence, comorbidity, and associated factors in a population-derived sample. , 2008, Journal of the American Academy of Child and Adolescent Psychiatry.

[13]  R. Hu Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) , 2003 .

[14]  Paul M. Thompson,et al.  Autism-Associated Promoter Variant in MET Impacts Functional and Structural Brain Networks , 2012, Neuron.

[15]  Wendy J. Ungar,et al.  National Database for Autism Research (NDAR): Big Data Opportunities for Health Services Research and Health Technology Assessment , 2016, PharmacoEconomics.

[16]  C. Lord,et al.  Standardizing ADOS Scores for a Measure of Severity in Autism Spectrum Disorders , 2009, Journal of autism and developmental disorders.

[17]  D. Pinto,et al.  The Autism Simplex Collection: an international, expertly phenotyped autism sample for genetic and phenotypic analyses , 2014, Molecular Autism.

[18]  Paul M. Thompson,et al.  Altered Structural Brain Connectivity in Healthy Carriers of the Autism Risk Gene, CNTNAP2 , 2011, Brain Connect..

[19]  Francis R. Bach,et al.  Consistency of the group Lasso and multiple kernel learning , 2007, J. Mach. Learn. Res..

[20]  S. Spence,et al.  The autism genetic resource exchange: a resource for the study of autism and related neuropsychiatric conditions. , 2001, American journal of human genetics.