BHARAT: An Integrated Big Data Analytic Model for Early Diagnostic Biomarker of Alzheimer's Disease

Alzheimer's disease (AD) is a devastating neurodegenerative disorder affecting millions of people worldwide. Progressive and relentless efforts are being made for therapeutic development by way of advancing understanding of non-invasive imaging modalities for the causal molecular process of AD. We present a Hadoop-based big data framework integrating non-invasive magnetic resonance imaging (MRI), MR spectroscopy (MRS) as well as neuropsychological test outcomes to identify early diagnostic biomarkers of AD. This big data framework for AD incorporates the three “V”s (volume, variety, velocity) with advanced data mining, machine learning, and statistical modeling algorithms. A large volume of longitudinal information from non-invasive imaging modalities with colligated parametric variety and speed for both data acquisition and processing as velocity complete the fundamental requirements of this big data framework for early AD diagnosis. Brain structural, neurochemical, and behavioral features are extracted from MRI, MRS, and neuropsychological scores, respectively. Subsequently, feature selection and ensemble-based classification are proposed and their outputs are fused based on the combination rule for final accurate classification and validation from clinicians. A multi-modality-based decision framework (BHARAT) for classification of early AD will be immensely helpful.

[1]  K. Shadan,et al.  Available online: , 2012 .

[2]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[3]  Nicola Toschi,et al.  Identification of Mild Alzheimer's Disease through automated classification of structural MRI features , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[4]  H. Braak,et al.  Neuropathology of Alzheimer’s disease: what is new since A. Alzheimer? , 1999, European Archives of Psychiatry and Clinical Neuroscience.

[5]  R. Reitan Validity of the Trail Making Test as an Indicator of Organic Brain Damage , 1958 .

[6]  Abdul Ghaaliq Lalkhen,et al.  Clinical tests: sensitivity and specificity , 2008 .

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  Mahadev Konar,et al.  ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.

[9]  K. Krishnan,et al.  Prediction of cognitive decline in early Alzheimer's disease , 1998, The Lancet.

[10]  S. H. Hojjati,et al.  Predicting conversion from MCI to AD using resting-state fMRI, graph theoretical approach and SVM , 2017, Journal of Neuroscience Methods.

[11]  T. Kurosaki,et al.  Measurement of functional activities in older adults in the community. , 1982, Journal of gerontology.

[12]  James G. Shanahan,et al.  Large Scale Distributed Data Science using Apache Spark , 2015, KDD.

[13]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[14]  V. Leirer,et al.  Development and validation of a geriatric depression screening scale: a preliminary report. , 1982, Journal of psychiatric research.

[15]  Jongin Kim,et al.  Identification of Alzheimer's disease and mild cognitive impairment using multimodal sparse hierarchical extreme learning machine , 2018, Human brain mapping.

[16]  Carlo Curino,et al.  Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.

[17]  R. Katzman.,et al.  Pathological verification of ischemic score in differentiation of dementias , 1980, Annals of neurology.

[18]  P. Vemuri,et al.  Assessing subtle structural changes in Alzheimer's disease patients. , 2011, Methods in molecular biology.

[19]  P. Gurevich,et al.  Neuropsychological Testing and Machine Learning Distinguish Alzheimer’s Disease from Other Causes for Cognitive Impairment , 2017, Front. Aging Neurosci..

[20]  Lars George,et al.  HBase - The Definitive Guide: Random Access to Your Planet-Size Data , 2011 .

[21]  P. Mandal,et al.  Brain Metabolic, Structural, and Behavioral Pattern Learning for Early Predictive Diagnosis of Alzheimer's Disease. , 2018, Journal of Alzheimer's disease : JAD.

[22]  Andreas Neumann,et al.  Oozie: towards a scalable workflow management system for Hadoop , 2012, SWEET '12.

[23]  M. Esiri,et al.  Alzheimer's disease Correlation of cortical choline acetyltransferase activity with the severity of dementia and histological abnormalities , 1982, Journal of the Neurological Sciences.

[24]  Ricardo Nitrini,et al.  NeuroImage: Clinical , 2022 .

[25]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[26]  Paulo J. G. Lisboa,et al.  From raw data to data-analysis for magnetic resonance spectroscopy – the missing link: jMRUI2XML , 2015, BMC Bioinformatics.

[27]  Abbas Babajani-Feremi,et al.  Classification of patients with MCI and AD from healthy controls using directed graph measures of resting-state fMRI , 2017, Behavioural Brain Research.

[28]  Manjari Tripathi,et al.  Brain oxidative stress: detection and mapping of anti-oxidant marker 'Glutathione' in different brain regions of healthy male/female, MCI and Alzheimer patients using non-invasive magnetic resonance spectroscopy. , 2012, Biochemical and biophysical research communications.

[29]  Manjari Tripathi,et al.  Brain Glutathione Levels – A Novel Biomarker for Mild Cognitive Impairment and Alzheimer’s Disease , 2015, Biological Psychiatry.

[30]  Ameet Talwalkar,et al.  MLlib: Machine Learning in Apache Spark , 2015, J. Mach. Learn. Res..

[31]  A. Lobo,et al.  [Cognocitive mini-test (a simple practical test to detect intellectual changes in medical patients)]. , 1979, Actas luso-espanolas de neurologia, psiquiatria y ciencias afines.

[32]  Marco Eichelberg,et al.  Ten years of medical imaging standardization and prototypical implementation: the DICOM standard and the OFFIS DICOM toolkit (DCMTK) , 2004, SPIE Medical Imaging.

[33]  Reynold Xin,et al.  Apache Spark , 2016 .

[34]  R. Castellani,et al.  Alzheimer disease. , 2010, Disease-a-month : DM.

[35]  David L Streiner,et al.  What's under the ROC? An Introduction to Receiver Operating Characteristics Curves , 2007, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[36]  Ignace Lemahieu,et al.  XMedCon: an open-source medical image conversion toolkit , 2003 .

[37]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[38]  Han Liu,et al.  Challenges of Big Data Analysis. , 2013, National science review.