Feature Learning Using Stacked Autoencoder for Shared and Multimodal Fusion of Medical Images

In recent years, deep learning has become a powerful tool for medical image analysis mainly because of their ability to automatically extract more abstract features from large training data. The current methods used for multiple modalities are mostly conventional machine learning, in which people use the handcrafted feature, which is very difficult to construct for large training sizes. Deep learning which is an advancement in the machine learning automatically extracts relevant features from the data. In this paper, we have used deep learning model for the multimodal data. The basic building blocks of the network are stacked autoencoder for the multiple modalities. The performance of deep learning-based models with and without multimodal fusion and shared learning are compared. The results indicates that the use of multimodal fusion and shared learning help to improve deep learning-based medical image analysis.

[1]  Daoqiang Zhang,et al.  Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.

[2]  Rahul Kumar Sevakula,et al.  Assessing Generalization Ability of Majority Vote Point Classifiers , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Nishchal K. Verma,et al.  Comparative analysis of Gaussian mixture model, logistic regression and random forest for big data classification using map reduce , 2016, 2016 11th International Conference on Industrial and Information Systems (ICIIS).

[6]  Wen-Huang Cheng,et al.  Computer-aided classification of lung nodules on computed tomography images via deep learning technique , 2015, OncoTargets and therapy.

[7]  M. Jorge Cardoso,et al.  Accurate multimodal probabilistic prediction of conversion to Alzheimer's disease in patients with mild cognitive impairment☆ , 2013, NeuroImage: Clinical.

[8]  Ronald M. Summers,et al.  DeepOrgan: Multi-level Deep Convolutional Networks for Automated Pancreas Segmentation , 2015, MICCAI.

[9]  Daniel Rueckert,et al.  Random forest-based similarity measures for multi-modal classification of Alzheimer's disease , 2013, NeuroImage.

[10]  R. Edelman,et al.  Magnetic resonance imaging (2) , 1993, The New England journal of medicine.

[11]  Jiang Hsieh,et al.  Computed Tomography: Principles, Design, Artifacts, and Recent Advances, Fourth Edition , 2022 .

[12]  D. Goldfarb A family of variable-metric methods derived by variational means , 1970 .

[13]  J. M. Ollinger,et al.  Positron Emission Tomography , 2018, Handbook of Small Animal Imaging.

[14]  Jianzhong Wu,et al.  Stacked Sparse Autoencoder (SSAE) for Nuclei Detection on Breast Cancer Histopathology Images , 2016, IEEE Transactions on Medical Imaging.

[15]  Yan Cui,et al.  Transfer Learning for Molecular Cancer Classification Using Deep Neural Networks , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  Daoqiang Zhang,et al.  Semi-Supervised Multimodal Relevance Vector Regression Improves Cognitive Performance Estimation from Imaging and Biological Biomarkers , 2013, Neuroinformatics.

[17]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[18]  E. Pisano,et al.  Diffraction enhanced x-ray imaging. , 1997, Physics in medicine and biology.

[19]  Jiajun Zhang,et al.  Deep Neural Networks in Machine Translation: An Overview , 2015, IEEE Intelligent Systems.

[20]  Yan Cui,et al.  Layerwise feature selection in Stacked Sparse Auto-Encoder for tumor type prediction , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[21]  Seong-Whan Lee,et al.  Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis , 2014, NeuroImage.

[22]  D. Tank,et al.  Brain magnetic resonance imaging with contrast dependent on blood oxygenation. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[23]  C. G. Broyden The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations , 1970 .

[24]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[25]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[26]  R. Fletcher,et al.  A New Approach to Variable Metric Algorithms , 1970, Comput. J..

[27]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[28]  Dumitru Erhan,et al.  Deep Neural Networks for Object Detection , 2013, NIPS.

[29]  D. Shanno Conditioning of Quasi-Newton Methods for Function Minimization , 1970 .

[30]  Vince D. Calhoun,et al.  Deep learning for neuroimaging: a validation study , 2013, Front. Neurosci..

[31]  Tom Heskes,et al.  Neural Decoding with Hierarchical Generative Models , 2010, Neural Computation.

[32]  Degui Xiao,et al.  Medical Image Retrieval: A Multimodal Approach , 2014, Cancer informatics.

[33]  Andrew G. Howard,et al.  Some Improvements on Deep Convolutional Neural Network Based Image Classification , 2013, ICLR.

[34]  Luca Maria Gambardella,et al.  Deep Neural Networks Segment Neuronal Membranes in Electron Microscopy Images , 2012, NIPS.

[35]  Vikas Singh,et al.  Deep Learning Architecture for High-Level Feature Generation Using Stacked Auto Encoder for Business Intelligence , 2018 .