A Unified Deep Learning Framework for Multi-Modal Multi-Dimensional Data

In data science applications, multi-modal and multidimensional nature of data leads to various types of manual features and tailored classifiers to specific problems. Traditional feature engineering demands expertise and the performance from integrated classifiers has reached a plateau after years of development. Emerging technologies such as deep learning make a great performance leap after transition from feature engineering to architecture engineering. However, putting this technology to practical use faces numerous challenges. First, it is known that the success of deep learning is built upon large amount of training data, but most data science problems do not have sufficient training data. In addition, designing the architecture of deep models needs substantial expertise. Moreover, existing deep neural networks have limitations on the type of inputs they can work on, and their output is limited in the type of problems they can solve. To address these challenges, we propose a unified deep learning framework that suits data in multiple modalities and dimensions, and prove that using the proposed framework yields better performance than traditional machine learning approaches. Besides the examples used for validation, the proposed framework demonstrates potential in becoming a foundation, upon which more wide-spread data science problems can benefit from using deep learning.

[1]  Joelle Pineau,et al.  Learning Robust Features using Deep Learning for Automatic Seizure Detection , 2016, MLHC.

[2]  Pushparaja Murugan,et al.  Learning The Sequential Temporal Information with Recurrent Neural Networks , 2018, ArXiv.

[3]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[4]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7]  Rafik Goubran,et al.  Localizing 3-D Anatomical Landmarks Using Deep Convolutional Neural Networks , 2017, 2017 14th Conference on Computer and Robot Vision (CRV).

[8]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[9]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[10]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Daniel L Rubin,et al.  A curated mammography data set for use in computer-aided detection and diagnosis research , 2017, Scientific Data.

[13]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Rafik Goubran,et al.  Abnormality Detection in Mammography using Deep Convolutional Neural Networks , 2018, 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Yi Zhou,et al.  Automatic seizure detection using three-dimensional CNN based on multi-channel EEG , 2018, BMC Medical Informatics and Decision Making.

[18]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[19]  Masoumeh Haghpanahi,et al.  Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network , 2019, Nature Medicine.

[20]  François Goulette,et al.  Classification of Point Cloud Scenes with Multiscale Voxel Deep Network , 2018, ArXiv.

[21]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[22]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[23]  Kathleen M. Robinette,et al.  The CAESAR project: a 3-D surface anthropometry survey , 1999, Second International Conference on 3-D Digital Imaging and Modeling (Cat. No.PR00062).

[24]  Shaohan Hu,et al.  DeepSense: A Unified Deep Learning Framework for Time-Series Mobile Sensing Data Processing , 2016, WWW.