Latent correlation embedded discriminative multi-modal data fusion

Abstract Ever-growing volume of multi-modal data has garnered considerable research enthusiasm in many fields, such as biomedical image analysis and biometrics. It remains a challenge how to effectively fuse multi-modal data for improving the learning performance. Many data fusion methods have been proposed, such as multi-kernel learning (MKL) and canonical component analysis (CCA). Most of the previous fusion methods focus on concatenating the multi-modal data, which fail to preserve the structure information across different modalities. In addition, they treat the samples equally in fusion and ignore the difference of the contributions of samples to the fusion model. In this paper, we proposed a discriminative multi-modal dimensionality reduction method, which can seamlessly fuse the multi-modal data and explore the latent correlation among different modalities for robust representation learning. In optimization, self-paced learning is adopted to dynamically estimate the contribution of each sample to the overall fusion model. Following the easy-to-hard learning sequence, our model can adaptively and sequentially enhance the robustness of the learning system in a knowledge discovery manner. Extensive experimental results on multi-modal brain disease diagnosis and multi-spectral palmprint classification tasks demonstrate the proposed method outperforms the previous multi-modal classification methods.

[1]  David Zhang,et al.  Double-orientation code and nonlinear matching scheme for palmprint recognition , 2016, Pattern Recognit..

[2]  David Zhang,et al.  A feature extraction method for use with bimodal biometrics , 2010, Pattern Recognit..

[3]  Loris Nanni,et al.  Overview of the combination of biometric matchers , 2017, Inf. Fusion.

[4]  Ying Tai,et al.  Nuclear Norm Based Matrix Regression with Applications to Face Recognition with Occlusion and Illumination Changes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Yueting Zhuang,et al.  Supervised Coupled Dictionary Learning with Group Structures for Multi-modal Retrieval , 2013, AAAI.

[6]  Brian B. Avants,et al.  Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population , 2014, NeuroImage.

[7]  Kai A. Krueger,et al.  Flexible shaping: How learning in small steps helps , 2009, Cognition.

[8]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Huda Khayrallah,et al.  Deep Generalized Canonical Correlation Analysis , 2017, RepL4NLP@ACL.

[10]  Daphne Koller,et al.  Self-Paced Learning for Latent Variable Models , 2010, NIPS.

[11]  Adam N. Letchford,et al.  Non-convex mixed-integer nonlinear programming: A survey , 2012 .

[12]  Chengjun Liu,et al.  Color Image Discriminant Models and Algorithms for Face Recognition , 2008, IEEE Transactions on Neural Networks.

[13]  Peter J. Bickel,et al.  Maximum Likelihood Estimation of Intrinsic Dimension , 2004, NIPS.

[14]  G. Kerschen,et al.  The Method of Proper Orthogonal Decomposition for Dynamical Characterization and Order Reduction of Mechanical Systems: An Overview , 2005 .

[15]  U. Rajendra Acharya,et al.  Computer-aided diagnosis of Myocardial Infarction using ultrasound images with DWT, GLCM and HOS methods: A comparative study , 2015, Comput. Biol. Medicine.

[16]  Jianhua Dai,et al.  Joint Dictionary Learning for Unsupervised Feature Selection , 2016, ICANN.

[17]  Febo Cincotti,et al.  Towards Noninvasive Hybrid Brain–Computer Interfaces: Framework, Practice, Clinical Application, and Beyond , 2015, Proceedings of the IEEE.

[18]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[19]  Guoliang Shao,et al.  Computer-Aided Diagnosis of Breast DCE-MRI Images Using Bilateral Asymmetry of Contrast Enhancement Between Two Breasts , 2014, Journal of Digital Imaging.

[20]  Rama Chellappa,et al.  Joint Sparse Representation for Robust Multimodal Biometrics Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Peyman Adibi,et al.  Two-stage multiple kernel learning for supervised dimensionality reduction , 2015, Pattern Recognit..

[22]  Yang Yang,et al.  Zero-Shot Hashing via Transferring Supervised Knowledge , 2016, ACM Multimedia.

[23]  Daoqiang Zhang,et al.  Multi-modal dimensionality reduction using effective distance , 2017, Neurocomputing.

[24]  Daoqiang Zhang,et al.  Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.

[25]  Yong Jae Lee,et al.  Learning the easy things first: Self-paced visual category discovery , 2011, CVPR 2011.

[26]  Qi Tian,et al.  Lp-Norm IDF for Large Scale Image Search , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  A. Drzezga,et al.  Current status and future role of brain PET/MRI in clinical and research settings , 2015, European Journal of Nuclear Medicine and Molecular Imaging.

[28]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[29]  G. Sapiro,et al.  A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[30]  Keum-Shik Hong,et al.  Decoding of four movement directions using hybrid NIRS-EEG brain-computer interface , 2014, Front. Hum. Neurosci..

[31]  David Zhang,et al.  Combining Left and Right Palmprint Images for More Accurate Personal Identification , 2015, IEEE Transactions on Image Processing.

[32]  Quan-Sen Sun,et al.  A unified multiset canonical correlation analysis framework based on graph embedding for multiple feature extraction , 2015, Neurocomputing.

[33]  Dazhe Zhao,et al.  Sparse shared structure based multi-task learning for MRI based cognitive performance prediction of Alzheimer's disease , 2017, Pattern Recognit..

[34]  Elise A. Piazza,et al.  Measuring speaker–listener neural coupling with functional near infrared spectroscopy , 2016, Scientific Reports.

[35]  Hanwang Zhang,et al.  L2, p-norm and sample constraint based feature selection and classification for AD diagnosis , 2016, Neurocomputing.

[36]  Yili Liu,et al.  A Brain–Computer Interface-Based Vehicle Destination Selection System Using P300 and SSVEP Signals , 2015, IEEE Transactions on Intelligent Transportation Systems.

[37]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Fan Zhang,et al.  Discriminant locality preserving projections based on L2, p-norm for image feature extraction and recognition , 2019, J. Vis. Commun. Image Represent..

[39]  Wentao Hu,et al.  The fault feature extraction and classification of gear using principal component analysis and kernel principal component analysis based on the wavelet packet transform , 2014 .

[40]  Li Zhang,et al.  Robust Alternating Low-Rank Representation by joint Lp- and L2, p-norm minimization , 2017, Neural Networks.

[41]  Xiao-Jun Wu,et al.  Fast SRC using quadratic optimisation in downsized coefficient solution subspace , 2019, Signal Process..

[42]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[43]  Shuicheng Yan,et al.  Correlation Metric for Generalized Feature Extraction , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.