Incomplete multi-view clustering via deep semantic mapping

Abstract Multi-view clustering, which explores complementary information between multiple feature sets by consensus grouping, has benefited many data analytic applications. The majority of previous multi-view clustering studies usually assume that all feature sets appear in complete. In real-world applications, however, it is often the case that some views could suffer from the missing of examples, resulting in incomplete feature sets. The incompleteness of views makes it difficult to synthesize all feature sets and achieve a comprehensive description of data samples. In this paper, we develop a novel incomplete multi-view clustering method, which projects all incomplete multi-view data to a complete and unified representation in a common subspace. Different from existing researches which exploit shallow learning to identify the common subspace, a deep incomplete multi-view clustering (DIMC) incorporating with the constraint of intrinsic geometric structure is proposed here to couple incomplete multi-view samples. To bridge the gap between each view and the common representation, the multi-view deep coupled networks are trained to map high-level semantic features. Besides, to preserve the local invariance for each view, an affinity graph based regularizer is constructed to encode geometrical information. Therefore, a new objective function is developed and the optimization processes are presented. Extensive experiments on several real-world datasets demonstrate that the proposed DIMC is superior to the state-of-the-art incomplete multi-view clustering methods.

[1]  Laurence T. Yang,et al.  PPHOCFS: Privacy Preserving High-Order CFS Algorithm on the Cloud for Clustering Multimedia Data , 2016, ACM Trans. Multim. Comput. Commun. Appl..

[2]  Shiliang Sun,et al.  A survey of multi-view machine learning , 2013, Neural Computing and Applications.

[3]  Yueming Hu,et al.  Distributed Feature Selection for Efficient Economic Big Data Analysis , 2018, IEEE Transactions on Big Data.

[4]  Rong Jin,et al.  Multiple Kernel Learning for Visual Object Recognition: A Review , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Xuran Zhao,et al.  A subspace co-training framework for multi-view clustering , 2014, Pattern Recognit. Lett..

[6]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[7]  Fuad E. Alsaadi,et al.  A Novel Switching Delayed PSO Algorithm for Estimating Unknown Parameters of Lateral Flow Immunoassay , 2016, Cognitive Computation.

[8]  Yun Fu,et al.  Incomplete Multi-Modal Visual Data Grouping , 2016, IJCAI.

[9]  Hal Daumé,et al.  A Co-training Approach for Multi-view Spectral Clustering , 2011, ICML.

[10]  Zidong Wang,et al.  Inferring nonlinear lateral flow immunoassay state-space models via an unscented Kalman filter , 2016, Science China Information Sciences.

[11]  Thomas S. Huang,et al.  Graph Regularized Nonnegative Matrix Factorization for Data Representation. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[12]  Jeff A. Bilmes,et al.  On Deep Multi-View Representation Learning , 2015, ICML.

[13]  Yu-Chiang Frank Wang,et al.  Heterogeneous Domain Adaptation and Classification by Exploiting the Correlation Subspace , 2014, IEEE Transactions on Image Processing.

[14]  Philip S. Yu,et al.  Multiple Incomplete Views Clustering via Weighted Nonnegative Matrix Factorization with L2, 1 Regularization , 2015, ECML/PKDD.

[15]  Fuad E. Alsaadi,et al.  A switching delayed PSO optimized extreme learning machine for short-term load forecasting , 2017, Neurocomputing.

[16]  Shiliang Sun,et al.  Multi-view learning overview: Recent progress and new challenges , 2017, Inf. Fusion.

[17]  Liang Wang,et al.  Incomplete Multi-view Clustering via Subspace Learning , 2015, CIKM.

[18]  Fuad E. Alsaadi,et al.  Deep Belief Networks for Quantitative Analysis of a Gold Immunochromatographic Strip , 2016, Cognitive Computation.

[19]  Roger Levy,et al.  On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shiguang Shan,et al.  Multi-View Discriminant Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Huda Khayrallah,et al.  Deep Generalized Canonical Correlation Analysis , 2017, RepL4NLP@ACL.

[22]  Philip S. Yu,et al.  Clustering on Multiple Incomplete Datasets via Collective Kernel Learning , 2013, 2013 IEEE 13th International Conference on Data Mining.

[23]  Ming Shao,et al.  Missing Modality Transfer Learning via Latent Low-Rank Constraint , 2015, IEEE Transactions on Image Processing.

[24]  Shao-Yuan Li,et al.  Partial Multi-View Clustering , 2014, AAAI.

[25]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[26]  Liang Wang,et al.  Unified subspace learning for incomplete and unlabeled multi-view data , 2017, Pattern Recognit..

[27]  Yueming Hu,et al.  Local Similarity Imputation Based on Fast Clustering for Incomplete Data in Cyber-Physical Systems , 2018, IEEE Systems Journal.

[28]  Hong Zhang,et al.  Denoising and deblurring gold immunochromatographic strip images via gradient projection algorithms , 2017, Neurocomputing.

[29]  Laurence T. Yang,et al.  An Incremental CFS Algorithm for Clustering Large Data in Industrial Internet of Things , 2017, IEEE Transactions on Industrial Informatics.

[30]  Yun Fu,et al.  Robust Multi-View Subspace Learning through Dual Low-Rank Decompositions , 2016, AAAI.

[31]  Yurong Liu,et al.  A survey of deep neural network architectures and their applications , 2017, Neurocomputing.

[32]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[33]  Jiawei Han,et al.  Multi-View Clustering via Joint Nonnegative Matrix Factorization , 2013, SDM.

[34]  Dacheng Tao,et al.  Multi-View Learning With Incomplete Views , 2015, IEEE Transactions on Image Processing.

[35]  Jianping Fan,et al.  Multi-View Concept Learning for Data Representation , 2015, IEEE Transactions on Knowledge and Data Engineering.