Parameter-Free Incremental Co-Clustering for Multi-Modal Data in Cyber-Physical-Social Systems

With the rapid advances of cyber-physical-social systems (CPSS), large amounts of dynamic multi-modal data are being generated and collected. Analyzing those data effectively and efficiently can help to promote the development and improve the service quality of CPSS applications. As an important technique of multi-modal data analysis, co-clustering, designed to identify groupings of multi-dimensional data based on cross-modality fusion, is often exploited. Unfortunately, most existing co-clustering methods that mainly focus on dealing with static data become infeasible to fuse huge volume of multi-modal data in dynamic CPSS environments. To tackle this problem, this paper proposes a parameter-free incremental co-clustering method to deal with multi-modal data dynamically. In the proposed method, the single-modality similarity measure is extended to multiple modalities and three operations, namely, cluster creating, cluster merging, and instance partitioning, are defined to incrementally integrate new arriving objects to current clustering patterns without introducing additive parameters. Moreover, an adaptive weight scheme is designed to measure the importance of feature modalities based on the intra-cluster scatters. Extensive experiments on three real-world multi-modal datasets collected from CPSS demonstrate that the proposed method outperforms the compared state-of-the-art methods in terms of effectiveness and efficiency, thus it is promising for clustering dynamic multi-modal data in cyber-physical-social systems.

[1]  Yueming Hu,et al.  Distributed Feature Selection for Efficient Economic Big Data Analysis , 2018, IEEE Transactions on Big Data.

[2]  Michael Isard,et al.  A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics , 2012, International Journal of Computer Vision.

[3]  Xuran Zhao,et al.  A subspace co-training framework for multi-view clustering , 2014, Pattern Recognit. Lett..

[4]  Jian-Ping Mei,et al.  Incremental Fuzzy Clustering With Multiple Medoids for Large Data , 2014, IEEE Transactions on Fuzzy Systems.

[5]  Shiliang Sun,et al.  Multi-view learning overview: Recent progress and new challenges , 2017, Inf. Fusion.

[6]  Laurence T. Yang,et al.  An Incremental CFS Algorithm for Clustering Large Data in Industrial Internet of Things , 2017, IEEE Transactions on Industrial Informatics.

[7]  Chonghui Guo,et al.  Incremental Affinity Propagation Clustering Based on Message Passing , 2014, IEEE Transactions on Knowledge and Data Engineering.

[8]  Laurence T. Yang,et al.  High-order possibilistic c-means algorithms based on tensor decompositions for big data in IoT , 2018, Inf. Fusion.

[9]  Yanhua Chen,et al.  Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  Sanjay Chakraborty,et al.  Analysis and Study of Incremental DBSCAN Clustering Algorithm , 2014, ArXiv.

[11]  Dacheng Tao,et al.  A Survey on Multi-view Learning , 2013, ArXiv.

[12]  Dong Xu,et al.  Semi-Supervised Heterogeneous Fusion for Multimedia Data Co-Clustering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[13]  Ning Chen,et al.  Predictive Subspace Learning for Multi-view Data: a Large Margin Approach , 2010, NIPS.

[14]  Dean P. Foster,et al.  Large Scale Canonical Correlation Analysis with Iterative Least Squares , 2014, NIPS.

[15]  Liang Wang,et al.  Multi-view clustering via pairwise sparse subspace representation , 2015, Neurocomputing.

[16]  Haroon Idrees,et al.  NMF-KNN: Image Annotation Using Weighted Multi-view Non-negative Matrix Factorization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Hal Daumé,et al.  A Co-training Approach for Multi-view Spectral Clustering , 2011, ICML.

[18]  Sham M. Kakade,et al.  Multi-view clustering via canonical correlation analysis , 2009, ICML '09.

[19]  Lei Du,et al.  Robust Multi-View Spectral Clustering via Low-Rank and Sparse Decomposition , 2014, AAAI.

[20]  Victor C. M. Leung,et al.  Incomplete multi-view clustering via deep semantic mapping , 2018, Neurocomputing.

[21]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[22]  Yves Grandvalet,et al.  Composite kernel learning , 2008, ICML '08.

[23]  Philip S. Yu,et al.  Online Unsupervised Multi-view Feature Selection , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[24]  Xiaoli Li,et al.  Incremental Minimax Optimization based Fuzzy Clustering for Large Multi-view Data , 2016, ArXiv.

[25]  Feiping Nie,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Multi-View K-Means Clustering on Big Data , 2022 .

[26]  Feiping Nie,et al.  Large-Scale Multi-View Spectral Clustering via Bipartite Graph , 2015, AAAI.

[27]  Jianping Fan,et al.  Multi-View Concept Learning for Data Representation , 2015, IEEE Transactions on Knowledge and Data Engineering.

[28]  Laurence T. Yang,et al.  Privacy-Preserving Double-Projection Deep Computation Model With Crowdsourcing on Cloud for Big Data Feature Learning , 2018, IEEE Internet of Things Journal.

[29]  Laurence T. Yang,et al.  A Tucker Deep Computation Model for Mobile Multimedia Feature Learning , 2017, ACM Trans. Multim. Comput. Commun. Appl..