An Unsupervised Embedding Learning Feature Representation Scheme for Network Big Data Analysis

In the big data era, data arrive with ever-increasing volume and dimensionality, which poses a considerable challenge for data compression in network representation and analysis. The quality of the learned low-dimensional representation strongly affects the performance of downstream network learning tasks. In this paper, we propose an unsupervised embedding learning scheme for feature representation based on deep Siamese neural networks, aiming to learn an efficient low-dimensional feature subspace. Unsupervised embedding learning is a difficult but interesting task because the search for a good embedding proceeds without the guidance of class labels. A Siamese network is a neural network that, in its usual supervised mode, can learn an efficient feature subspace. It trains two networks with shared weights simultaneously and feeds them pairs of samples drawn at random from the same dataset. As a result, the feature space is projected onto a low-dimensional subspace in which the distance between similar samples is close to zero, whereas the distance between dissimilar samples exceeds a predefined margin. We further discuss the deep Siamese neural network in an unsupervised mode and its application to embedding learning. The proposed method can also be used to address semi-supervised feature representation problems. Finally, the learned unsupervised embedding is validated on eight publicly available datasets covering images, voice data, and text documents. Extensive experiments show that the proposed method outperforms the compared state-of-the-art embedding approaches.
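
The margin-based behaviour described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact loss, so the code below assumes the standard contrastive loss commonly used with Siamese networks; the names `embed`, `contrastive_loss`, `weights`, and `margin` are hypothetical, and the `similar` flag is assumed to be given (how it is obtained without labels in the unsupervised mode is not specified here).

```python
import math

def embed(x, weights):
    """Toy one-layer embedding; both branches call it with the SAME weights
    (assumption: stands in for the deep network of the paper)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]

def contrastive_loss(pairs, margin=1.0):
    """Standard contrastive loss over (z1, z2, similar) triples.

    Similar pairs are penalised by their squared distance, so they are
    pulled towards distance zero. Dissimilar pairs are penalised only
    while their distance is below the margin, so they are pushed apart
    until they exceed it.
    """
    total = 0.0
    for z1, z2, similar in pairs:
        d = math.dist(z1, z2)  # Euclidean distance in the embedded space
        if similar:
            total += d ** 2
        else:
            total += max(0.0, margin - d) ** 2
    return 0.5 * total / len(pairs)

# Hypothetical 3-dimensional inputs mapped to a 2-dimensional subspace.
weights = [[0.5, -0.2, 0.1],
           [0.3, 0.8, -0.5]]
x_a, x_b = [1.0, 0.0, 2.0], [0.9, 0.1, 2.1]
pair = (embed(x_a, weights), embed(x_b, weights), True)
loss = contrastive_loss([pair], margin=1.0)
```

Because both inputs pass through `embed` with the same `weights`, a gradient step on this loss would update a single set of parameters, which is what weight sharing between the two branches amounts to.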
