Multi-view Outlier Detection in Deep Intact Space

Recently, multi-view outlier detection has emerged as a challenging research topic in outlier detection because of complex distributions of data across different views. There are mainly three types of outliers, i.e., attribute outliers, class outliers and class-attribute outliers. Most existing multi-view outlier detection approaches only detect part of the three types of outliers in a pairwise manner across different views, which is not able to accomplish the task of multi-view outlier detection comprehensively and uniformly. Outlier detection in a pairwise manner across different views also leads to time-consuming computation. We propose a new algorithm termed Multi-view Outlier Detection in Deep Intact Space (MODDIS) to find all the three types of outliers simultaneously and avoid comparing different views in a pairwise manner. Rather than leveraging subspace clustering, the performance of which is seriously affected by the dependence of subspaces on most real datasets, neural networks are employed in MODDIS in that neural networks have a stronger representation learning ability. Meanwhile, based on the view insufficiency assumption, a multi-view intact outlierness space assumption is proposed. Based on this assumption, a multi-view latent intact space is constructed to encode outlierness information of all views, where outlierness in any view is a snapshot from some perspective. Finally, an outlier detection measurement is defined in the latent intact space. Experiments are conducted on several UCI datasets and the empirical results demonstrate the effectiveness of our proposed method.

[1]  Chang-Dong Wang,et al.  Discriminative metric learning for multi-view graph partitioning , 2018, Pattern Recognit..

[2]  Chang-Dong Wang,et al.  Multi-view Intact Space Clustering , 2017, 2017 4th IAPR Asian Conference on Pattern Recognition (ACPR).

[3]  Jiawei Han,et al.  Survey on web spam detection: principles and algorithms , 2012, SKDD.

[4]  Yun Fu,et al.  Latent Discriminant Subspace Representations for Multi-View Outlier Detection , 2018, AAAI.

[5]  Handong Zhao,et al.  Dual-Regularized Multi-View Outlier Detection , 2015, IJCAI.

[6]  Kevin Duh,et al.  Managing information disparity in multilingual document collections , 2013, TSLP.

[7]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[8]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[9]  Brendan J. Frey,et al.  Winner-Take-All Autoencoders , 2014, NIPS.

[10]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Karsten M. Borgwardt,et al.  Rapid Distance-Based Outlier Detection via Sampling , 2013, NIPS.

[12]  Jimeng Sun,et al.  Neighborhood formation and anomaly detection in bipartite graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[13]  Shilpa Sonawani,et al.  A Survey of Fuzzy Clustering Techniques for Intrusion Detection System , 2014 .

[14]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[15]  Kai Ming Ting,et al.  Defying the gravity of learning curve: a characteristic of nearest neighbour anomaly detectors , 2016, Machine Learning.

[16]  Ming Shao,et al.  Multi-View Low-Rank Analysis for Outlier Detection , 2015, SDM.

[17]  Kai Ming Ting,et al.  LeSiNN: Detecting Anomalies by Identifying Least Similar Nearest Neighbours , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[18]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[19]  Chang-Dong Wang,et al.  Multi-view Spectral Clustering via Multi-view Weighted Consensus and Matrix-Decomposition Based Discretization , 2019, DASFAA.

[20]  Chang-Dong Wang,et al.  Overlapping Community Detection in Multi-view Brain Network , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[21]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[22]  Witold Pedrycz,et al.  Anomaly detection in time series data using a fuzzy c-means clustering , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[23]  Yanyan Shen,et al.  DMDP: A Dynamic Multi-source Default Probability Prediction Framework , 2019, Data Science and Engineering.

[24]  Chang-Dong Wang,et al.  Multi-view Unit Intact Space Learning , 2017, KSEM.

[25]  Chang-Dong Wang,et al.  Multi-view Proximity Learning for Clustering , 2018, DASFAA.

[26]  Tomoharu Iwata,et al.  Clustering-based anomaly detection in multi-view data , 2013, CIKM.

[27]  Deepak S. Turaga,et al.  A Spectral Framework for Detecting Inconsistency across Multi-source Object Relationships , 2011, 2011 IEEE 11th International Conference on Data Mining.

[28]  Ling Chen,et al.  Learning Representations of Ultrahigh-dimensional Data for Random Distance-based Outlier Detection , 2018, KDD.

[29]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[30]  Philip S. Yu,et al.  DeepCF: A Unified Framework of Representation Learning and Matching Function Learning in Recommender System , 2019, AAAI.

[31]  Chang-Dong Wang,et al.  oComm: Overlapping Community Detection in Multi-View Brain Network , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Dacheng Tao,et al.  Multi-View Intact Space Learning , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Charu C. Aggarwal,et al.  NetWalk: A Flexible Deep Embedding Approach for Anomaly Detection in Dynamic Networks , 2018, KDD.

[34]  Hwanjo Yu,et al.  DILOF: Effective and Memory Efficient Local Outlier Detection in Data Streams , 2018, KDD.

[35]  Chang-Dong Wang,et al.  A Harmonic Motif Modularity Approach for Multi-layer Network Community Detection , 2018, 2018 IEEE International Conference on Data Mining (ICDM).