论文信息 - A fuzzy neighborhood rough set method for anomaly detection in large scale data

A fuzzy neighborhood rough set method for anomaly detection in large scale data

Mining Outlier in database is to find exceptional objects that deviate from the rest of the datasets. Besides classical outlier analysis algorithms, recent studies have focused on mining local outliers. The outliers that have density distribution significantly different from their neighborhood. However, the existing outlier detection algorithms suffer the drawbacks that they are inefficient in dealing with large scale datasets. In this paper, we propose a novel approach for outlier detection with voluminous data. This approach involves a neighborhood fuzzy rough set theory to rank outlier according to fuzzy membership function computed in rough approximation space. In order to improve the speed of computation, an efficient parallel computing system based on Map Reduce model is developed

Ziyati Elhoussaine | EL Meziati Marouane

[1] Takafumi Kanamori,et al. Statistical outlier detection using direct density ratio estimation , 2011, Knowledge and Information Systems.

[2] Karanjit Singh,et al. Nearest Neighbour Based Outlier Detection Techniques , 2012 .

[3] Witold Pedrycz,et al. Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[4] T. Y. Lin,et al. Neighborhood systems and relational databases , 1988, CSC '88.

[5] B. Muthukumar,et al. Intrusion Detection System (IDS): Anomaly Detection Using Outlier Detection Approach , 2015 .

[6] Guoyin Wang,et al. Granular computing: from granularity optimization to multi-granularity joint problem solving , 2016, Granular Computing.

[7] Cesare Alippi,et al. Credit Card Fraud Detection: A Realistic Modeling and a Novel Learning Strategy , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[8] M. Suresh,et al. A Study of Rough Sets Theory and It ’ s Application Over Various Fields , 2017 .

[9] Vinod S. Wadne,et al. Unsupervised Distance-Based Outlier Detection Using Nearest Neighbours Algorithm on Distributed Approach: Survey , 2015 .

[10] Kanishka Bhaduri,et al. Algorithms for speeding up distance-based outlier detection , 2011, KDD.

[11] Theresa Beaubouef,et al. Rough Sets , 2019, Lecture Notes in Computer Science.

[12] Yiyu Yao,et al. Information granulation and rough set approximation , 2001, Int. J. Intell. Syst..