SAFE: A Neural Survival Analysis Model for Fraud Early Detection

Many online platforms have deployed anti-fraud systems to detect and prevent fraudulent activities. However, there is usually a gap between the time that a user commits a fraudulent action and the time that the user is suspended by the platform. How to detect fraudsters in time is a challenging problem. Most of the existing approaches adopt classifiers to predict fraudsters given their activity sequences along time. The main drawback of classification models is that the prediction results between consecutive timestamps are often inconsistent. In this paper, we propose a survival analysis based fraud early detection model, SAFE, which maps dynamic user activities to survival probabilities that are guaranteed to be monotonically decreasing along time. SAFE adopts recurrent neural network (RNN) to handle user activity sequences and directly outputs hazard values at each timestamp, and then, survival probability derived from hazard values is deployed to achieve consistent predictions. Because we only observe the user suspended time instead of the fraudulent activity time in the training data, we revise the loss function of the regular survival model to achieve fraud early detection. Experimental results on two real world datasets demonstrate that SAFE outperforms both the survival analysis model and recurrent neural network model alone as well as state-of-theart fraud early detection approaches.

[1]  Huan Liu,et al.  Gleaning Wisdom from the Past: Early Detection of Emerging Rumors in Social Media , 2017, SDM.

[2]  Yang Xiang,et al.  Wikipedia Vandal Early Detection: From User Behavior to User Embedding , 2017, ECML/PKDD.

[3]  Yoshua Bengio,et al.  Deep Learning for Patient-Specific Kidney Graft Survival Analysis , 2017, ArXiv.

[4]  Utkarsh Upadhyay,et al.  Recurrent Marked Temporal Point Processes: Embedding Event History to Vector , 2016, KDD.

[5]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[6]  Ângelo Cardoso,et al.  A Recurrent Neural Network Survival Model: Predicting Web User Return Time , 2018, ECML/PKDD.

[7]  Adler J. Perotte,et al.  Deep Survival Analysis , 2016, MLHC.

[8]  Chandan K. Reddy,et al.  Machine Learning for Survival Analysis: A Survey , 2017, ArXiv.

[9]  Neil Shah,et al.  False Information on Web and Social Media: A Survey , 2018, ArXiv.

[10]  Xiaowei Ying,et al.  Spectrum based fraud detection in social networks , 2011, ICDE.

[11]  Ying Li,et al.  Early Prediction of Diabetes Complications from Electronic Health Records: A Multi-Task Survival Analysis Approach , 2018, AAAI.

[12]  Lawrence Carin,et al.  Adversarial Time-to-Event Modeling , 2018, ICML.

[13]  E Biganzoli,et al.  Feed forward neural networks for the analysis of censored survival data: a partial logistic regression approach. , 1998, Statistics in medicine.

[14]  Zhi-Hua Zhou,et al.  A spectral approach to detecting subtle anomalies in graphs , 2013, Journal of Intelligent Information Systems.

[15]  Russell Greiner,et al.  Learning Patient-Specific Cancer Survival Distributions as a Sequence of Dependent Regressors , 2011, NIPS.

[16]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[17]  Alexander J. Smola,et al.  Neural Survival Recommender , 2017, WSDM.

[18]  Ratna Babu Chinnam,et al.  Survival Analysis based Framework for Early Prediction of Student Dropouts , 2016, CIKM.

[19]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[20]  V. S. Subrahmanian,et al.  VEWS: A Wikipedia Vandal Early Warning System , 2015, KDD.

[21]  Egil Martinsson,et al.  WTTE-RNN : Weibull Time To Event Recurrent Neural Network A model for sequential prediction of time-to-event in the case of discrete or continuous censored data, recurrent events or time-varying covariates , 2017 .

[22]  Jun Yan Survival Analysis: Techniques for Censored and Truncated Data , 2004 .

[23]  Changhee Lee,et al.  DeepHit: A Deep Learning Approach to Survival Analysis With Competing Risks , 2018, AAAI.

[24]  Ying Cai,et al.  Spatio-Temporal Check-in Time Prediction with Recurrent Neural Network based Survival Analysis , 2018, IJCAI.

[25]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[26]  Fabrizio Silvestri,et al.  Improving Post-Click User Engagement on Native Ads via Survival Analysis , 2016, WWW.

[27]  Leman Akoglu,et al.  Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs , 2016, KDD.

[28]  D.,et al.  Regression Models and Life-Tables , 2022 .

[29]  Ahmed M. Alaa,et al.  Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks , 2017, NIPS.

[30]  Jun Li,et al.  Spectrum-based Deep Neural Networks for Fraud Detection , 2017, CIKM.