A two-route CNN model for bank account classification with heterogeneous data

Classifying bank accounts by using transaction data is encouraging in cracking down on illegal financial activities. However, few research simultaneously use heterogenous features, which are embedded in the time series data. In this paper, a two route convolution neural network TRHD-CNN model, fed with two types of heterogeneous feature matrices, is proposed for classifying the bank accounts. TRHD-CNN adopts divide and conquer strategy to extract characteristics from two types of data source independently. The strategy is proved able in mining complementary classification characteristics. We firstly transfer the original log data into a directed and dynamic transaction network. On the basis of that, two feature generation methods are devised for extracting information from local topological structure and time series transaction respectively. A DirectedWalk method is developed in this paper to learning the network vector of vertices used for embedding the neighbor relationship of bank account. The extensive experimental results, conducted on a real bank transaction dataset that contains illegal pyramid selling accounts, show the significant advantage of TRHD-CNN over the existing methods. TRHD-CNN can provide recall scores up to 5.15% higher than competing methods. In addition, the two-route architecture of TRHD-CNN is easy to extend to multi-route scenarios and other fields.

[1]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[2]  Liu Yang Based on social network crime organization relation mining and central figure determining , 2012, 2012 IEEE International Conference on Computer Science and Automation Engineering.

[3]  Masoumeh Zareapoor,et al.  FraudMiner: A Novel Credit Card Fraud Detection Model Based on Frequent Itemset Mining , 2014, TheScientificWorldJournal.

[4]  Su-nan Wang,et al.  A Money Laundering Risk Evaluation Method Based on Decision Tree , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[5]  Ekrem Duman,et al.  Detecting credit card fraud by decision trees and support vector machines , 2011 .

[6]  Yuan He,et al.  Cascaded heterogeneous convolutional neural networks for handwritten digit recognition , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[7]  Tianqing Zhu Suspicious Financial Transaction Detection Based on Empirical Mode Decomposition Method , 2006, APSCC.

[8]  Andrew Zisserman,et al.  Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[9]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[10]  Zhu Tianqing Suspicious Financial Transaction Detection Based on Empirical Mode Decomposition Method , 2006, 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06).

[11]  Liqing Zhang,et al.  Credit Card Fraud Detection Using Convolutional Neural Networks , 2016, ICONIP.

[12]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[13]  Tatsuya Minegishi,et al.  Proposal of Credit Card Fraudulent Use Detection by Online-type Decision Tree Construction and Verification of Generality , 2013 .

[14]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[15]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[16]  Wei Wang,et al.  Detecting Pyramid Scheme Accounts with Time Series Financial Transactions , 2018, 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC).

[17]  Xinghuo Yu,et al.  Detection of money laundering groups using supervised learning in networks , 2016, ArXiv.

[18]  Pengzhu Zhang,et al.  Sequence Matching for Suspicious Activity Detection in Anti-Money Laundering , 2008, ISI Workshops.

[19]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[20]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[22]  Padraig Cunningham,et al.  Mining Dense Structures to Uncover Anomalous Behaviour in Financial Network Data , 2011, MSM/MUSE.

[23]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[24]  J. M. Serrano,et al.  Association rules applied to credit card fraud detection , 2009, Expert Syst. Appl..

[25]  Shenghua Gao,et al.  Single-Image Crowd Counting via Multi-Column Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[27]  Anazida Zainal,et al.  Fraud detection system: A survey , 2016, J. Netw. Comput. Appl..

[28]  Y. Sahin,et al.  Detecting credit card fraud by ANN and logistic regression , 2011, 2011 International Symposium on Innovations in Intelligent Systems and Applications.