Sequence Matching for Suspicious Activity Detection in Anti-Money Laundering

Developing effective suspicious activity detection methods has become an increasingly critical problem for governments and financial institutions in their efforts to fight money laundering. Previous anti-money laundering (AML) systems were mostly rule-based systems which suffered from low efficiency and could can be easily learned and evaded by money launders. Recently researchers have begun to use machine learning methods to solve the suspicious activity detection problem. However nearly all these methods focus on detecting suspicious activities on accounts or individual level. In this paper we propose a sequence matching based algorithm to identify suspicious sequences in transactions. Our method aims to pick out suspicious transaction sequences using two kinds of information as reference sequences: 1) individual account's transaction history and 2) transaction information from other accounts in a peer group. By introducing the reference sequences, we can combat those who want to evade regulations by simply learning and adapting reporting criteria, and easily detect suspicious patterns. The initial results show that our approach is highly accurate.

[1]  Tianqing Zhu An Outlier Detection Model Based on Cross Datasets Comparison for Financial Surveillance , 2006, APSCC.

[2]  Cláudia Antunes,et al.  Temporal Data Mining: an overview , 2001 .

[3]  Piotr Indyk,et al.  Mining the stock market (extended abstract): which measure is best? , 2000, KDD '00.

[4]  Ted E. Senator,et al.  Ongoing management and application of discovered knowledge in a large regulatory organization: a case study of the use and impact of NASD Regulation's Advanced Detection System (RADS) , 2000, KDD '00.

[5]  Su-nan Wang,et al.  A Money Laundering Risk Evaluation Method Based on Decision Tree , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[6]  Ted E. Senator,et al.  The Financial Crimes Enforcement Network AI System (FAIS) Identifying Potential Money Laundering from Reports of Large Cash Transactions , 1995, AI Mag..

[7]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[8]  Ted E. Senator,et al.  Restructuring Databases for Knowledge Discovery by Consolidation and Link Formation , 1995, KDD.

[9]  Edit Gombay,et al.  Sequential change-point detection with likelihood ratios , 2000 .

[10]  David Sankoff,et al.  Multiple Genome Rearrangement and Breakpoint Phylogeny , 1998, J. Comput. Biol..

[11]  Pengzhu Zhang,et al.  Research on Constraints in Anti-Money Laundering (AML) Business Process in China Based on Theory of Constraints , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[12]  Jun Tang,et al.  Developing an intelligent data discriminating system of anti-money laundering based on SVM , 2005, 2005 International Conference on Machine Learning and Cybernetics.

[13]  Carla E. Brodley,et al.  Temporal sequence learning and data reduction for anomaly detection , 1998, CCS '98.

[14]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[15]  Zhu Tianqing Suspicious Financial Transaction Detection Based on Empirical Mode Decomposition Method , 2006, 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06).

[16]  Arindam Banerjee,et al.  Clickstream clustering using weighted longest common subsequences , 2001 .

[17]  Dragomir Anguelov,et al.  Mining The Stock Market : Which Measure Is Best ? , 2000 .