Financial Default Prediction via Motif-preserving Graph Neural Network with Curriculum Learning

User financial default prediction plays a critical role in credit risk forecasting and management. It aims to predict the probability that a user will fail to make repayments in the future. Previous methods mainly extract a set of individual features from the user's own profile and behaviors and build a binary classification model to make default predictions. However, these methods do not achieve satisfactory results, especially for users with limited information. Although recent efforts suggest that default prediction can be improved by using social relations, they fail to capture the higher-order topological structure at the level of small subgraph patterns. In this paper, we fill this gap by proposing a motif-preserving Graph Neural Network with curriculum learning (MotifGNN), which jointly learns the lower-order structures from the original graph and the higher-order structures from multi-view motif-based graphs for financial default prediction. Specifically, to address the weak connectivity of motif-based graphs, we design a motif-based gating mechanism that uses the information learned from the original graph, which has good connectivity, to strengthen the learning of the higher-order structure. Moreover, because the motif patterns of different samples are highly unbalanced, we propose a curriculum learning mechanism over the whole learning process that focuses more on the samples with uncommon motif distributions. Extensive experiments on one public dataset and two industrial datasets demonstrate the effectiveness of the proposed method.
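To make the weak-connectivity issue concrete, the following is a minimal sketch of how a motif-based graph can be derived from an original graph. It assumes an undirected, unweighted graph and uses only the triangle motif; the function name `triangle_motif_adjacency` and the toy graph are hypothetical, and the motif types and construction actually used by MotifGNN are not specified in the abstract.

```python
import numpy as np

def triangle_motif_adjacency(adj):
    """Weight each edge (i, j) by the number of triangles it participates in.

    Assumes `adj` is a symmetric 0/1 adjacency matrix with a zero diagonal.
    """
    adj = np.asarray(adj)
    # (adj @ adj)[i, j] counts the common neighbours of i and j;
    # multiplying elementwise by adj keeps only pairs that are already edges.
    return (adj @ adj) * adj

# Toy graph (illustrative only): a triangle 0-1-2 plus a pendant edge 2-3.
adj = np.array([
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
])

motif_adj = triangle_motif_adjacency(adj)

# Nodes with no motif edges are isolated in the motif-based graph,
# even though they are connected in the original graph.
isolated = np.where(motif_adj.sum(axis=1) == 0)[0]
print(motif_adj)
print("isolated in motif graph:", isolated.tolist())
```

In this toy case node 3 has a neighbor in the original graph but takes part in no triangle, so it has no edges in the motif-based graph. This is the kind of sparsity that motivates passing information from the original graph into the learning of the higher-order structure.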
