Paper Information - Credit Risk Modeling Using Transfer Learning and Domain Adaptation

In credit risk assessment, lenders may have limited or no data on the historical lending outcomes of credit applicants. This typically and disproportionately affects Micro, Small, and Medium Enterprises (MSMEs), for which credit may be restricted or too costly because of the difficulty of predicting the Probability of Default (PD). However, if data from related credit risk domains is available, transfer learning may be applied to train models successfully, e.g., from the credit card lending and debt consolidation (CD) domains to predict in the small business lending domain. In this article, we report successful results from an approach that uses transfer learning to predict the probability of default, based on the novel concept of Progressive Shift Contribution (PSC) from source to target domain. Toward real-world application of this approach by lenders, we further address two key questions: first, how to explain transfer learning models, and second, how to adjust features when the source and target domains differ. To address the first question, we apply Shapley values to investigate how and why transfer learning improves model accuracy. To address the second, we propose and test a domain adaptation approach. The results show that adaptation improves model accuracy in addition to the improvement from transfer learning. We extend this by proposing and testing a combined strategy of feature selection and adaptation to convert values of source domain features to better approximate values of target domain features. Our approach includes a strategy to choose features for adaptation and an algorithm to adapt the values of these features. In this setting, transfer learning appears to improve model accuracy by increasing the contribution of less predictive features. Although the percentage improvements are small, such improvements in real-world lending could be of significant economic importance.
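The abstract relies on Shapley values to attribute a model's prediction to individual features. As a minimal sketch of that idea only, and not the authors' implementation, the following computes exact Shapley values by enumerating feature subsets. The scorer `toy_score`, its weights, and the `applicant` and `reference` vectors are hypothetical values chosen for illustration; features outside a subset are assumed to take the reference value.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict(x) relative to predict(baseline)."""
    n = len(x)

    def value(subset):
        # Features in `subset` keep their real values; all others are
        # replaced by the baseline (an assumption of this sketch).
        mixed = [x[j] if j in subset else baseline[j] for j in range(n)]
        return predict(mixed)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight for a coalition of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


# Hypothetical linear scorer standing in for a credit risk model.
WEIGHTS = [0.8, -0.5, 0.3]


def toy_score(features):
    return sum(w * f for w, f in zip(WEIGHTS, features))


applicant = [1.0, 2.0, 0.5]   # hypothetical applicant features
reference = [0.0, 1.0, 0.5]   # hypothetical baseline features
phi = shapley_values(toy_score, applicant, reference)
print(phi)
```

For a linear scorer each attribution reduces to the weight times the feature's deviation from the baseline, and the attributions sum to the difference between the two predictions. Exhaustive enumeration is exponential in the number of features, so it is only practical for very small feature sets.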

Michael Bain | Ashesh Mahidadia | Hendra Suryanto | Charles Guan | Ada Guan
