Credit Scoring for Good: Enhancing Financial Inclusion with Smartphone-Based Microlending

Globally, two billion people and more than half of the poorest adults do not use formal financial services. Consequently, there is increased emphasis on developing financial technology that can facilitate access to financial products for the unbanked. In this regard, smartphone-based microlending has emerged as a potential solution to enhance financial inclusion. We propose a methodology to improve the predictive performance of credit scoring models used by these applications. Our approach is composed of several steps, where we mostly focus on engineering appropriate features from the user data. Thereby, we construct pseudo-social networks to identify similar people and combine complex network analysis with representation learning. Subsequently we build credit scoring models using advanced machine learning techniques with the goal of obtaining the most accurate credit scores, while also taking into consideration ethical and privacy regulations to avoid unfair discrimination. A successful deployment of our proposed methodology could improve the performance of microlending smartphone applications and help enhance financial wellbeing worldwide.

[1]  J. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research , 2015, Eur. J. Oper. Res..

[2]  Jose De Luna-Martinez,et al.  Financial inclusion in Malaysia : distilling lessons for other countries , 2017 .

[3]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[4]  David Martens,et al.  What does your Facebook profile reveal about your creditworthiness? Using alternative data for microfinance , 2019, J. Oper. Res. Soc..

[5]  Foster J. Provost,et al.  Mining Massive Fine-Grained Behavior Data to Improve Predictive Analytics , 2016, MIS Q..

[6]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[7]  Burcin Bozkaya,et al.  Money Walks: Implicit Mobility Behavior and Financial Well-Being , 2015, PloS one.

[8]  Alan Murray,et al.  Finding Similar Mobile Consumers with a Privacy-Friendly Geosocial Design , 2015, Inf. Syst. Res..

[9]  Bart Baesens,et al.  Social network analysis for customer churn prediction , 2014, Appl. Soft Comput..

[10]  Bart Baesens,et al.  Development and application of consumer credit scoring models using profit-based classification measures , 2014, Eur. J. Oper. Res..

[11]  Bart Baesens,et al.  Social network analytics for churn prediction in telco: Model building, evaluation and network architecture , 2017, Expert Syst. Appl..

[12]  Foster J. Provost,et al.  Classification in Networked Data: a Toolkit and a Univariate Case Study , 2007, J. Mach. Learn. Res..

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Harald Scheule,et al.  Credit risk analytics : measurement techniques, applications, and examples in SAS , 2016 .

[15]  H. V. Jagadish,et al.  Research Challenges in Financial Data Modeling and Analysis , 2017, Big Data.

[16]  Chrysanthos Dellarocas,et al.  Credit Scoring with Social Network Data , 2014, Mark. Sci..

[17]  João Gama,et al.  Credit Scoring in Microfinance Using Non-traditional Data , 2017, EPIA.

[18]  Bart Baesens,et al.  The value of big data for credit scoring: Enhancing financial inclusion using mobile phone data and social network analytics , 2019, Appl. Soft Comput..

[19]  Siva Viswanathan,et al.  Judging Borrowers by the Company They Keep: Friendship Networks and Information Asymmetry in Online Peer-to-Peer Lending , 2011, Manag. Sci..

[20]  Monique Snoeck,et al.  GOTCHA! Network-Based Fraud Detection for Social Security Fraud , 2017, Manag. Sci..

[21]  Bart Baesens,et al.  Predicting interpurchase time in a retail environment using customer-product networks: An empirical study and evaluation , 2018, Expert Syst. Appl..

[22]  Vasant Dhar,et al.  Prediction in Economic Networks , 2014, Inf. Syst. Res..

[23]  Vincent D. Blondel,et al.  A survey of results on mobile phone datasets analysis , 2015, EPJ Data Science.