Framework to predict NPA/Willful defaults in corporate loans: a big data approach

Growth and development of the economy is dependent on the banking system. Bad loans which are Non-Performing Assets (NPA) are the measure for assessing the financial health of the bank. It is very important to control NPA as it affects the profitability, and deteriorates the quality of assets of the bank. It is observed that there is a significant rise in the number of willful defaulters. Hence systematic identification, awareness and assessment of parameters is essential for early prediction of willful default behavior. The main objective of the paper is to identify exhaustive list of parameters essential for predicting whether the loan will become NPA and thereby willful default. This process includes understanding of existing system to check NPAs and identifying the critical parameters. Also propose a framework for NPA/Willful default identification. The framework classifies the data comprising of structured and unstructured parameters as NPA/Willful default or not. In order to select the best classification model in the framework an experimentation is conducted on loan dataset on big data platform. Since the loan data is structured, unstructured component is incorporated by generating synthetic data. The results indicate that neural network model gives best accuracy and hence considered in the framework.

[1]  J. Hossen,et al.  A Survey of Machine Learning Techniques for Self-tuning Hadoop Performance , 2018 .

[2]  Suharjito Suharjito,et al.  Failure prediction of e-banking application system using adaptive neuro fuzzy inference system (ANFIS) , 2019, International Journal of Electrical and Computer Engineering (IJECE).

[3]  Wenjun Wang,et al.  Training Backpropagation Neural Network in MapReduce , 2014, INFOCOM 2014.

[4]  Yong Lu,et al.  P2P Lending Fraud Detection: A Big Data Approach , 2015, PAISI.

[5]  Radhika M. Pai,et al.  Stock market prediction: A big data approach , 2015, TENCON 2015 - 2015 IEEE Region 10 Conference.

[6]  Vivek Mukherjee,et al.  Willful Default In Developing Country Banking System: A Theoretical Exercise , 2013 .

[7]  D. Glennon,et al.  An Analysis of SBA Loan Defaults by Maturity Structure , 2005 .

[8]  S. Lai An analysis of private loan guarantees , 1992 .

[9]  Rinkle Rani,et al.  A Novel Approach for Clustering Big Data based on MapReduce , 2018 .

[10]  Kamal Eddine El Kadiri,et al.  A Novel Hybrid Classification Approach for Sentiment Analysis of Text Document , 2018 .

[11]  J. Sinkey,et al.  Loan-loss experience and risk-taking behavior at large commercial banks , 1991 .

[12]  R. Kant,et al.  Frauds in the Indian Banking Industry , 2016 .

[13]  Songtao Zheng,et al.  Naïve Bayes Classifier: A MapReduce Approach , 2014 .

[14]  K. Subrahmanyam,et al.  Big Data and MapReduce Challenges, Opportunities and Trends , 2016 .

[15]  Catastrophic Default and Credit Risk for Lending Institutions , 1999 .

[16]  Ferhat Özgür Çatak,et al.  A MapReduce based distributed SVM algorithm for binary classification , 2013, ArXiv.