Analyzing credit risk among Chinese P2P-lending businesses by integrating text-related soft information

Abstract Text-related soft information effectively alleviates the information asymmetry associated with P2P lending and reduces credit risk. Most existing studies use nonsemantic text information to construct credit evaluation models and predict the borrower's level of risk. However, the semantic information also reflect the ability and willingness of borrowers to repay and might be able to explain borrowers’ credit statuses. This paper examines whether semantic loan description text information helps predict the credit risk of different types of borrowers using a Chinese P2P platform. We use the 5P credit evaluation theory and the word embedding model to extract the semantic features of loan descriptions across five dimensions. Then, the AdaBoost ensemble learning strategy is applied to construct a credit evaluation model to improve the learning performance of an intelligent algorithm. The extracted semantic features are integrated into the evaluation model to study their explanatory ability with regard to the credit status of different types of borrowers. We conducted empirical research on the Renrendai P2P platform. Our conclusions show that the semantic features of textual soft information significantly improve the predictability of credit evaluation models and that the promotion effect is most significant for first-time borrowers. This paper has important practical significance for P2P platforms and the credit risk management of lenders. Furthermore, it has theoretical value for research concerning heterogeneous information-based credit risk analysis methods in big data environments.

[1]  Chih-Chou Chiu,et al.  Credit scoring using the hybrid neural discriminant technique , 2002, Expert Syst. Appl..

[2]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[3]  Siva Viswanathan,et al.  Judging Borrowers by the Company They Keep: Friendship Networks and Information Asymmetry in Online Peer-to-Peer Lending , 2011, Manag. Sci..

[4]  P. Nath,et al.  A model of trust in online relationship banking , 2003 .

[5]  Joakim Nivre,et al.  Real-valued Syntactic Word Vectors (RSV) for Greedy Neural Dependency Parsing , 2017, NODALIDA.

[6]  R. Malhotra,et al.  Evaluating Consumer Loans using Neural Networks , 2003 .

[7]  Kyung Sup Kwak,et al.  Opinion mining based on fuzzy domain ontology and Support Vector Machine: A proposal to automate online review classification , 2016, Appl. Soft Comput..

[8]  Mingfeng Lin,et al.  Lemon or Cherry? the Value of Texts in Debt Crowdfunding Lemon or Cherry? the Value of Texts in Debt Crowdfunding , 2022 .

[9]  Martin L. Martens,et al.  DO THE STORIES THEY TELL GET THEM THE MONEY THEY NEED? THE ROLE OF ENTREPRENEURIAL NARRATIVES IN RESOURCE ACQUISITION , 2007 .

[10]  Rajkamal Iyer,et al.  Screening Peers Softly: Inferring the Quality of Small Borrowers , 2009, Manag. Sci..

[11]  Zhao Wang,et al.  Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending , 2018, Ann. Oper. Res..

[12]  Jan Sedivý,et al.  Deep Neural Networks for Web Page Information Extraction , 2016, AIAI.

[13]  Richard Tzong-Han Tsai,et al.  A Telecom-Domain Online Customer Service Assistant Based on Question Answering with Word Embedding and Intent Classification , 2017, IJCNLP.

[14]  Hussein A. Abdou An evaluation of alternative scoring models in private banking , 2009 .

[15]  Johan A. K. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring , 2003, J. Oper. Res. Soc..

[16]  Xiaohui Yan,et al.  A biterm topic model for short texts , 2013, WWW.

[17]  Robert A. Eisenbeis,et al.  Problems in applying discriminant analysis in credit scoring models , 1978 .

[18]  Nitin Bhatt,et al.  Determinants of Repayment in Microcredit: Evidence from Programs in the United States , 2002 .

[19]  Miguel Costa,et al.  A data mining based system for credit-card fraud detection in e-tail , 2017, Decis. Support Syst..

[20]  R. BliegeBird,et al.  Signaling Theory, Strategic Interaction, and Symbolic Capital1 , 2005, Current Anthropology.

[21]  B. Funk,et al.  Online Peer-to-Peer Lending - A Literature Review , 2011 .

[22]  Chris Hendry,et al.  Ordering Top Pay: Interpreting the Signals* , 2005 .

[23]  Difang Wan,et al.  Group social capital and lending outcomes in the financial credit market: An empirical study of online peer-to-peer lending , 2016, Electron. Commer. Res. Appl..

[24]  Walter Aerts,et al.  Does media reputation affect properties of accounts payable , 2015 .

[25]  Vijay S. Desai,et al.  A comparison of neural networks and linear scoring models in the credit union environment , 1996 .

[26]  Jeremy Michels Do Unverifiable Disclosures Matter? Evidence from Peer-to-Peer Lending , 2012 .

[27]  Sung-Hyon Myaeng,et al.  Toward advice mining: conditional random fields for extracting advice-revealing text units , 2013, CIKM.

[28]  J. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research , 2015, Eur. J. Oper. Res..

[29]  J. Wiginton A Note on the Comparison of Logit and Discriminant Models of Consumer Credit Behavior , 1980, Journal of Financial and Quantitative Analysis.

[30]  M. Spence Job Market Signaling , 1973 .

[31]  Junping Du,et al.  A semantic modeling method for social network short text based on spatial and temporal characteristics , 2017, J. Comput. Sci..

[32]  Yibo Wang,et al.  Leveraging deep learning with LDA-based text analytics to detect automobile insurance fraud , 2018, Decis. Support Syst..

[33]  A. Ghose,et al.  Cultural Differences and Geography as Determinants of Online Prosocial Lending , 2014, MIS Q..

[34]  Hussein A. Abdou,et al.  Credit Scoring, Statistical Techniques and Evaluation Criteria: A Review of the Literature , 2011, Intell. Syst. Account. Finance Manag..

[35]  D. Durand Risk elements in consumer instalment financing , 1942 .

[36]  Gregor Dorfleitner,et al.  Description-text related soft information in peer-to-peer lending – Evidence from two leading European platforms , 2016 .

[37]  J. Cramer,et al.  Scoring Bank Loans that may go wrong – A Case Study , 2000 .

[38]  Juan Feng,et al.  Borrower’s default and self-disclosure of social media information in P2P lending , 2016 .

[39]  Hui Li,et al.  Listed companies' financial distress prediction based on weighted majority voting combination of multiple classifiers , 2008, Expert Syst. Appl..

[40]  Xi Lin,et al.  Judging online peer-to-peer lending behavior: A comparison of first-time and repeated borrowing requests , 2016, Inf. Manag..

[41]  Dean S. Karlan,et al.  Social Connections and Group Banking , 2005 .

[42]  Huimin Zhao,et al.  A prediction-driven mixture cure model and its application in credit scoring , 2019, Eur. J. Oper. Res..

[43]  Thomas L. Albright,et al.  An investigation of the effect of Balanced Scorecard implementation on financial performance , 2004 .

[44]  Stephen S. Standifird Reputation and e-commerce: eBay auctions and the asymmetrical impact of positive and negative ratings , 2001 .

[45]  Steven Finlay,et al.  Multiple classifier architectures and their application to credit risk assessment , 2011, Eur. J. Oper. Res..

[46]  ZhouLina,et al.  Group social capital and lending outcomes in the financial credit market , 2016 .

[47]  FuXin,et al.  Judging online peer-to-peer lending behavior , 2016 .

[48]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[49]  Alok Gupta,et al.  Putting Money Where the Mouths Are: The Relation Between Venture Financing and Electronic Word-of-Mouth , 2012, Inf. Syst. Res..

[50]  Petr Hájek,et al.  Municipal credit rating modelling by neural networks , 2011, Decis. Support Syst..

[51]  Chi Fai Cheung,et al.  Mining knowledge from natural language texts using fuzzy associated concept mapping , 2008, Inf. Process. Manag..