Credit quality assessments using manifold based semi-supervised discriminant analysis and support vector machines

Due to the large scale of financial data in credit quality forecasting, dimensionality reduction is a key step to enhance classifier performance. By using manifold based semi-supervised discriminant analysis (SSDA) and support vector machines, this study develops a novel prediction system for credit quality assessment, where SSDA makes efficient use of labeled and unlabeled (testing) data points to gain a perfect low dimensional approximation of data manifold and simultaneously maintain the discriminating power. More specifically, the labeled data points are used to maximize the separability between different classes, and the testing data points are used to estimate the intrinsic geometric structure of the data space. Empirical results indicate that SSDA outperforms other dimensionality reduction methods with a significant performance improvement, and our hybrid classifier substantially outperforms other conventional classifiers.

[1]  Jiawei Han,et al.  Semi-supervised Discriminant Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[2]  Jen-Ying Shih,et al.  A study of Taiwan's issuer credit rating systems using support vector machines , 2006, Expert Syst. Appl..

[3]  Erkki Oja,et al.  Independent Component Analysis , 2001 .

[4]  Hussein A. Abdou,et al.  Neural nets versus conventional techniques in credit scoring in Egyptian banking , 2008, Expert Syst. Appl..

[5]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[6]  Jason Weston,et al.  Support vector machines for multi-class pattern recognition , 1999, ESANN.

[7]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[8]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[9]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[10]  Li-Chiu Chi,et al.  Neural networks analysis in business failure prediction of Chinese importers: A between-countries approach , 2005, Expert Syst. Appl..

[11]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[12]  E. Oja,et al.  Independent Component Analysis , 2001 .

[13]  Mikhail Belkin,et al.  Beyond the point cloud: from transductive to semi-supervised learning , 2005, ICML.

[14]  Mikhail Belkin,et al.  Manifold Regularization : A Geometric Framework for Learning from Examples , 2004 .

[15]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[16]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[17]  J. Crook,et al.  Credit scoring using neural and evolutionary techniques , 2000 .

[18]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[19]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[20]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.