An Integrated Model for Financial Data Mining

Nowadays, financial data analysis is becoming increasingly important in the business market. As companies collect more and more data from daily operations, they expect to extract useful knowledge from existing collected data to help make reasonable decisions for new customer requests, e.g. user credit category, churn analysis, real estate analysis, etc. Financial institutes have applied different data mining techniques to enhance their business performance. However, simple approach of these techniques could raise a performance issue. Besides, there are very few general models for both understanding and forecasting different financial fields. We present in this paper an integrated model for analyzing financial data. We also evaluate this model with different real-world data to show its performance.

[1]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[2]  Gunnar Rätsch,et al.  Advanced Lectures on Machine Learning , 2004, Lecture Notes in Computer Science.

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  Halima Bensmail,et al.  Analyzing Imputed Financial Data: A New Approach to Cluster Analysis , 2004 .

[5]  Bo K. Wong,et al.  Neural network applications in business: A review and analysis of the literature (1988-1995) , 1997, Decis. Support Syst..

[6]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[7]  Daniel Sánchez,et al.  ART: A Hybrid Classification Model , 2004, Machine Learning.

[8]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[9]  Martin T. Hagan,et al.  Neural network design , 1995 .

[10]  David Heckerman,et al.  Bayesian Networks for Knowledge Discovery , 1996, Advances in Knowledge Discovery and Data Mining.

[11]  Inderjit S. Dhillon,et al.  A Data-Clustering Algorithm on Distributed Memory Multiprocessors , 1999, Large-Scale Parallel Data Mining.

[12]  Ingoo Han,et al.  Hybrid genetic algorithms and support vector machines for bankruptcy prediction , 2006, Expert Syst. Appl..

[13]  Zikrija Avdagic,et al.  On-line evolving clustering for financial statements' anomalies detection , 2009, 2009 XXII International Symposium on Information, Communication and Automation Technologies.

[14]  Vijay K Chaudhari,et al.  Neural network learning improvement using K-means clustering algorithm to improve the performance of web traffic mining , 2011, 2011 3rd International Conference on Electronics Computer Technology.

[15]  J. Ross Quinlan Learning First-Order Definitions of Functions , 1996, J. Artif. Intell. Res..

[16]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[18]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[19]  Mohammed J. Zaki,et al.  Large-Scale Parallel Data Mining , 2002, Lecture Notes in Computer Science.

[20]  Rüdiger W. Brause,et al.  Neural data mining for credit card fraud detection , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[21]  Ying Huang,et al.  A Rule-Based Method for Customer Churn Prediction in Telecommunication Services , 2011, PAKDD.

[22]  Andreas S. Weigend,et al.  Data Mining in Finance: Report from the Post-Nncm-96 Workshop on Teaching Computer Intensive Methods for Financial Modeling and Data Analysis , 1997 .

[23]  Ingoo Han,et al.  Hybrid neural network models for bankruptcy predictions , 1996, Decis. Support Syst..

[24]  Deborah R. Carvalho,et al.  A hybrid decision tree/genetic algorithm method for data mining , 2004, Inf. Sci..