Database Classification by Integrating a Case-Based Reasoning and Support Vector Machine for Induction

Database classification suffers from two common problems, i.e., the high dimensionality and nonstationary variations within the large historic data. This paper presents a hybrid classification model by integrating a case-based reasoning technique, a Support Vector Machine (SVM), and Genetic Algorithms to construct a decision-making system for data classification in various database applications. The model is mainly based on the concept that the historic database can be transformed into a smaller case-base together with a group of SVM models. As a result, the model can more accurately respond to the current data under classifying from the inductions by these SVM models generated from these smaller case bases. Hit rate is applied as a performance measure and the effectiveness of our proposed model is demonstrated by experimentally compared with other approaches on different database classification applications. The average hit rate of our proposed model is the highest among others.

[1]  Reshma Khemchandani,et al.  Knowledge based proximal support vector machines , 2009, Eur. J. Oper. Res..

[2]  Seyed Kamaledin Setarehdan,et al.  Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal , 2008, Artif. Intell. Medicine.

[3]  Jeen-Shing Wang,et al.  Self-adaptive neuro-fuzzy inference systems for classification applications , 2002, IEEE Trans. Fuzzy Syst..

[4]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[5]  Pei-Chann Chang,et al.  A hybrid model by clustering and evolving fuzzy rules for sales decision supports in printed circuit board industry , 2006, Decis. Support Syst..

[6]  Paris A. Mastorocostas,et al.  A constrained orthogonal least-squares method for generating TSK fuzzy models: Application to short-term load forecasting , 2001, Fuzzy Sets Syst..

[7]  James Nga-Kwok Liu,et al.  Smart shopper: an agent-based web-mining approach to Internet shopping , 2003, IEEE Trans. Fuzzy Syst..

[8]  Manuel Landajo,et al.  Forecasting business profitability by using classification techniques: A comparative analysis based on a Spanish case , 2005, Eur. J. Oper. Res..

[9]  Kin Keung Lai,et al.  Mining Stock Market Tendency Using GA-Based Support Vector Machines , 2005, WINE.

[10]  Pei-Chann Chang,et al.  A TSK type fuzzy rule based system for stock price prediction , 2008, Expert Syst. Appl..

[11]  Francis Eng Hock Tay,et al.  Modified support vector machines in financial time series forecasting , 2002, Neurocomputing.

[12]  Kun Chang Lee,et al.  An intelligent approach to time series identification by a neural network-driven decision tree classifier , 1996, Decis. Support Syst..

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Emre Çomak,et al.  A new training method for support vector machines: Clustering k-NN support vector machines , 2008, Expert Syst. Appl..

[15]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[16]  S. Yoo,et al.  Support Vector Machine Based Arrhythmia Classification Using Reduced Features , 2005 .

[17]  Shokri Z. Selim,et al.  K-Means-Type Algorithms: A Generalized Convergence Theorem and Characterization of Local Optimality , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Roshdy S. Youssif,et al.  Combining genetic algorithms and neural networks to build a signal pattern classifier , 2004, Neurocomputing.

[19]  Xiaonan Li,et al.  Operations research and data mining , 2008, Eur. J. Oper. Res..

[20]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[21]  Chin-Teng Lin,et al.  An efficient quantum neuro-fuzzy classifier based on fuzzy entropy and compensatory operation , 2008, Soft Comput..

[22]  Erwie Zahara,et al.  A hybridized approach to data clustering , 2008, Expert Syst. Appl..

[23]  Ioannis B. Theocharis,et al.  A modified PNN algorithm with optimal PD modeling using the orthogonal least squares method , 2004, Inf. Sci..