Hybrid decision tree based on inferred attribute

This paper proposes a novel hybrid approach based on Regression, Rough sets and a Classifier aiming at effective dimensionality reduction henceforth a classifier with less time complexity. It focuses on dimension enhancement by embedding Inferred Attribute into the decision table, named as Augmented Decision Table (ADT). An effective minimal set of attributes (reduct) are derived by employing Rough set theory on ADT. The projected ADT based on the reduct is used for building a classifier. The philosophy is demonstrated by considering popular functional forms for Inferred Attribute like Linear Regression and Quadratic Regression and by adopting CART as the classifier. It is observed that inclusion of Inferred Attribute reduces the size of the reduct significantly without compromising the accuracy. However improvement in classification time depends on the choice of the Inferred Attribute model form.

[1]  K. Hoshi,et al.  An analysis of thyroid function diagnosis using Bayesian-type and SOM-type neural networks. , 2005, Chemical & pharmaceutical bulletin.

[2]  Chih-Jen Lin,et al.  A Simple Decomposition Method for Support Vector Machines , 2002, Machine Learning.

[3]  K.S. Nikita,et al.  Classification of medical data with a robust multi-level combination scheme , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[4]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[5]  Wang Rong,et al.  New Decision Tree Algorithm , 2009 .

[6]  Peter L. Hammer,et al.  Logical analysis of data—An overview: From combinatorial optimization to medical applications , 2006, Ann. Oper. Res..

[7]  Filippo Menczer,et al.  Feature selection in data mining , 2003 .

[8]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.

[9]  Martin D. Buhmann,et al.  Radial Basis Functions , 2021, Encyclopedia of Mathematical Geosciences.

[10]  Joseph L. Fleiss,et al.  An Introduction to Applied Probability , 2004 .

[11]  Marcin S. Szczuka,et al.  RSES and RSESlib - A Collection of Tools for Rough Set Computations , 2000, Rough Sets and Current Trends in Computing.

[12]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[13]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[14]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[15]  Richard Nock,et al.  A hybrid filter/wrapper approach of feature selection using information theory , 2002, Pattern Recognit..

[16]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[17]  David A. Bell,et al.  A Formalism for Relevance and Its Application in Feature Subset Selection , 2000, Machine Learning.

[18]  Durga Toshniwal,et al.  Hybrid prediction model for Type-2 diabetic patients , 2010, Expert Syst. Appl..

[19]  Mu-Chen Chen,et al.  Prediction model building and feature selection with support vector machines in breast cancer diagnosis , 2008, Expert Syst. Appl..

[20]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[21]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[22]  P. M. Mujumdar,et al.  Ranking based uncertainty quantification for a multifidelity design approach , 2007 .

[23]  T. Yıldırım,et al.  MEDICAL DIAGNOSIS ON PIMA INDIAN DIABETES USING GENERAL REGRESSION NEURAL NETWORKS , 2003 .

[24]  Martin D. Buhmann,et al.  Radial Basis Functions: Theory and Implementations: Preface , 2003 .

[25]  Nikola K. Kasabov,et al.  On-line pattern analysis by evolving self-organizing maps , 2003, Neurocomputing.

[26]  C. Chatfield Probability and statistics in engineering and management science , 1973 .

[27]  K. Saastamoinen,et al.  Medical Data Classification using Logical Similarity Based Measures , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.

[28]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[29]  Yongqing Zheng,et al.  A New Decision Tree Algorithm Based on Rough Set Theory , 2009, 2009 Asia-Pacific Conference on Information Processing.

[30]  Sonajharia Minz,et al.  Rough Set Based Decision Tree Model for Classification , 2003, DaWaK.

[31]  Pedro Larrañaga,et al.  Dimensionality Reduction in Unsupervised Learning of Conditional Gaussian Networks , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  K Sudhakar,et al.  Customized Regression Model for Improving Low Fidelity Analysis Tool , 2006 .