A Hybrid Selection Method Based on HCELFS and SVM for the Diagnosis of Oral Cancer Staging

A diagnostic model based on Support Vector Machines (SVM) with a proposed hybrid feature selection method is developed to diagnose the stage of oral cancer in patients. The hybrid feature selection method, named Hybrid Correlation Evaluator and Linear Forward Selection (HCELFS), combines the advantages of filters and wrappers to select the optimal feature subset from the original feature set. In HCELFS, Correlation Attribute Evaluator acts as filters to remove redundant features and Linear Forward Selection with SVM acts as the wrappers to select the ideal feature subset from the remaining features. This study conducted experiments in WEKA with ten fold cross validation. The experimental results with oral cancer data sets demonstrate that our proposed model has a better performance than well-known feature selection algorithms.

[1]  A. Micheli,et al.  Experimental Comparison of Machine Learning Approaches to Medical Domains: a Case Study of Genotype Influence on Oral Cancer Development , 2005 .

[2]  Weixin Xie,et al.  A Novel Hybrid Feature Selection Method Based on IFSFFS and SVM for the Diagnosis of Erythemato-Squamous Diseases , 2010, WAPA.

[3]  Gaston L'Huillier,et al.  SVM-Based Feature Selection and Classification for Email Filtering , 2013 .

[4]  Toshiyuki Kawazu,et al.  Application of neural networks to the prediction of lymph node metastasis in oral cancer , 2006, Oral Radiology.

[5]  John G.F. Cleland,et al.  Feature Selection Approaches with Missing Values Handling for Data Mining - A Case Study of Heart Failure Dataset , 2011 .

[6]  Yorgos Goletsis,et al.  Multiparametric Decision Support System for the Prediction of Oral Cancer Reoccurrence , 2012, IEEE Transactions on Information Technology in Biomedicine.

[7]  P. Jaganathan,et al.  A Comparative Study of Improved F-Score with Support Vector Machine and RBF Network for Breast Cancer Classification , 2012 .

[8]  Carlos J. Perez,et al.  Computer-aided diagnosis system: A Bayesian hybrid classification method , 2013, Comput. Methods Programs Biomed..

[9]  Nyi Nyi Naing,et al.  Oral cancer presentation among Malay patients in hospital Universiti Sains Malaysia, Kelantan. , 2009, Asian Pacific journal of cancer prevention : APJCP.

[10]  W. Neville,et al.  Oral and Maxillofacial Pathology , 2004 .

[11]  Hiok Chai Quek,et al.  GenSo-FDSS: a neural-fuzzy decision support system for pediatric ALL cancer subtype identification using gene expression data , 2005, Artif. Intell. Medicine.

[12]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[13]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[14]  Sameem Abdul Kareem,et al.  Oral cancer prognosis based on clinicopathologic and genomic markers using a hybrid of feature selection and machine learning methods , 2013, BMC Bioinformatics.

[15]  Noor Maizura Mohamad Noor,et al.  Data preparation for pre-processing on oral cancer dataset , 2013, 2013 13th International Conference on Control, Automation and Systems (ICCAS 2013).

[16]  Noor Maizura Mohamad Noor,et al.  Demographic profile of oral cancer patients in east coast of peninsular Malaysia , 2013 .