论文信息 - On the classification techniques in data mining for microarray data classification

On the classification techniques in data mining for microarray data classification

Cancer is one of the deadly diseases, according to data from WHO by 2015 there are 8.8 million more deaths caused by cancer, and this will increase every year if not resolved earlier. Microarray data has become one of the most popular cancer-identification studies in the field of health, since microarray data can be used to look at levels of gene expression in certain cell samples that serve to analyze thousands of genes simultaneously. By using data mining technique, we can classify the sample of microarray data thus it can be identified with cancer or not. In this paper we will discuss some research using some data mining techniques using microarray data, such as Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5, and simulation of Random Forest algorithm with technique of reduction dimension using Relief. The result of this paper show performance measure (accuracy) from classification algorithm (SVM, ANN, Naive Bayes, kNN, C4.5, and Random Forets).The results in this paper show the accuracy of Random Forest algorithm higher than other classification algorithms (Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5). It is hoped that this paper can provide some information about the speed, accuracy, performance and computational cost generated from each Data Mining Classification Technique based on microarray data.

Adiwijaya | Husna Aydadenta | Husna Aydadenta

[1] Ramón Díaz-Uriarte,et al. Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[2] Adiwijaya,et al. CANCER DETECTION BASED ON MICROARRAY DATA CLASSIFICATION USING PCA AND MODIFIED BACK PROPAGATION , 2016 .

[3] Duncan Fyfe Gillies,et al. A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data , 2015, Adv. Bioinformatics.

[4] Mohd Saberi Mohamad,et al. Random forest for gene selection and microarray data classification , 2011, Bioinformation.

[5] Hua Wang,et al. A Comparative Study of Classification Methods For Microarray Data Analysis , 2006, AusDM.

[6] C. Devi Arockia Vanitha,et al. Gene Expression Data Classification Using Support Vector Machine and Mutual Information-based Gene Selection☆ , 2015 .

[7] C. Gunavathi,et al. Classification of Microarray Data Based OnFeature Selection Method , 2014 .

[8] Daniel T. Larose,et al. Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[9] Ch. Ravi Sekhar,et al. Multimodal Choice Modeling Using Random Forest Decision Trees , 2016 .