Comparison of the histogram of oriented gradient, GLCM, and shape feature extraction methods for breast cancer classification using SVM

Breast cancer originates from the ducts or lobules of the breast and is the second leading cause of death after cervical cancer. Therefore, early breast cancer screening is required, one of which is mammography. Mammography images can be automatically identified using Computer-Aided Diagnosis by leveraging machine learning classifications. This study analyzes the Support Vector Machine (SVM) in classifying breast cancer. It compares the performance of three features extraction methods used in SVM, namely Histogram of Oriented Gradient (HOG), GLCM, and shape feature extraction. The dataset consists of 320 mammogram image data from MIAS containing 203 normal images and 117 abnormal images. Each extraction method used three kernels, namely Linear, Gaussian, and Polynomial. The shape feature extraction-SVM using Linear kernel shows the best performance with an accuracy of 98.44 %, sensitivity of 100 %, and specificity of 97.50 %.

[1]  N. Muhammad,et al.  Cancers , 2018, Examining the Causal Relationship Between Genes, Epigenetics, and Human Health.

[2]  Hamidullah Binol,et al.  Recognition of the stomach cancer images with probabilistic HOG feature vector histograms by using HOG features , 2017, 2017 IEEE 15th International Symposium on Intelligent Systems and Informatics (SISY).

[3]  Luthfiana Ratnawati,et al.  Penerapan Random Forest untuk Mengukur Tingkat Keparahan Penyakit pada Daun Apel , 2020, Jurnal Sains dan Seni ITS.

[4]  Nisreen I. R. Yassin,et al.  Machine learning techniques for breast cancer computer aided diagnosis using different image modalities: A systematic review , 2018, Comput. Methods Programs Biomed..

[5]  Nurajijah Nurajijah,et al.  Algoritma Naïve Bayes, Decision Tree, dan SVM untuk Klasifikasi Persetujuan Pembiayaan Nasabah Koperasi Syariah , 2019, Jurnal Teknologi dan Sistem Komputer.

[6]  David R. Dance,et al.  Mammographic Image Analysis Society (MIAS) database v1.21 , 2015 .

[7]  A. Jemal,et al.  Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries , 2018, CA: a cancer journal for clinicians.

[8]  A. Jemal,et al.  Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries , 2021, CA: a cancer journal for clinicians.

[9]  Ashlesha Deverakonda,et al.  Diagnosis and Treatment of Cervical Cancer: A Review , 2016 .

[10]  Chandra Mohan Bhuma,et al.  The Analysis of Digital Mammograms Using HOG and GLCM Features , 2018, 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[11]  Ahmad Hanif Asyhar,et al.  Cervical Cancer Identification Based Texture Analysis Using GLCM-KELM on Colposcopy Data , 2020, 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC).

[12]  Ahmad Hanif Asyhar,et al.  Whirlwind Classification with Imbalanced Upper Air Data Handling using SMOTE Algorithm and SVM Classifier , 2020, Journal of Physics: Conference Series.

[13]  Faisal Dharma Adhinata,et al.  People counter on CCTV video using histogram of oriented gradient and Kalman filter methods , 2020 .

[14]  Ahmad Hanif Asyhar,et al.  Automated Diagnosis System of Diabetic Retinopathy Using GLCM Method and SVM Classifier , 2018, 2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI).

[15]  Dian C. Rini Novitasari Klasifikasi Alzheimer dan Non Alzheimer Menggunakan Fuzzy C-Mean, Gray Level Co-Occurence Matrix dan Support Vector Machine , 2018 .

[16]  Dharun V.S,et al.  Extraction of Texture Features using GLCM and Shape Features using Connected Regions , 2016 .

[17]  Fitri Utaminingrum,et al.  Mammogram Breast Cancer Classification Using Gray-Level Co-Occurrence Matrix and Support Vector Machine , 2018, 2018 International Conference on Sustainable Information Engineering and Technology (SIET).

[18]  Shandong Wu,et al.  Breast Cancer Molecular Subtype Prediction by Mammographic Radiomic Features. , 2019, Academic radiology.

[19]  Derry Alamsyah,et al.  Pengenalan Mobil Pada Citra Digital Menggunakan HOG-SVM , 2017 .

[20]  Mohammed Y. Kamil,et al.  Texture Analysis of Mammogram Using Histogram of Oriented Gradients Method , 2020 .

[22]  A. Jemal,et al.  Cancer statistics, 2019 , 2019, CA: a cancer journal for clinicians.

[23]  Ahmad Hanif Asyhar,et al.  Application of Feature Extraction for Breast Cancer using One Order Statistic, GLCM, GLRLM, and GLDM , 2019, Advances in Science, Technology and Engineering Systems Journal.

[24]  Putroue Keumala Intan Comparison of Kernel Function on Support Vector Machine in Classification of Childbirth , 2019 .

[25]  Wen Shi,et al.  Risk Factors and Preventions of Breast Cancer , 2017, International journal of biological sciences.

[26]  S. Heywang-Köbrunner,et al.  Systematic review of 3D mammography for breast cancer screening. , 2016, Breast.

[27]  Makassari Dewi Sebaran Kanker di Indonesia, Riset Kesehatan Dasar 2007 , 2017, Indonesian Journal of Cancer.

[28]  Tzu-Tsung Wong,et al.  Dependency Analysis of Accuracy Estimates in k-Fold Cross Validation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[29]  Ahmad Hanif Asyhar,et al.  Classification of Colposcopy Data Using GLCM-SVM on Cervical Cancer , 2020, 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC).

[30]  Rahmat Robi Waliyansyah,et al.  Implementasi Metode Gray Level Co-occurrence Matrix dalam Identifikasi Jenis Daun Tengkawang , 2018 .

[31]  J. Brannen Mixing Methods: qualitative and quantitative research , 2017 .

[32]  Mark S. Nixon,et al.  Feature extraction & image processing for computer vision , 2012 .

[33]  B. Eswara Reddy,et al.  Detection and classification of normal and abnormal patterns in mammograms using deep neural network , 2019, Concurr. Comput. Pract. Exp..

[34]  S. Haryono,et al.  The predictive value of methylene blue dye as a single technique in breast cancer sentinel node biopsy: a study from Dharmais Cancer Hospital , 2017, World Journal of Surgical Oncology.

[35]  A. Alavudeen Basha,et al.  Classification of mammogram for early detection of breast cancer using SVM classifier and Hough transform , 2019, Measurement.

[36]  Abdolabbas Jafari,et al.  Evaluation of support vector machine and artificial neural networks in weed detection using shape features , 2018, Comput. Electron. Agric..

[37]  P. Prorok,et al.  Breast-Cancer Tumor Size, Overdiagnosis, and Mammography Screening Effectiveness. , 2016, The New England journal of medicine.

[38]  Agus Harjoko,et al.  Batik Classification with Artificial Neural Network Based on Texture-Shape Feature of Main Ornament , 2017 .

[39]  Feng Wu,et al.  Traffic sign recognition using HOG-SVM and grid search , 2014, 2014 12th International Conference on Signal Processing (ICSP).

[40]  Rung Ching Chen,et al.  Detection of COVID-19 chest X-ray using support vector machine and convolutional neural network , 2020 .

[41]  Ahmad Hanif Asyhar,et al.  Automatic Approach for Cervical Cancer Detection Based on Deep Belief Network (DBN) Using Colposcopy Data , 2020, 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC).