Improvement in automated diagnosis of soft tissues tumors using machine learning

Soft Tissue Tumors (STT) are a form of sarcoma found in tissues that connect, support, and surround body structures. Because of their shallow frequency in the body and their great diversity, they appear to be heterogeneous when observed through Magnetic Resonance Imaging (MRI). They are easily confused with other diseases such as fibroadenoma mammae, lymphadenopathy, and struma nodosa, and these diagnostic errors have a considerable detrimental effect on the medical treatment process of patients. Researchers have proposed several machine learning models to classify tumors, but none have adequately addressed this misdiagnosis problem. Also, similar studies that have proposed models for evaluation of such tumors mostly do not consider the heterogeneity and the size of the data. Therefore, we propose a machine learning-based approach which combines a new technique of preprocessing the data for features transformation, resampling techniques to eliminate the bias and the deviation of instability and performing classifier tests based on the Support Vector Machine (SVM) and Decision Tree (DT) algorithms. The tests carried out on dataset collected in Nur Hidayah Hospital of Yogyakarta in Indonesia show a great improvement compared to previous studies. These results confirm that machine learning methods could provide efficient and effective tools to reinforce the automatic decision-making processes of STT diagnostics.

[1]  B Julesz,et al.  Inability of Humans to Discriminate between Visual Textures That Agree in Second-Order Statistics—Revisited , 1973, Perception.

[2]  Vahid Mirjalili,et al.  Python machine learning : machine learning and deep learning with Python, scikit-learn, and TensorFlow , 2017 .

[3]  L. Claude,et al.  [PNET/Ewing tumours: current treatments and future perspectives]. , 2010, Bulletin du cancer.

[4]  Hassan Silkan,et al.  Optimizing the prediction of telemarketing target calls by a classification technique , 2018, 2018 6th International Conference on Wireless Networks and Mobile Communications (WINCOM).

[5]  Zuherman Rustam,et al.  Comparison between Fuzzy Kernel C-Means and Sparse Learning Fuzzy C-Means for Breast Cancer Clustering , 2018, 2018 International Conference on Applied Information Technology and Innovation (ICAITI).

[6]  John M. Boone,et al.  A breast density index for digital mammograms based on radiologists’ randing , 1998, Journal of Digital Imaging.

[7]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[8]  R. Rifkin,et al.  Infinite-σ Limits For Tikhonov Regularization , 2006 .

[9]  H. Aburatani,et al.  Biological characterization of soft tissue sarcomas. , 2015, Annals of translational medicine.

[10]  Dar-Ren Chen,et al.  Diagnosis of breast tumors with ultrasonic texture analysis using support vector machines , 2006, Neural Computing & Applications.

[11]  Usman Qamar,et al.  An Efficient Rule-Based Classification of Diabetes Using ID3, C4.5, & CART Ensembles , 2014, 2014 12th International Conference on Frontiers of Information Technology.

[12]  Robert J. Gillies,et al.  Prediction of treatment outcome in soft tissue sarcoma based on radiologically defined habitats , 2015, Medical Imaging.

[13]  C. Fletcher The evolving classification of soft tissue tumours – an update based on the new 2013 WHO classification , 2014, Histopathology.

[14]  Neeraj Kumar,et al.  Decision Tree and SVM-Based Data Analytics for Theft Detection in Smart Grid , 2016, IEEE Transactions on Industrial Informatics.

[15]  C. Fletcher The evolving classification of soft tissue tumours: an update based on the new WHO classification , 2006, Histopathology.

[16]  J. Bloem,et al.  Soft Tissue Tumors: Grading, Staging, and Tissue-Specific Diagnosis , 2007, Topics in magnetic resonance imaging : TMRI.

[17]  Sophia Daskalaki,et al.  Imbalanced customer classification for bank direct marketing , 2017, Journal of Marketing Analytics.

[18]  F. Cendes,et al.  Texture analysis of medical images. , 2004, Clinical radiology.

[19]  Zuherman Rustam,et al.  Soft Tissue Tumor Classification using Stochastic Support Vector Machine , 2019, IOP Conference Series: Materials Science and Engineering.

[20]  Mirko Francesconi,et al.  Overcoming resistance to conventional drugs in Ewing sarcoma and identification of molecular predictors of outcome. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[21]  S. Young,et al.  Internationalization and competitive catch-up processes: case study evidence on Chinese multinational enterprises , 1996 .

[22]  P. Parizel,et al.  Classification of Soft Tissue Tumors by Machine Learning Algorithms , 2011 .

[23]  Steven L. Salzberg,et al.  Book Review: C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993 , 1994, Machine Learning.

[24]  Jan Sijbers,et al.  Machine learning study of several classifiers trained with texture analysis features to differentiate benign from malignant soft‐tissue tumors in T1‐MRI images , 2010, Journal of magnetic resonance imaging : JMRI.

[25]  Bijaya K. Panigrahi,et al.  Prediction Interval Estimation of Electricity Prices Using PSO-Tuned Support Vector Machines , 2015, IEEE Transactions on Industrial Informatics.

[26]  J. Coindre,et al.  Sarcomes des tissus mous : données anatomopathologiques actuelles , 2006 .

[27]  Chih-Jen Lin,et al.  Training and Testing Low-degree Polynomial Data Mappings via Linear SVM , 2010, J. Mach. Learn. Res..

[28]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[29]  Hassan Silkan,et al.  A data modeling approach for classification problems: application to bank telemarketing prediction , 2019, NISS19.

[30]  Daisuke Komura,et al.  Machine Learning Methods for Histopathological Image Analysis , 2017, Computational and structural biotechnology journal.

[31]  Lan Wang,et al.  Application of Improved Decision Tree Method based on Rough Set in Building Smart Medical Analysis CRM System , 2016 .

[32]  F. Mertens,et al.  World Health Organization Classification of Tumours. Pathology and Genetics of Tumours of Soft Tissue and Bone , 2002 .

[33]  V. Mascarenhas,et al.  Imaging techniques for the diagnosis of soft tissue tumors , 2015 .

[34]  J. Coindre,et al.  [Fourth edition of WHO classification tumours of soft tissue]. , 2015, Annales de pathologie.

[35]  Tariq Samad,et al.  Imputation of Missing Data in Industrial Databases , 1999, Applied Intelligence.

[36]  Joon Beom Seo,et al.  Performance testing of several classifiers for differentiating obstructive lung diseases based on texture analysis at high-resolution computerized tomography (HRCT) , 2009, Comput. Methods Programs Biomed..