Few-shot decision tree for diagnosis of ultrasound breast tumor using BI-RADS features

This paper proposes an ultrasound breast tumor CAD system based on BI-RADS features scoring and decision tree algorithm. Because of the difficulty of biopsy label collection, the proposed system adopts a few-shot learning method. The SVM classifier is employed to preliminarily mark the unlabeled cases firstly. Then these unlabeled cases with the pseudo labels are combined with the few real-labeled cases to train the decision tree. To test the performance of the proposed method, 1208 ultrasound breast images were collected, and three well-experienced clinicians and three interns evaluated these images according to the BI-RADS scoring scheme. All of the images are transformed into vectors such that the algorithm can process. The experimental results show that the system performance improves significantly with the help of pseudo-labeled data. Compared to the decision tree trained by the real-labeled cases only, when the number of real-labeled cases was 40, the accuracy, specificity, sensitivity of the proposed system were increased by 2.05%, 2.47% and 1.81%, respectively; the positive predictive value (PPV) and the negative predictive value (NVP) were increased by 1.29% and 3.05%, respectively. Meanwhile, the performance of the proposed method was the same as the method using sufficient samples. When the number of the labeled cases reached 100, the accuracy, specificity, sensitivity, PPV and NVP of the proposed method were 90.03%, 87.02%, 91.68%, 93.07%, and 85.03%, respectively. The results demonstrate that our method can efficiently distinguish the breast tumor although the labeled data is not sufficient.

[1]  A. Jemal,et al.  Breast Cancer Statistics , 2013 .

[2]  Xiao Liu,et al.  An iterated Laplacian based semi-supervised dimensionality reduction for classification of breast cancer on ultrasound images , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Juan Shan,et al.  Computer-Aided Diagnosis for Breast Ultrasound Using Computerized BI-RADS Features and Machine Learning Methods. , 2016, Ultrasound in medicine & biology.

[4]  Jie Zhu,et al.  Shearlet-based texture feature extraction for classification of breast tumor in ultrasound image , 2013, Biomed. Signal Process. Control..

[5]  Ling Shao,et al.  Efficient dictionary learning for visual categorization , 2014, Comput. Vis. Image Underst..

[6]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Xiao Liu,et al.  Stacked deep polynomial network based representation learning for tumor classification with small ultrasound image dataset , 2016, Neurocomputing.

[9]  Ling Shao,et al.  Learning to Hash With Optimized Anchor Embedding for Scalable Retrieval , 2017, IEEE Transactions on Image Processing.

[10]  Xuelong Li,et al.  Automatic segmentation of breast lesions for interaction in ultrasonic computer-aided diagnosis , 2015, Inf. Sci..

[11]  Xuelong Li,et al.  Semi-supervised classification via both label and side information , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Wei Wang,et al.  A new fuzzy Gaussian mixture model (FGMM) based algorithm for mammography tumor image classification , 2017, Multimedia Tools and Applications.

[13]  Ling Shao,et al.  Learning Computational Models of Video Memorability from fMRI Brain Imaging , 2015, IEEE Transactions on Cybernetics.

[14]  Jianxin Li,et al.  Analysis and Modeling for Big Data in Cancer Research , 2017, BioMed research international.

[15]  Ruey-Feng Chang,et al.  Breast ultrasound computer-aided diagnosis using BI-RADS features. , 2007, Academic radiology.

[16]  L. Brieman,et al.  Classification and Regression Trees , 2020, Biostatistics with R.

[17]  Markus Hahn,et al.  ' s response to reviews Title : Early detection of breast cancer : Benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue , 2009 .

[18]  Xuelong Li,et al.  A Novel Segmentation Approach Combining Region- and Edge-Based Information for Ultrasound Images , 2017, BioMed research international.

[19]  Qi Zhang,et al.  Deep learning based classification of breast tumors with shear-wave elastography. , 2016, Ultrasonics.

[20]  L. Liberman,et al.  Breast imaging reporting and data system (BI-RADS). , 2002, Radiologic clinics of North America.

[21]  Yanlei Gu,et al.  Customer behavior classification using surveillance camera for marketing , 2017, Multimedia Tools and Applications.

[22]  Ling Shao,et al.  Cosaliency Detection Based on Intrasaliency Prior Transfer and Deep Intersaliency Mining , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Yue Gao,et al.  Zero-Shot Learning With Transferred Samples , 2017, IEEE Transactions on Image Processing.

[24]  Fabrizio Luccio,et al.  A technique for graph embedding with constraints on node and arc correspondences , 1973, Inf. Sci..

[25]  Qinghua Huang,et al.  Breast ultrasound image segmentation: a survey , 2017, International Journal of Computer Assisted Radiology and Surgery.

[26]  Wei Wang,et al.  Fourier irregularity index: A new approach to measure tumor mass irregularity in breast mammogram images , 2013, Multimedia Tools and Applications.

[27]  Markus Hahn,et al.  Early detection of breast cancer: benefits and risks of supplemental breast ultrasound in asymptomatic women with mammographically dense breast tissue. A systematic review , 2009, BMC Cancer.

[28]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[29]  Yuanyuan Wang,et al.  Computer-Aided Classification of Breast Tumors Using the Affinity Propagation Clustering , 2010, 2010 4th International Conference on Bioinformatics and Biomedical Engineering.

[30]  C. D'Orsi Breast Imaging Reporting and Data System (BI-RADS) , 2018 .

[31]  Qinghua Huang,et al.  Systematic Evaluation on Speckle Suppression Methods in Examination of Ultrasound Breast Images , 2016 .

[32]  Kadayanallur Mahadevan Prabusankarlal,et al.  Assessment of combined textural and morphological features for diagnosis of breast masses in ultrasound , 2015, Human-centric Computing and Information Sciences.

[33]  Jie Yang,et al.  Posterior Distribution Learning (PDL): A novel supervised learning framework using unlabeled samples to improve classification performance , 2015, Neurocomputing.

[34]  J. Baker,et al.  Breast mass lesions: computer-aided diagnosis models with mammographic and sonographic descriptors. , 2007, Radiology.

[35]  Hyunjung Shin,et al.  Research and applications: Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data , 2013, J. Am. Medical Informatics Assoc..

[36]  Xuelong Li,et al.  A case-oriented web-based training system for breast cancer diagnosis , 2018, Comput. Methods Programs Biomed..

[37]  A. Jemal,et al.  Breast cancer statistics, 2013 , 2014, CA: a cancer journal for clinicians.

[38]  Xuelong Li,et al.  Semi-Supervised Multitask Learning for Scene Recognition , 2015, IEEE Transactions on Cybernetics.