An iterated Laplacian based semi-supervised dimensionality reduction for classification of breast cancer on ultrasound images

The dimensionality reduction is an important step in ultrasound image based computer-aided diagnosis (CAD) for breast cancer. A newly proposed l2,1 regularized correntropy algorithm for robust feature selection (CRFS) has achieved good performance for noise corrupted data. Therefore, it has the potential to reduce the dimensions of ultrasound image features. However, in clinical practice, the collection of labeled instances is usually expensive and time costing, while it is relatively easy to acquire the unlabeled or undetermined instances. Therefore, the semi-supervised learning is very suitable for clinical CAD. The iterated Laplacian regularization (Iter-LR) is a new regularization method, which has been proved to outperform the traditional graph Laplacian regularization in semi-supervised classification and ranking. In this study, to augment the classification accuracy of the breast ultrasound CAD based on texture feature, we propose an Iter-LR-based semi-supervised CRFS (Iter-LR-CRFS) algorithm, and then apply it to reduce the feature dimensions of ultrasound images for breast CAD. We compared the Iter-LR-CRFS with LR-CRFS, original supervised CRFS, and principal component analysis. The experimental results indicate that the proposed Iter-LR-CRFS significantly outperforms all other algorithms.

[1]  Yang Yu,et al.  Breast Tissue Image Classification Based on Semi-supervised Locality Discriminant Projection with Kernels , 2011, Journal of Medical Systems.

[2]  A. Jemal,et al.  Cancer statistics, 2014 , 2014, CA: a cancer journal for clinicians.

[3]  Ling Zhang,et al.  Automated breast cancer detection and classification using ultrasound images: A survey , 2015, Pattern Recognit..

[4]  Tieniu Tan,et al.  Half-Quadratic-Based Iterative Minimization for Robust Sparse Representation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[6]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[7]  Jie Zhu,et al.  Shearlet-based texture feature extraction for classification of breast tumor in ultrasound image , 2013, Biomed. Signal Process. Control..

[8]  Xiaoming Liu,et al.  Mass classification in mammogram with semi-supervised relief based feature selection , 2014, International Conference on Graphic and Image Processing.

[9]  Hyunjung Shin,et al.  Research and applications: Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data , 2013, J. Am. Medical Informatics Assoc..

[10]  Tieniu Tan,et al.  l2, 1 Regularized correntropy for robust feature selection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  E. Conant,et al.  A Review of Breast Ultrasound , 2006, Journal of Mammary Gland Biology and Neoplasia.

[12]  Mikhail Belkin,et al.  Semi-supervised Learning by Higher Order Regularization , 2011, AISTATS.

[13]  Mikhail Belkin,et al.  An iterated graph laplacian approach for ranking on manifolds , 2011, KDD.

[14]  Nathan Srebro,et al.  Statistical Analysis of Semi-Supervised Learning: The Limit of Infinite Unlabelled Data , 2009, NIPS.

[15]  Hyunjung Shin,et al.  A Hybrid Cancer Prognosis System Based on Semi-Supervised Learning and Decision Trees , 2013, ICONIP.

[16]  Weifeng Liu,et al.  Correntropy: Properties and Applications in Non-Gaussian Signal Processing , 2007, IEEE Transactions on Signal Processing.