Applying a machine learning model using a locally preserving projection based feature regeneration algorithm to predict breast cancer risk

Both conventional and deep machine learning methods have been used to develop decision-support tools in medical imaging informatics. To take advantage of both approaches, this study investigates the feasibility of applying a locally preserving projection (LPP) based feature regeneration algorithm to build a new machine learning classifier that predicts short-term breast cancer risk. First, a computer-aided image processing scheme was used to segment and quantify breast fibro-glandular tissue volume. Next, an initial set of 44 image features related to bilateral mammographic tissue density asymmetry was computed. Then, an LPP-based feature combination method was applied to regenerate a new operational feature vector using a maximal variance approach. Last, a k-nearest neighbor (KNN) based machine learning classifier using the LPP-generated feature vectors was developed to predict breast cancer risk. A testing dataset of negative mammograms acquired from 500 women was used. Among these women, 250 were positive and 250 remained negative in the next subsequent mammography screening. Applied to this dataset, the LPP-generated feature vector reduced the number of features from 44 to 4. Using a leave-one-case-out validation method, the area under the ROC curve produced by the KNN classifier increased significantly from 0.62 to 0.68 (p < 0.05), and the odds ratio was 4.60 with a 95% confidence interval of [3.16, 6.70]. The study demonstrated that this new LPP-based feature regeneration approach can produce an optimal feature vector and yield improved performance in predicting the risk of women having breast cancer detected in the next subsequent mammography screening.
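The pipeline described above (reduce 44 features to 4 with LPP, then score each case with a KNN classifier under leave-one-case-out validation and summarize with the area under the ROC curve) can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the standard LPP formulation (a heat-kernel neighbor graph and a generalized eigenproblem) rather than the "maximal variance approach" mentioned in the abstract, the data are synthetic, and the function names (`lpp_fit`, `knn_loo_scores`, `roc_auc`), the neighbor counts, and the kernel width are all assumptions chosen for illustration. For brevity the projection is fitted once on all cases; a stricter validation would refit it inside each leave-one-out fold.

```python
import numpy as np

def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X."""
    sq = np.sum(X ** 2, axis=1)
    return np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)

def lpp_fit(X, n_components=4, n_neighbors=5, reg=1e-6):
    """Standard LPP: return a (d, n_components) projection matrix."""
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    d2 = pairwise_sq_dists(X)
    t = np.mean(d2) + 1e-12              # assumed heat-kernel width
    masked = d2.copy()
    np.fill_diagonal(masked, np.inf)     # a case is not its own neighbor
    idx = np.argsort(masked, axis=1)[:, :n_neighbors]
    rows = np.repeat(np.arange(n), n_neighbors)
    cols = idx.ravel()
    W = np.zeros((n, n))
    W[rows, cols] = np.exp(-d2[rows, cols] / t)
    W = np.maximum(W, W.T)               # symmetrize the neighbor graph
    D = np.diag(W.sum(axis=1))
    A = X.T @ (D - W) @ X                # X^T L X, with graph Laplacian L = D - W
    B = X.T @ D @ X
    B += reg * np.trace(B) / d * np.eye(d)   # small ridge for numerical stability
    # Solve A a = lambda B a by Cholesky whitening of B.
    C_inv = np.linalg.inv(np.linalg.cholesky(B))
    M = C_inv @ A @ C_inv.T
    _, vecs = np.linalg.eigh((M + M.T) / 2.0)   # eigenvalues in ascending order
    return C_inv.T @ vecs[:, :n_components]     # smallest eigenvalues preserve locality

def knn_loo_scores(Z, y, k=5):
    """Leave-one-case-out KNN score: fraction of positive labels among k neighbors."""
    d2 = pairwise_sq_dists(np.asarray(Z, dtype=float))
    np.fill_diagonal(d2, np.inf)         # exclude the held-out case itself
    nn = np.argsort(d2, axis=1)[:, :k]
    return np.asarray(y)[nn].mean(axis=1)

def roc_auc(scores, labels):
    """Area under the ROC curve via the Mann-Whitney statistic."""
    scores, labels = np.asarray(scores, dtype=float), np.asarray(labels)
    diff = scores[labels == 1][:, None] - scores[labels == 0][None, :]
    return (np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / diff.size

# Synthetic stand-in for the 44 asymmetry features (not the study data).
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 50)
X = rng.normal(size=(100, 44))
X[y == 1, :3] += 1.0                     # weak signal in three features
X = (X - X.mean(axis=0)) / X.std(axis=0) # z-score each feature

P = lpp_fit(X, n_components=4)           # 44 -> 4 features
Z = X @ P
scores = knn_loo_scores(Z, y, k=5)
print("LOO AUC on synthetic data:", roc_auc(scores, y))
```

Because the LPP projection is learned without labels, fitting it once leaks less information than a supervised selector would, but the resulting AUC on real data should still be treated as optimistic compared with a fully nested validation.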
