Breast Cancer Risk Analysis Based on a Novel Segmentation Framework for Digital Mammograms

The radiographic appearance of breast tissue has been established as a strong risk factor for breast cancer. Here we present a complete machine learning framework for automatic estimation of mammographic density (MD) and robust feature extraction for breast cancer risk analysis. Our framework simultaneously segments the breast region, fatty tissue, pectoral muscle, glandular tissue and nipple region. Integral to our method is the extraction of two measures: breast density (the fraction of the breast area occupied by glandular tissue) and mammographic pattern. A novel aspect of the segmentation framework is that it provides a probability map alongside the label mask, indicating for each pixel the confidence with which it was assigned its label. The Pearson correlation coefficient between the estimated MD value and the ground truth is 0.8012 (p < 0.0001). We demonstrate the capability of our methods to discriminate between women with and without cancer by analyzing the contralateral mammograms of 50 women with unilateral breast cancer and 50 controls. Using MD, we obtained an area under the ROC curve (AUC) of 0.61; our texture-based measure of mammographic pattern significantly outperformed MD, with an AUC of 0.70.
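
To make the two quantities reported above concrete, the following is a minimal sketch, not the authors' implementation. It assumes a label mask stored as a list of rows with hypothetical integer codes, and it assumes that "breast area" means fatty plus glandular pixels; neither detail is specified in the abstract. The AUC is computed by its rank interpretation: the probability that a randomly chosen case scores higher than a randomly chosen control.

```python
from typing import Sequence

# Hypothetical label codes; the abstract does not specify an encoding.
BACKGROUND, FATTY, GLANDULAR, PECTORAL, NIPPLE = 0, 1, 2, 3, 4


def mammographic_density(label_mask: Sequence[Sequence[int]]) -> float:
    """Fraction of the breast area labelled as glandular tissue.

    Assumption: breast area = fatty + glandular pixels.
    """
    rows = [list(row) for row in label_mask]
    glandular = sum(row.count(GLANDULAR) for row in rows)
    fatty = sum(row.count(FATTY) for row in rows)
    breast_area = glandular + fatty
    if breast_area == 0:
        raise ValueError("label mask contains no breast tissue pixels")
    return glandular / breast_area


def auc(case_scores: Sequence[float], control_scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (case, control) pairs in which the case
    has the higher score; ties count as half a win.
    """
    if not case_scores or not control_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Toy data for illustration only; these are not values from the study.
toy_mask = [
    [BACKGROUND, FATTY, FATTY, GLANDULAR],
    [BACKGROUND, FATTY, GLANDULAR, GLANDULAR],
    [PECTORAL, PECTORAL, FATTY, GLANDULAR],
]
toy_md = mammographic_density(toy_mask)
toy_auc = auc([0.5, 0.3, 0.4], [0.2, 0.4, 0.1])
```

An AUC of 0.5 corresponds to chance-level discrimination, which is the baseline against which the reported values of 0.61 and 0.70 should be read.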
