Multiple-instance learning for breast cancer detection in mammograms

This paper describes an experimental computer-aided detection and diagnosis system for breast cancer, the most common form of cancer among women, using mammography. The system relies on the Multiple-Instance Learning (MIL) paradigm, which has proven useful for medical decision support in previous works from our team. In the proposed framework, the breasts are first partitioned adaptively into regions. Then, either textural features, or features derived from the detection of masses and microcalcifications, are extracted from each region. Finally, feature vectors extracted from each region are combined using an MIL algorithm (Citation k-NN or mi-Graph), in order to recognize “normal” mammography examinations or to categorize examinations as “normal”, “benign” or “cancer”. An accuracy of 91.1% (respectively 62.1%) was achieved for normality recognition (respectively three-class categorization) in a subset of 720 mammograms from the DDSM dataset. The paper also discusses future improvements, that will make the most of the MIL paradigm, in order to improve “benign” versus “cancer” discrimination in particular.

[1]  Jaume Amores,et al.  Multiple instance classification: Review, taxonomy and comparative study , 2013, Artif. Intell..

[2]  Zhi-Hua Zhou,et al.  Multi-instance learning by treating instances as non-I.I.D. samples , 2008, ICML '09.

[3]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[4]  R. Bharat Rao,et al.  Multiple-Instance Learning Improves CAD Detection of Masses in Digital Mammography , 2008, Digital Mammography / IWDM.

[5]  Jun Wang,et al.  Solving the Multiple-Instance Problem: A Lazy Learning Approach , 2000, ICML.

[6]  Alexander Zelinsky,et al.  Fast Radial Symmetry for Detecting Points of Interest , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[8]  C. K. Chua,et al.  Computer-Aided Breast Cancer Detection Using Mammograms: A Review , 2013, IEEE Reviews in Biomedical Engineering.

[9]  Mary M. Galloway,et al.  Texture analysis using gray level run lengths , 1974 .

[10]  Richard H. Moore,et al.  THE DIGITAL DATABASE FOR SCREENING MAMMOGRAPHY , 2007 .

[11]  A. Ramli,et al.  Computer-aided detection/diagnosis of breast cancer in mammography and ultrasound: a review. , 2013, Clinical imaging.

[12]  Calvin R. Maurer,et al.  A Linear Time Algorithm for Computing Exact Euclidean Distance Transforms of Binary Images in Arbitrary Dimensions , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Gwénolé Quellec,et al.  Real-Time Task Recognition in Cataract Surgery Videos Using Adaptive Spatiotemporal Polynomials , 2015, IEEE Transactions on Medical Imaging.

[14]  Gwénolé Quellec,et al.  A multiple-instance learning framework for diabetic retinopathy screening , 2012, Medical Image Anal..

[15]  Byung-Woo Hong,et al.  Segmentation of Regions of Interest in Mammograms in a Topographic Approach , 2010, IEEE Transactions on Information Technology in Biomedicine.

[16]  Lei Zhang,et al.  Mammogram microcalcification cluster detection by locating key instances in a Multi-Instance Learning framework , 2012, 2012 IEEE International Conference on Signal Processing, Communication and Computing (ICSPCC 2012).

[17]  Melih Kandemir,et al.  Computer-aided diagnosis from weak supervision: A benchmarking study , 2015, Comput. Medical Imaging Graph..

[18]  Wei Liu,et al.  Multi-instance learning for mass retrieval in digitized mammograms , 2012, Medical Imaging.