Benchmarking Datasets for Breast Cancer Computer-Aided Diagnosis (CADx)

Designing reliable computer-aided diagnosis CADx systems based on data extracted from breast images and patient data to provide a second opinion to radiologists is still a challenging and yet unsolved problem. This paper proposes two benchmarking datasets one of them representative of low resolution digitized Film Mammography images and the other one representative of high resolution Full Field Digital Mammography images aimed to 1 modeling and exploring machine learning classifiers MLC; 2 evaluating the impact of mammography image resolution on MLC; and 3 comparing the performance of breast cancer CADx methods. Also, we include a comparative study of four groups of image-based descriptors intensity, texture, multi-scale texture and spatial distribution of the gradient, and combine them with patient's clinical data to classify masses. Finally, we demonstrate that this combination of clinical data and image descriptors is advantageous in most CADx scenarios.

[1]  Miguel Ángel Guevara-López,et al.  A Software Framework for Building Biomedical Machine Learning Classifiers through Grid Computing Resources , 2012, Journal of Medical Systems.

[2]  Miguel Ángel Guevara-López,et al.  An evaluation of image descriptors combined with clinical data for breast cancer diagnosis , 2013, International Journal of Computer Assisted Radiology and Surgery.

[3]  Evangelos Dermatas,et al.  Fast detection of masses in computer-aided mammography , 2000, IEEE Signal Process. Mag..

[4]  Joseph Y. Lo,et al.  Self-organizing map for cluster analysis of a breast cancer database , 2003, Artif. Intell. Medicine.

[5]  Arnaldo de Albuquerque Araújo,et al.  MammoSVD: A content-based image retrieval system using a reference database of mammographies , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[6]  Konstantina S. Nikita,et al.  A web-accessible mammographic image database dedicated to combined training and evaluation of radiologists and machines , 2009, 2009 9th International Conference on Information Technology and Applications in Biomedicine.

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Arnaldo de Albuquerque Araújo,et al.  Toward a standard reference database for computer-aided mammography , 2008, SPIE Medical Imaging.

[9]  Jaime S. Cardoso,et al.  INbreast: toward a full-field digital mammographic database. , 2012, Academic radiology.

[10]  Miguel Ángel Guevara-López,et al.  Discovering Mammography-based Machine Learning Classifiers for Breast Cancer Diagnosis , 2012, Journal of Medical Systems.

[11]  Homero Schiabel,et al.  Online Mammographic Images Database for Development and Comparison of CAD Schemes , 2011, Journal of Digital Imaging.

[12]  Joel Quintanilla-Domínguez,et al.  WBCD breast cancer database classification applying artificial metaplasticity neural network , 2011, Expert Syst. Appl..

[13]  Richard H. Moore,et al.  Current Status of the Digital Database for Screening Mammography , 1998, Digital Mammography / IWDM.