Multi-objective breast cancer classification by using multi-expression programming

Despite many years of research, breast cancer detection is still a difficult, but very important problem to be solved. An automatic diagnosis system could establish whether a mammography presents tumours or belongs to a healthy patient and could offer, in this way, a second opinion to a radiologist that tries to establish a diagnosis. We therefore propose a system that could contribute to lowering both the costs and the work of an imaging diagnosis centre of breast cancer and in addition to increase the trust level in that diagnosis. We present a multi-objective evolutionary approach based on Multi-Expression Programming—a linear Genetic Programming method—that could classify a mammogram starting from a raw image of the breast. The processed images are represented through Histogram of Oriented Gradients and Kernel Descriptors since these image features have been reported as being very efficient in the image recognition scientific community and they have not been applied to mammograms before. Numerical experiments are performed on freely available datasets consisting of normal and abnormal film-based and digital mammograms and show the efficiency of the proposed decision support system.

[1]  Wolfgang Banzhaf,et al.  Genetic Programming for Pedestrians , 1993, ICGA.

[2]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[3]  Peter Nordin,et al.  A compiling genetic programming system that directly manipulates the machine-code , 1994 .

[4]  Timothy Perkis,et al.  Stack-based genetic programming , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[5]  Stan Openshaw,et al.  Building New Spatial Interaction Models Using Genetic Programming , 1994 .

[6]  Ian W. Ricketts,et al.  The Mammographic Image Analysis Society digital mammogram database , 1994 .

[7]  Vidroha Debroy,et al.  Genetic Programming , 1998, Lecture Notes in Computer Science.

[8]  Bernhard Schölkopf,et al.  The Kernel Trick for Distances , 2000, NIPS.

[9]  Martin J. Oates,et al.  The Pareto Envelope-Based Selection Algorithm for Multi-objective Optimisation , 2000, PPSN.

[10]  Martijn C. J. Bot Improving Induction of Linear Classification Trees with Genetic Programming , 2000, GECCO.

[11]  Qiangfu Zhao,et al.  A Study on Efficient Generation of Decision Trees Using Genetic Programming , 2000, GECCO.

[12]  Dimitrios Kalles,et al.  Breeding Decision Trees Using Evolutionary Techniques , 2001, ICML.

[13]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[14]  S. E. Eklund,et al.  A massively parallel GP engine in VLSI , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[15]  Athanasios Tsakonas,et al.  Hierarchical classification trees using type-constrained genetic programming , 2002, Proceedings First International IEEE Symposium Intelligent Systems.

[16]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[17]  Mehryar Mohri,et al.  AUC Optimization vs. Error Rate Minimization , 2003, NIPS.

[18]  Mihai Oltean,et al.  Evolving Evolutionary Algorithms Using Multi Expression Programming , 2003, ECAL.

[19]  Xin Yao,et al.  Performance Scaling of Multi-objective Evolutionary Algorithms , 2003, EMO.

[20]  DaeEun Kim,et al.  Structural Risk Minimization on Decision Trees Using an Evolutionary Multiobjective Optimization , 2004, EuroGP.

[21]  Walter A. Kosters,et al.  Genetic Programming for data classification: partitioning the search space , 2004, SAC '04.

[22]  Xiaodong Li,et al.  Multi-objective techniques in genetic programming for evolving classifiers , 2005, 2005 IEEE Congress on Evolutionary Computation.

[23]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[24]  Ronald L. Crepeau,et al.  Genetic Evolution of Machine Language Software , 2006 .

[25]  Huimin Zhao,et al.  A multi-objective genetic programming approach to developing Pareto optimal decision trees , 2007, Decis. Support Syst..

[26]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[27]  Alex Alves Freitas,et al.  Lexicographic multi-objective evolutionary induction of decision trees , 2009, Int. J. Bio Inspired Comput..

[28]  Mark Johnston,et al.  Genetic programming for image classification with unbalanced data , 2009, 2009 24th International Conference Image and Vision Computing New Zealand.

[29]  Mark Johnston,et al.  Multi-Objective Genetic Programming for Classification with Unbalanced Data , 2009, Australasian Conference on Artificial Intelligence.

[30]  H. Nelson,et al.  Screening for Breast Cancer: An Update for the U.S. Preventive Services Task Force , 2009, Annals of Internal Medicine.

[31]  H. Nelson,et al.  Screening for Breast Cancer: Systematic Evidence Review Update for the U. S. Preventive Services Task Force , 2009 .

[32]  Francisco Herrera,et al.  A Survey on the Application of Genetic Programming to Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Dieter Fox,et al.  Kernel Descriptors for Visual Recognition , 2010, NIPS.

[34]  L. Tabár,et al.  Swedish two-county trial: impact of mammographic screening on breast cancer mortality during 3 decades. , 2011, Radiology.

[35]  Miguel Ángel Guevara-López,et al.  Discovering Mammography-based Machine Learning Classifiers for Breast Cancer Diagnosis , 2012, Journal of Medical Systems.

[36]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Decision-Tree Induction , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[37]  Christian Gagné,et al.  Multi-objective evolutionary optimization for generating ensembles of classifiers in the ROC space , 2012, GECCO '12.

[38]  Mark Johnston,et al.  Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[39]  Jacques Ferlay,et al.  GLOBOCAN 2012 v1.0, Cancer Incidence and Mortality Worldwide: IARC Cancer Base No. 11 [Internet] , 2013 .

[40]  Mark Johnston,et al.  Evolving Diverse Ensembles Using Genetic Programming for Classification With Unbalanced Data , 2013, IEEE Transactions on Evolutionary Computation.

[41]  Miguel Ángel Guevara-López,et al.  An evaluation of image descriptors combined with clinical data for breast cancer diagnosis , 2013, International Journal of Computer Assisted Radiology and Surgery.

[42]  C. Mathers,et al.  GLOBOCAN 2012 v1.0, Cancer Incidence and Mortality Worldwide: IARC CancerBase No. 11 [Internet]. Lyon, France: International Agency for Research on Cancer , 2013 .

[43]  P. Gøtzsche,et al.  Screening for breast cancer with mammography. , 2013, The Cochrane database of systematic reviews.

[44]  Xin Yao,et al.  Multiobjective genetic programming for maximizing ROC performance , 2014, Neurocomputing.