Ranking evaluation functions to improve genetic feature selection in content-based image retrieval of mammograms

The ranking problem is a crucial task in the information retrieval systems. In this paper, we take advantage of single valued ranking evaluation functions in order to develop a new method of genetic feature selection tailored to improve the accuracy of content-based image retrieval systems. We propose to boost the feature selection ability of the genetic algorithms (GA) by employing an evaluation criteria (fitness function) that relies on order-based ranking evaluation functions. The evaluation criteria are provided by the GA and has been successfully employed as a measure to evaluate the efficacy of content-based image retrieval process, improving up to 22% the precision of the query answers. Experiments on three medical datasets containing breast cancer diagnosis and breast tissue density analysis showed that fitness functions based on ranking evaluation functions occupy an essential role on the algorithms' performance, obtaining results significatively better than other fitness function designs. The experiments also showed that the proposed method obtains results superior than feature selection based on the traditional decision-tree C4.5, naive bayes, support vector machine, 1-nearest neighbor and association rule mining.

[1]  Carla E. Brodley,et al.  Unsupervised Feature Selection Applied to Content-Based Retrieval of Lung Images , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Randy L. Haupt,et al.  Practical Genetic Algorithms , 1998 .

[3]  Oscar Cordón,et al.  A review on the application of evolutionary computation to information retrieval , 2003, Int. J. Approx. Reason..

[4]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Rangaraj M. Rangayyan,et al.  Content-based Retrieval of Mammograms Using Visual Features Related to Breast Density Patterns , 2007, Journal of Digital Imaging.

[6]  Yafei Zhang,et al.  Feature Selection Based on Genetic Algorithm for CBIR , 2008, 2008 Congress on Image and Signal Processing.

[7]  Christos Faloutsos,et al.  On the 'Dimensionality Curse' and the 'Self-Similarity Blessing' , 2001, IEEE Trans. Knowl. Data Eng..

[8]  Yin-Fu Huang,et al.  Evolutionary-based feature selection approaches with new criteria for data mining: A case study of credit approval data , 2009, Expert Syst. Appl..

[9]  G. Cottrell,et al.  Optimizing Similarity Using Multi-Query Relevance Feedback , 1998, J. Am. Soc. Inf. Sci..

[10]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[11]  Mykola Pechenizkiy,et al.  Sequential Genetic Search for Ensemble Feature Selection , 2005, IJCAI.

[12]  Yingtao Jiang,et al.  Selecting critical clinical features for heart diseases diagnosis with a real-coded genetic algorithm , 2008, Appl. Soft Comput..

[13]  Harris Wu,et al.  The effects of fitness functions on genetic programming-based ranking discovery for Web search: Research Articles , 2004 .

[14]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[15]  Mykola Pechenizkiy,et al.  Search strategies for ensemble feature selection in medical diagnostics , 2003, 16th IEEE Symposium Computer-Based Medical Systems, 2003. Proceedings..

[16]  Mohand Boughanem,et al.  Multiple query evaluation based on an enhanced genetic algorithm , 2003, Inf. Process. Manag..

[17]  Jorng-Tzong Horng,et al.  Applying genetic algorithms to query optimization in document retrieval , 2000, Inf. Process. Manag..

[18]  Edward A. Fox,et al.  A genetic programming framework for content-based image retrieval , 2009, Pattern Recognit..

[19]  Agma J. M. Traina,et al.  An Association Rule-Based Method to Support Medical Image Diagnosis With Efficiency , 2008, IEEE Transactions on Multimedia.

[20]  Garrison W. Cottrell,et al.  Optimizing Similarity Using Multi-Query Relevance Feedback , 1998, J. Am. Soc. Inf. Sci..

[21]  Rangaraj M. Rangayyan,et al.  Reducing the semantic gap in content-based image retrieval in mammography with relevance feedback and inclusion of expert knowledge , 2008, International Journal of Computer Assisted Radiology and Surgery.

[22]  Weiguo Fan,et al.  Genetic-based approaches in ranking function discovery and optimization in information retrieval - A framework , 2009, Decis. Support Syst..