Improving the ranking quality of medical image retrieval using a genetic feature selection method

In this paper, we use single-valued functions that evaluate rankings to develop a family of genetic-algorithm-based feature selection methods tailored to improve the accuracy of content-based image retrieval systems. Experiments on three image datasets, comprising images of breast and lung nodules, showed that fitness functions designed to evaluate ranking quality improve retrieval performance. This approach produces significantly better results than other fitness function approaches, such as the traditional wrapper, and than filter feature selection algorithms.
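As an illustration of the general idea, and not the authors' implementation, the following is a minimal sketch of genetic feature selection driven by a ranking-quality fitness. Everything specific in it is an assumption made for the example: mean average precision stands in for the single-valued ranking evaluation function, similarity is squared Euclidean distance, queries are leave-one-out, and the function names and genetic algorithm settings (population size, tournament selection, one-point crossover, mutation rate) are hypothetical.

```python
import random


def average_precision(ranked_labels, query_label):
    """Average precision of one ranking: mean precision at each relevant hit."""
    hits, total = 0, 0.0
    for rank, label in enumerate(ranked_labels, start=1):
        if label == query_label:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def ranking_fitness(mask, features, labels):
    """Mean average precision over leave-one-out queries (assumed fitness).

    Only the features whose mask bit is 1 contribute to the distance.
    """
    selected = [i for i, bit in enumerate(mask) if bit]
    if not selected:
        return 0.0  # an empty feature subset cannot rank anything
    scores = []
    for q in range(len(features)):
        def dist(j):
            return sum((features[q][i] - features[j][i]) ** 2 for i in selected)
        # Rank every other image by distance to the query image.
        order = sorted((j for j in range(len(features)) if j != q), key=dist)
        scores.append(average_precision([labels[j] for j in order], labels[q]))
    return sum(scores) / len(scores)


def select_features(features, labels, pop_size=20, generations=30,
                    p_mut=0.05, seed=0):
    """Evolve a 0/1 feature mask that maximizes ranking_fitness."""
    rng = random.Random(seed)
    n = len(features[0])  # assumes at least two features
    cache = {}

    def fitness(mask):
        key = tuple(mask)
        if key not in cache:
            cache[key] = ranking_fitness(mask, features, labels)
        return cache[key]

    # Seed the population with the full feature set, so the result is never
    # worse than using all features; the remaining masks are random.
    population = [[1] * n] + [
        [rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        next_gen = ranked[:2]  # elitism: keep the two best masks unchanged
        while len(next_gen) < pop_size:
            # Tournament selection of two parents (tournament size 3).
            parent_a = max(rng.sample(ranked, 3), key=fitness)
            parent_b = max(rng.sample(ranked, 3), key=fitness)
            cut = rng.randrange(1, n)  # one-point crossover
            child = parent_a[:cut] + parent_b[cut:]
            # Bit-flip mutation with probability p_mut per feature.
            child = [bit ^ int(rng.random() < p_mut) for bit in child]
            next_gen.append(child)
        population = next_gen
    return max(population, key=fitness)
```

In this sketch a wrapper-style alternative would differ only in the fitness function, for example by scoring each mask with a classifier's accuracy instead of the quality of the ranking it produces.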
