FSCoMS: Feature Selection of Medical Images Based on Compactness Measure from Scatterplots

This paper proposes the method called Feature Selection based on the Compactness Measure from Scatterplots (FSCoMS) to select the best features extracted from medical images aiming at improving the effectiveness of Content-Based Image Retrieval. This feature selection algorithm consists in a compactness analysis of scatterplots to find the most relevant features providing high separability abilities. A high relevance value of a scatterplot means better predictability among of classes based on two features. We take advantage of this information to generate a ranking for features usefulness. We compared our method to two well-known feature selection methods using three real medical datasets. All of them were compared regarding the dimensionality of the final feature vector and the retrieval effectiveness measured by the precision and recall graphs. The performed experiments show that our method not only obtained the highest retrieval performance but also achieved the smallest number of demanded features (dimensionality) than the other methods analyzed.

[1]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[2]  Hermann Ney,et al.  The IRMA Project: A State of the Art Report on Content-Based Image Retrieval in Medical Applications , 2003 .

[3]  Chenn-Jung Huang,et al.  A comparative study of feature selection methods for probabilistic neural networks in cancer classification , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[4]  John P. Lewis,et al.  Eurographics/ Ieee-vgtc Symposium on Visualization 2009 Selecting Good Views of High-dimensional Data Using Class Consistency , 2022 .

[5]  Agma J. M. Traina,et al.  Mining Statistical Association Rules to Select the Most Relevant Medical Image Features , 2009, Mining Complex Data.

[6]  Tao Qin,et al.  Feature selection for ranking , 2007, SIGIR.

[7]  Sutanu Chakraborti,et al.  Information Gain Feature Selection for Ordinal Text Classification using Probability Re-distribution , 2007 .

[8]  Shan Li,et al.  Complex Zernike Moments Features for Shape-Based Image Retrieval , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[9]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[10]  Agma J. M. Traina,et al.  Statistical Association Rules and Relevance Feedback: Powerful Allies to Improve the Retrieval of Medical Images , 2006, 19th IEEE Symposium on Computer-Based Medical Systems (CBMS'06).

[11]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Thomas Martin Deserno,et al.  Integration of CBIR in radiological routine in accordance with IHE , 2009, Medical Imaging.

[13]  Agma J. M. Traina,et al.  k-Gabor: A new feature extraction method for medical images providing internal analysis , 2012, 2012 25th IEEE International Symposium on Computer-Based Medical Systems (CBMS).

[14]  Marcus A. Magnor,et al.  Quality-Based Visualization Matrices , 2009, VMV.

[15]  Alceu Ferraz Costa,et al.  Fast fractal stack: fractal analysis of computed tomography scans of the lung , 2011, MMAR '11.

[16]  Marcus A. Magnor,et al.  Perception-based visual quality measures , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[17]  Antoine Geissbühler,et al.  A Review of Content{Based Image Retrieval Systems in Medical Applications { Clinical Bene(cid:12)ts and Future Directions , 2022 .

[18]  Cyrus Shahabi,et al.  Feature subset selection and feature ranking for multivariate time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[19]  Lluís A. Belanche Muñoz,et al.  Feature selection algorithms: a survey and experimental evaluation , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[20]  Li Guo,et al.  Survey and Taxonomy of Feature Selection Algorithms in Intrusion Detection System , 2006, Inscrypt.

[21]  Igor Jurisica,et al.  The FlowVizMenu and Parallel Scatterplot Matrix: Hybrid Multidimensional Visualizations for Network Exploration , 2010, IEEE Transactions on Visualization and Computer Graphics.

[22]  Hermann Ney,et al.  Features for Image Retrieval: A Quantitative Comparison , 2004, DAGM-Symposium.

[23]  Agma J. M. Traina,et al.  Improving the ranking quality of medical image retrieval using a genetic feature selection method , 2011, Decis. Support Syst..

[24]  Sun I. Kim,et al.  Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods , 2008, Artif. Intell. Medicine.

[25]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Matthew O. Ward,et al.  XmdvTool: integrating multiple methods for visualizing multivariate data , 1994, Proceedings Visualization '94.

[27]  Aoying Zhou,et al.  An adaptive and dynamic dimensionality reduction method for high-dimensional indexing , 2007, The VLDB Journal.

[28]  Eizan Miyamoto,et al.  FAST CALCULATION OF HARALICK TEXTURE FEATURES , 2005 .

[29]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.