A similarity learning approach to content-based image retrieval: application to digital mammography

In this paper, we describe an approach to content-based retrieval of medical images from a database, and provide a preliminary demonstration of our approach as applied to retrieval of digital mammograms. Content-based image retrieval (CBIR) refers to the retrieval of images from a database using information derived from the images themselves, rather than solely from accompanying text indices. In the medical-imaging context, the ultimate aim of CBIR is to provide radiologists with a diagnostic aid in the form of a display of relevant past cases, along with proven pathology and other suitable information. CBIR may also be useful as a training tool for medical students and residents. The goal of information retrieval is to recall from a database information that is relevant to the user's query. The most challenging aspect of CBIR is the definition of relevance (similarity), which is used to guide the retrieval machine. In this paper, we pursue a new approach, in which similarity is learned from training examples provided by human observers. Specifically, we explore the use of neural networks and support vector machines to predict the user's notion of similarity. Within this framework we propose a hierarchical learning approach, which consists of a cascade of a binary classifier and a regression module, to optimize retrieval effectiveness and efficiency. We also explore how to incorporate online human interaction to achieve relevance feedback in this learning framework. Our experiments are based on a database consisting of 76 mammograms, all of which contain clustered microcalcifications (MCs). Our goal is to retrieve mammogram images containing MC clusters similar to that in a query. The performance of the retrieval system is evaluated using precision-recall curves computed with a cross-validation procedure.
Our experimental results demonstrate that: 1) the learning framework can accurately predict the perceptual similarity reported by human observers, thereby serving as a basis for CBIR; 2) the learning-based framework can significantly outperform a simple distance-based similarity metric; 3) the use of the hierarchical two-stage network can improve retrieval performance; 4) relevance feedback can be effectively incorporated into this learning framework to improve retrieval precision based on online interaction with users; and 5) the images retrieved by the network can have predictive value for the disease condition of the query.
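
The two-stage cascade and the precision-recall evaluation described above can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: `is_candidate` and `similarity` are toy stand-ins based on Euclidean distance, not the trained classifier or regression module from the paper, and the feature vectors, threshold, and relevance labels are invented. The sketch only shows how a binary first stage can prune the database before a second stage scores the survivors, and how precision and recall are computed along the resulting ranking.

```python
import math
from typing import Callable, List, Sequence, Set, Tuple

Features = Sequence[float]


def cascade_retrieve(
    query: Features,
    database: Sequence[Features],
    is_candidate: Callable[[Features, Features], bool],
    similarity: Callable[[Features, Features], float],
    top_k: int,
) -> List[int]:
    """Return indices of the top_k database entries for a query.

    Stage 1 (binary classifier) discards entries judged dissimilar;
    stage 2 (regression) scores only the entries that survive stage 1.
    """
    scored = []
    for idx, item in enumerate(database):
        if is_candidate(query, item):
            scored.append((similarity(query, item), idx))
    # Highest similarity first; ties are broken by database index.
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [idx for _, idx in scored[:top_k]]


def precision_recall(ranked: Sequence[int], relevant: Set[int]) -> List[Tuple[float, float]]:
    """Return (precision, recall) after each position in the ranked list."""
    if not relevant:
        raise ValueError("at least one relevant item is required")
    hits = 0
    points = []
    for n, idx in enumerate(ranked, start=1):
        if idx in relevant:
            hits += 1
        points.append((hits / n, hits / len(relevant)))
    return points


# Hypothetical stand-ins for the learned models (assumptions, not the paper's).
def is_candidate(q: Features, x: Features) -> bool:
    return math.dist(q, x) < 2.0  # toy threshold in place of a trained classifier


def similarity(q: Features, x: Features) -> float:
    return 1.0 / (1.0 + math.dist(q, x))  # toy score in place of a trained regressor


# Invented two-dimensional feature vectors and relevance labels.
database = [(0.0, 0.0), (1.0, 1.0), (0.1, 0.2), (5.0, 5.0), (0.3, 0.1)]
query = (0.0, 0.1)
ranked = cascade_retrieve(query, database, is_candidate, similarity, top_k=3)
points = precision_recall(ranked, relevant={0, 4})
```

In this toy setup the first stage removes the distant entry at index 3 before any similarity score is computed, which is the efficiency argument for the cascade; a real system would replace both stand-in functions with models trained on observer similarity scores.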
