Developing a comprehensive system for content-based retrieval of image and text data from a national survey

This article describes the status of our ongoing R&D at the U.S. National Library of Medicine (NLM) toward an advanced multimedia biomedical information system that supports content-based image retrieval (CBIR). NLM maintains a collection of 17,000 digitized spinal X-rays along with text survey data from the Second National Health and Nutrition Examination Survey (NHANES II). These data are a rich source for epidemiologists and researchers of osteoarthritis and musculoskeletal diseases. They can currently be accessed through text keyword queries using our Web-based Medical Information Retrieval System (WebMIRS). CBIR methods developed specifically for biomedical images could offer direct visual searching of these images by example image or user sketch. We are building a system that supports hybrid queries with both text and image-content components. R&D goals include developing robust image segmentation algorithms that localize and identify relevant anatomy, labeling the segmented anatomy according to its pathology, developing suitable indexing and similarity matching methods for images and image features, and associating the survey text information with the image data for query and retrieval. Highlights of the system, developed in MATLAB and Java, are: use of a networked or local centralized database for text and image data; flexibility to incorporate new research work; a means to control access to system components under development; and use of XML for structured reporting. The article details the design, features, and algorithms of CBIR3, the third revision of this prototype system.
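As a rough illustration of what a hybrid query with text and image-content components might look like, the following minimal Python sketch first filters records on survey text fields and then ranks the survivors by distance between precomputed shape feature vectors. It is not the CBIR3 implementation (which the abstract says is written in MATLAB and Java); the `Record` type, the `hybrid_query` function, the survey field names, the feature values, and the choice of Euclidean distance are all assumptions made for illustration.

```python
from dataclasses import dataclass
from math import dist
from typing import Callable, Dict, List, Tuple


@dataclass
class Record:
    """One image with its survey text and a shape feature vector (hypothetical layout)."""
    image_id: str
    survey: Dict[str, object]      # text survey fields; field names are assumed
    shape: Tuple[float, ...]       # precomputed shape features; values are made up


def hybrid_query(
    records: List[Record],
    text_filter: Callable[[Dict[str, object]], bool],
    query_shape: Tuple[float, ...],
    top_k: int = 3,
) -> List[Record]:
    """Keep records whose survey text passes the filter, then rank by shape distance."""
    # Text component: discard records that fail the survey predicate.
    candidates = [r for r in records if text_filter(r.survey)]
    # Image-content component: smallest Euclidean distance to the query shape first.
    ranked = sorted(candidates, key=lambda r: dist(r.shape, query_shape))
    return ranked[:top_k]


# Toy data standing in for survey records and vertebra shape features.
records = [
    Record("img001", {"age": 62, "sex": "F"}, (0.10, 0.80, 0.30)),
    Record("img002", {"age": 45, "sex": "M"}, (0.12, 0.79, 0.31)),
    Record("img003", {"age": 70, "sex": "F"}, (0.90, 0.20, 0.50)),
    Record("img004", {"age": 66, "sex": "F"}, (0.15, 0.70, 0.35)),
]

# Query: women aged 60 or older whose shape is closest to the example shape.
results = hybrid_query(
    records,
    text_filter=lambda s: s["sex"] == "F" and s["age"] >= 60,
    query_shape=(0.11, 0.80, 0.30),
    top_k=2,
)
result_ids = [r.image_id for r in results]
print(result_ids)
```

In this sketch the text filter runs before the similarity ranking so that the distance computation touches only the candidate subset; a real system could equally rank first or combine both scores, and the abstract does not say which order CBIR3 uses.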
