论文信息 - Annotating image ROIs with text descriptions for multimodal biomedical document retrieval

Annotating image ROIs with text descriptions for multimodal biomedical document retrieval

Regions of interest (ROIs) that are pointed to by overlaid markers (arrows, asterisks, etc.) in biomedical images are expected to contain more important and relevant information than other regions for biomedical article indexing and retrieval. We have developed several algorithms that localize and extract the ROIs by recognizing markers on images. Cropped ROIs then need to be annotated with contents describing them best. In most cases accurate textual descriptions of the ROIs can be found from figure captions, and these need to be combined with image ROIs for annotation. The annotated ROIs can then be used to, for example, train classifiers that separate ROIs into known categories (medical concepts), or to build visual ontologies, for indexing and retrieval of biomedical articles. We propose an algorithm that pairs visual and textual ROIs that are extracted from images and figure captions, respectively. This algorithm based on dynamic time warping (DTW) clusters recognized pointers into groups, each of which contains pointers with identical visual properties (shape, size, color, etc.). Then a rule-based matching algorithm finds the best matching group for each textual ROI mention. Our method yields a precision and recall of 96% and 79%, respectively, when ground truth textual ROI data is used.

Daekeun You | George R. Thoma | Dina Demner-Fushman | Sameer K. Antani | Matthew S. Simpson

[1] Daekeun You,et al. Towards the Creation of a Visual Ontology of Biomedical Imaging Entities , 2012, AMIA.

[2] Dina Demner-Fushman,et al. Biomedical Text Mining: A Survey of Recent Progress , 2012, Mining Text Data.

[3] Henning Müller,et al. Overview of the CLEF 2009 Medical Image Retrieval Track , 2009, CLEF.

[4] V A Partap,et al. The comet tail sign. , 1999, Radiology.

[5] Thomas Martin Deserno,et al. Ontology of Gaps in Content-Based Image Retrieval , 2009, Journal of Digital Imaging.

[6] Venu Govindaraju,et al. Biomedical article retrieval using multimodal features and image annotations in region-based CBIR , 2010, Electronic Imaging.

[7] Venu Govindaraju,et al. Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval , 2011, Electronic Imaging.

[8] David G Milne,et al. Thin-section CT in obstructive pulmonary disease: discriminatory value. , 2002, Radiology.

[9] Daekeun You,et al. Figure content analysis for improved biomedical article retrieval , 2009, Electronic Imaging.

[10] M Ando,et al. Eosinophilic lung diseases: diagnostic accuracy of thin-section CT in 111 patients. , 2000, Radiology.

[11] Kyung Soo Lee,et al. T1 non-small cell lung cancer: imaging and histopathologic findings and their prognostic implications. , 2004, Radiographics : a review publication of the Radiological Society of North America, Inc.