Exploring use of images in clinical articles for decision support in evidence-based medicine

Essential information is often conveyed pictorially (images, illustrations, graphs, charts, etc.) in biomedical publications. A clinician's decision to access the full text when searching for evidence in support of a clinical decision is frequently based solely on a short bibliographic reference. We seek to automatically augment these references with images from the article that may assist in finding evidence. In a previous study, we explored the feasibility of automatically classifying images by their usefulness (utility) in finding evidence. Using supervised machine learning, that study achieved 84.3% accuracy for modality using image captions, and 76.6% accuracy for utility combining captions and image data, on 743 images drawn from two years of articles in a clinical journal. Those results indicated that automatic augmentation of bibliographic references with relevant images was feasible. Other research in this area has found that showing images in addition to the short bibliographic reference improves the user experience. However, the multi-panel images used in our study had to be manually pre-processed for image analysis, and all text appearing on the figures was ignored. In this article, we report on methods developed for automatic multi-panel image segmentation that use not only image features but also clues from text analysis applied to figure captions. In initial experiments on 516 figure images, we obtained 95.54% accuracy in correctly identifying and segmenting the sub-images. The errors were flagged as disagreements with the automatic parsing of figure caption text, allowing for supervised segmentation. For localizing text and symbols, on a randomly selected test set of 100 single-panel images, our methods achieved, on average, precision and recall of 78.42% and 89.38%, respectively, with an accuracy of 72.02%.
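To illustrate how a caption can act as a check on image-based segmentation, the following minimal Python sketch counts panel labels in a caption and flags a figure for review when that count disagrees with the number of panels found by segmentation. It is not the method used in this work: the label pattern (single letters in parentheses, such as "(a)"), the function names, and the rule that a caption without labels implies a single panel are all assumptions made for illustration.

```python
import re

# Assumed label convention: a single letter in parentheses, e.g. "(a)" or "(B)".
# Real captions use many other styles (ranges like "(a-c)", "A.", "left/right"),
# which this sketch deliberately does not handle.
PANEL_LABEL = re.compile(r"\(([A-Za-z])\)")


def caption_panel_labels(caption):
    """Return the distinct panel labels found in the caption, in order of appearance."""
    labels = []
    for match in PANEL_LABEL.finditer(caption):
        label = match.group(1).lower()
        if label not in labels:
            labels.append(label)
    return labels


def expected_panel_count(caption):
    """Estimate the number of panels; a caption with no labels is assumed to be one panel."""
    return max(len(caption_panel_labels(caption)), 1)


def needs_review(caption, segmented_panel_count):
    """Flag a figure when the caption and the image segmentation disagree on panel count."""
    return expected_panel_count(caption) != segmented_panel_count


if __name__ == "__main__":
    caption = "CT scan (a) before and (b) after treatment; (c) histology."
    print(caption_panel_labels(caption))
    print(needs_review(caption, segmented_panel_count=2))
```

A figure flagged in this way would be routed to a human for supervised segmentation rather than being accepted automatically.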
