Image retrieval from scientific publications: Text and image content processing to separate multipanel figures

Images contained in scientific publications are widely considered useful for educational and research purposes, and their accurate indexing is critical for efficient and effective retrieval. Such image retrieval is complicated by the fact that figures in the scientific literature often combine multiple individual subfigures (panels). Multipanel figures are in fact the predominant pattern in certain types of scientific publications. The goal of this work is to automatically segment multipanel figures—a necessary step for automatic semantic indexing and in the development of image retrieval systems targeting the scientific literature. We have developed a method that uses the image content as well as the associated figure caption to: (1) automatically detect panel boundaries; (2) detect panel labels in the images and convert them to text; and (3) detect the labels and textual descriptions of each panel within the captions. Our approach combines the output of image‐content and text‐based processing steps to split the multipanel figures into individual subfigures and assign to each subfigure its corresponding section of the caption. The developed system achieved precision of 81% and recall of 73% on the task of automatic segmentation of multipanel figures.

[1]  Hong Yu,et al.  Towards Answering Biological Questions with Experimental Evidence: Automatically Identifying Text that Summarize Image Content in Full-Text Articles , 2006, AMIA.

[2]  Carey Phillips,et al.  The Zebrafish DVD Exchange Project: a bioinformatics initiative. , 2004, Methods in cell biology.

[3]  R. Joe Stanley,et al.  Automatic segmentation of subfigure image panels for multimodal biomedical document retrieval , 2011, Electronic Imaging.

[4]  Lawrence H. Staib,et al.  The image processing handbook, 2nd edition J. C. Russ , 1998, Journal of nuclear cardiology : official publication of the American Society of Nuclear Cardiology.

[5]  Venu Govindaraju,et al.  Biomedical article retrieval using multimodal features and image annotations in region-based CBIR , 2010, Electronic Imaging.

[6]  George R. Thoma,et al.  Design and Development of a Multimodal Biomedical Information Retrieval System , 2012, J. Comput. Sci. Eng..

[7]  H. A. Lingstone,et al.  The Delphi Method: Techniques and Applications , 1976 .

[8]  Stan Z. Li Markov Random Field Modeling in Image Analysis , 2009, Advances in Pattern Recognition.

[9]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Henning Müller,et al.  Overview of the CLEF 2009 Medical Image Retrieval Track , 2009, CLEF.

[11]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[12]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[13]  Carol Tenopir,et al.  Finding and using journal-article components: Impacts of disaggregation on teaching and research practice , 2008, J. Assoc. Inf. Sci. Technol..

[14]  John C. Russ,et al.  The image processing handbook (3. ed.) , 1995 .

[15]  J. Crisp,et al.  The Delphi method? , 1997, Nursing research.

[16]  Dina Demner-Fushman,et al.  Evaluating the Importance of Image-related Text for Ad-hoc and Case-based Biomedical Article Retrieval. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[17]  Marti A. Hearst,et al.  Full Text and Figure Display Improves Bioscience Literature Search , 2010, PloS one.

[18]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[19]  C. Wagner-Mann,et al.  If a picture is worth a thousand words, what is a trauma computerized tomography panel worth? , 2007, American journal of surgery.

[20]  Michael Krauthammer,et al.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images , 2008, Bioinform..

[21]  William W. Cohen,et al.  Understanding captions in biomedical publications , 2003, KDD '03.

[22]  Venu Govindaraju,et al.  Detecting Figure-Panel Labels in Medical Journal Articles Using MRF , 2011, 2011 International Conference on Document Analysis and Recognition.

[23]  Dina Demner-Fushman,et al.  Biomedical Text Mining: A Survey of Recent Progress , 2012, Mining Text Data.

[24]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.