A semi-automatic indexing system for cell images

A method is described that can be used for annotating and indexing an arbitrary set of images with texts collateral to the images. The collateral texts comprise digitised texts, e.g. journal papers and newspapers in which the images appear, and digitised speech, e.g. a commentary on the contents of the images. The annotation dasiavectorpsila comprises image features and keywords in the collateral texts; our method can be used to generate both the image features and keywords automatically. Terminology extraction techniques are incorporated into the system to form a domain specific lexicon, which can then be used or help to annotate the images. Our method can be used as the basis of the autonomous learning of associations between images and their collateral descriptions, for example using Kohonen feature maps. We focus on images that show the migration and the division of cells within live systems. We show how the annotations can be collected by using the state-of-the-art speech recognition techniques that convert audio input into descriptive text on cell migration. A system based on the method has been developed and has reduced the annotation time to around two minutes per image, on a set of 429 cell images - which is significantly smaller than 5 minutes for manual annotation.

[1]  Li Cheng,et al.  Component optimization for image understanding: a Bayesian approach , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Roberto Basili,et al.  Inducing Terminology for Lexical Acquisition , 1997, EMNLP.

[3]  E. Krausz,et al.  Cell-based high-content screening of small-molecule libraries. , 2007, Current opinion in chemical biology.

[4]  Khurshid Ahmad,et al.  Artificial Ontologies and Real Thoughts: Populating the Semantic Web? , 2007, AI*IA.

[5]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Frank Smadja,et al.  Retrieving Collocations from Text: Xtract , 1993, CL.

[8]  Xiaobo Zhou,et al.  A computerized cellular imaging system for high content analysis in Monastrol suppressor screens , 2006, J. Biomed. Informatics.

[9]  John R. Smith,et al.  A web-based system for collaborative annotation of large image and video collections: an evaluation and user study , 2005, MULTIMEDIA '05.

[10]  Éric Gaussier,et al.  Towards Automatic Extraction of Monolingual and Bilingual Terminology , 1994, COLING.

[11]  Henry Lieberman,et al.  Aria: an agent for annotating and retrieving images , 2001, Computer.

[12]  Oliver Rausch,et al.  High content cellular screening. , 2006, Current opinion in chemical biology.

[13]  Khurshid Ahmad,et al.  A Cross-Modal System for Cell Migration Image Annotation and Retrieval , 2007, 2007 International Joint Conference on Neural Networks.

[14]  Khurshid Ahmad,et al.  Corpus Linguistics and Terminology Extraction , 2001 .

[15]  John Hallam,et al.  IEEE International Joint Conference on Neural Networks , 2005 .

[16]  Khurshid Ahmad,et al.  Corpus-Based Thesaurus Construction for Image Retrieval in Specialist Domains , 2003, ECIR.

[17]  SmadjaFrank Retrieving collocations from text , 1993 .

[18]  Latifur Khan,et al.  Standards for image annotation using Semantic Web , 2007, Comput. Stand. Interfaces.

[19]  Béatrice Daille,et al.  Study and Implementation of Combined Techniques for Automatic Extraction of Terminology , 1994 .

[20]  Jing Zhang,et al.  High-content screening moves to the front of the line. , 2006, Drug discovery today.

[21]  Gustavo Carneiro,et al.  Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[23]  Wei-Ying Ma,et al.  AnnoSearch: Image Auto-Annotation by Search , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).