Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications

This paper brings a solution for bridging the gap between the results of state-of-the-art automatic classification algorithms and high semantic human-defined manually created terminology of cartographic data. Using a recent pure-spectral rule-based fully automatic classifier to define the basic 'vocabulary', we provide a hybrid method to automatically understand and describe semantic rules that link existent mapping data according to different specifications with the end-results of unsupervised computer information mining methods. Following an agreement between the learning model and the cartographic scale implied, we exploit Latent Dirichlet Allocation model (LDA) to map heterogeneous pixels with similar intermediate-level semantic meaning into land cover classes of various mapping products. By discovering the set of rules that explain semantic classes in existent vector systems, we introduce the prototype of an interactive learning loop that uses the concept of direct semantics applied on satellite imagery. We solve a big problem in generating cartographic information layers from a fully automatic classification map and demonstrate it for the typical case of Landsat images.

[1]  Dragutin Petkovic,et al.  Content-based representation and retrieval of visual media: A state-of-the-art review , 1996, Multimedia Tools and Applications.

[2]  Mihai Datcu,et al.  Human-centered concepts for exploration and understanding of Earth observation images , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Zeno Geradts,et al.  Content based information retrieval in forensic image databases. , 2002, Journal of forensic sciences.

[5]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[6]  Mihai Datcu,et al.  Knowledge-driven Information-Mining in Remote Sensing Image Archives , 2002 .

[7]  Thomas Martin Deserno,et al.  Ontology of Gaps in Content-Based Image Retrieval , 2009, Journal of Digital Imaging.

[8]  Mihai Datcu,et al.  Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation , 2010, IEEE Geoscience and Remote Sensing Letters.

[9]  Selim Aksoy,et al.  Spatial Techniques for Image Classification , 2006 .

[10]  Carla E. Brodley,et al.  Content-based image retrieval for medical imagery , 2003, SPIE Medical Imaging.

[11]  Chi-Ren Shyu,et al.  GeoIRIS: Geospatial Information Retrieval and Indexing System—Content Mining, Semantics Modeling, and Complex Queries , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Antonio Torralba,et al.  Describing Visual Scenes Using Transformed Objects and Parts , 2008, International Journal of Computer Vision.

[13]  Lorenzo Bruzzone,et al.  Automatic Spectral Rule-Based Preliminary Mapping of Calibrated Landsat TM and ETM+ Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.