Latent Dirichlet Allocation for Spatial Analysis of Satellite Images

This paper describes research that seeks to supersede human inductive learning and reasoning in high-level scene understanding and content extraction. Searching for relevant knowledge with a semantic meaning consists mostly in visual human inspection of the data, regardless of the application. The method presented in this paper is an innovation in the field of information retrieval. It aims to discover latent semantic classes containing pairs of objects characterized by a certain spatial positioning. A hierarchical structure is recommended for the image content. This approach is based on a method initially developed for topics discovery in text, applied this time to invariant descriptors of image region or objects configurations. First, invariant spatial signatures are computed for pairs of objects, based on a measure of their interaction, as attributes for describing spatial arrangements inside the scene. Spatial visual words are then defined through a simple classification, extracting new patterns of similar object configurations. Further, the scene is modeled according to these new patterns (spatial visual words) using the latent Dirichlet allocation model into a finite mixture over an underlying set of topics. In the end, some statistics are done to achieve a better understanding of the spatial distributions inside the discovered semantic classes.

[1]  Mihai Datcu,et al.  Human-centered concepts for exploration and understanding of Earth observation images , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Tao Mei,et al.  Contextual Bag-of-Words for Visual Categorization , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[4]  Selim Aksoy,et al.  Learning bayesian classifiers for scene classification with a visual grammar , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Jiebo Luo,et al.  Scene Parsing Using Region-Based Generative Models , 2007, IEEE Transactions on Multimedia.

[6]  Mihai Datcu,et al.  Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[7]  David A. Landgrebe,et al.  Signal Theory Methods in Multispectral Remote Sensing , 2003 .

[8]  Mihai Datcu,et al.  Interactive learning and probabilistic retrieval in remote sensing image archives , 2000, IEEE Trans. Geosci. Remote. Sens..

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  Dmitri V. Kalashnikov,et al.  Toward Managing Uncertain Spatial Information for Situational Awareness Applications , 2008, IEEE Transactions on Knowledge and Data Engineering.

[11]  Jason A. Duan,et al.  Generalized spatial dirichlet process models , 2007 .

[12]  Antonio Torralba,et al.  Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.

[13]  Mihai Datcu,et al.  Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation , 2010, IEEE Geoscience and Remote Sensing Letters.

[14]  Laurent Wendling,et al.  A New Way to Represent the Relative Position between Areal Objects , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  P. Matsakis,et al.  The use of force histograms for affine-invariant relative position description , 2004 .

[16]  Andrew U. Frank,et al.  Topology in Raster and Vector Representation , 2000, GeoInformatica.

[17]  Chi-Ren Shyu,et al.  GeoIRIS: Geospatial Information Retrieval and Indexing System—Content Mining, Semantics Modeling, and Complex Queries , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Aidong Zhang,et al.  SemQuery: Semantic Clustering and Querying on Heterogeneous Features for Visual Data , 2002, IEEE Trans. Knowl. Data Eng..