Interactive models for semantic labeling of satellite images

We describe a system for interactive training of models for semantic labeling of land cover. The models are build based on three levels of features: 1) pixel level, 2) region level, and 3) scene level features. We developed a Bayesian algorithm and a decision tree algorithm for interactive training. The Bayesian algorithm enables training based on pixel features. The scene level summaries of pixel features are used for fast retrieval of scenes with high/low content of features and scenes with low confidence of classification. The decision tree algorithm is based on region level features that are extracted based on 1) spectral and textural characteristics of the image, 2) shape descriptors of regions that are created through segmentation process, and 3) auxiliary information such as elevation data. The initial model can be created based on a database of ground truth and than be refined based on the feedback supplied by a data analyst who interactively trains the model using the system output and/or additional scenes. The combination of supervised and unsupervised methods provides a more complete exploration of model space. A user may detect the inadequacy of the model space and add additional features to the model. The graphical tools for the exploration of decision trees allow insight into the interaction of features used in the construction of models. The preliminary experiments show that accurate models can be build in a short time for a variety of land covers. The scalable classification techniques allow for fast searches for a specific label over a large area.

[1]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[2]  Rafail E. Krichevskiy,et al.  Laplace's Law of Succession and Universal Encoding , 1998, IEEE Trans. Inf. Theory.

[3]  Mihai Datcu,et al.  Interactive learning and probabilistic retrieval in remote sensing image archives , 2000, IEEE Trans. Geosci. Remote. Sens..

[4]  Eric Sven Ristad,et al.  A natural law of succession , 1995, Proceedings. 1998 IEEE International Symposium on Information Theory (Cat. No.98CH36252).

[5]  Selim Aksoy,et al.  Probabilistic retrieval with a visual grammar , 2002, IEEE International Geoscience and Remote Sensing Symposium.

[6]  J. C. Tilton A recursive PVM implementation of an image segmentation algorithm with performance results comparing the HIVE and the Cray T3E , 1999, Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation.

[7]  B. S. Manjunath,et al.  Rotation-invariant texture classification using modified Gabor filters , 1995, Proceedings., International Conference on Image Processing.

[8]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[9]  John R. Jensen,et al.  Introductory Digital Image Processing: A Remote Sensing Perspective , 1986 .

[10]  A. Gillespie Spectral mixture analysis of multispectral thermal infrared images , 1992 .

[11]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[12]  M. Stonebraker,et al.  The Sequoia 2000 Benchmark , 1993, SIGMOD Conference.

[13]  Richard R. Muntz,et al.  Scalable Exploratory Data Mining of Distributed Geoscientific Data , 1996, KDD.

[14]  Padhraic Smyth,et al.  Image database exploration: progress and challenges , 1993 .